• Entries from ENA and GenBank during a specific period are not being reflected in getentry

DDBJ Annotated/Assembled Sequences

  • Home
  • Submission
    • Before Submission
    • Web submission
    • Mass Submission
    • Data Update
  • Search
    • getentry
    • ARSA
  • Flat file
    • Feature Table
    • Feature key
    • Qualifier key
    • Nucleotide Sequences
    • Organism qualifier
    • Identifiers
    • Description of Location
    • Protein Coding Sequence
    • The Genetic Codes
    • Codes Used in Sequence Description
    • Description Examples of Sequence Data
  • Data categories
    • Data Submission from Genome Project
    • Pseudohaplotype
    • WGS
    • Finished level genomic sequences
    • Metagenome Assembly
    • Single amplified genome
    • HTG
    • Environmental sample
    • ENV
    • TLS
    • Data Submission from Transcriptome Project
    • TSA
    • EST
    • HTC
    • Third Party Data (TPA)
  • FAQ
  • Other
    • Patent
    • MGA
  • Home
  • ddbj
  • TLS

TLS

Since 2016, INSDC has accepted sequence data including 16S rRNA or some other targeted loci mainly to be clustered into operational taxonomic unit as Targeted Locus Study (TLS) data type.

With regard to comprehensive analysis of marker sequences by using NGS, TLS data submission is not required in many cases, because it is reproducible if the original reads are submitted to DDBJ Read Archive.
So, please consider if you have to submit TLS data to DDBJ or not.

See the list of publicized TLS data.

You can submit TSA data to DDBJ through Mass Submission System (MSS).

Notes on the TLS data submission
  • Prior to TLS data submission, it is required to submit to BioProject Database and BioSample Database.
  • It is strongly recommended that the TLS data submission with the original sequence data of primary sequences are classified into DDBJ Read Archive.
  • Remove low quality reads and chimeric sequences before submission.

Sample flat file

Aspects of TLS

  • Basically, each TLS sequence submitted to DDBJ is assigned an accession number that consists of 4 alphabet characters and 8 digits.
  • “TLS:” is shown at the beginning of DEFINITION line.
  • “TLS” and “Targeted Locus Study” are indicated in KEYWORDS line.
LOCUS       TZZZ01000001             800 bp   mRNA     linear   TLS 15-NOV-2017
DEFINITION  TLS: Uncultured bacterium OTU:MS213 gene for 16S ribosomal RNA, 
            partial sequence.
ACCESSION   TZZZ01000001
VERSION     TZZZ01000001.1
DBLINK      BioProject:PRJDA43211
            Sequence Read Archive: DRR800001
            BioSample: SAMD98765439
KEYWORDS    TLS; Targeted Locus Study; ENV.
SOURCE      uncultured bacterium
  ORGANISM  uncultured bacterium
            Bacteria; environmental samples.
REFERENCE   1  (bases 1 to 800)
  AUTHORS   Mishima,H. and Shizuoka,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-SEP-2017) to the DDBJ/EMBL/GenBank databases.
            Contact:Hanako Mishima
            National Institute of Genetics, DNA Data Bank of Japan; Yata 1111,
            Mishima, Shizuoka 411-8540, Japan
REFERENCE   2  
  AUTHORS   Mishima,H., Shizuoka,T. and Fuji,I.
  TITLE     Metagenomic Taxonomy Profile in Sea Water
  JOURNAL   TLS Biol 15, 161-170 (2017)
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: FLASH v. 2015
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..301
                     /altitude="-20 m"
                     /collection_date="2007"
                     /db_xref="taxon:77133"
                     /environmental_sample
                     /geo_loc_name="Pacific Ocean"
                     /isolation_source="marine water"
                     /lat_lon="29.3116 N 148.6778 E"
                     /mol_type="genomic DNA"
                     /organism="uncultured bacterium"
                     /submitter_seqid="OTU:MS213"
     rRNA            <1..>301
                     /product="16S ribosomal RNA"
BASE COUNT           75 a           59 c          102 g           65 t
ORIGIN      
        1 cagtcgccgc gggaatacgg agggggctag cgttgttcgg aattactggg cgtaaagcgc
        :
        -- The rest of nucleotide sequence is omitted --
        :
//

Related pages

  • Data Submission from Genome Project
  • Submission of environmental sequences
  • Data Submission from Transcriptome Project
  • Third Party Data (TPA)