Since 2008, INSDC (DDBJ/EMBL-Bank/GenBank) has accepted the sequence data of "Transcriptome Shotgun Assembly (TSA)" categorised into TSA division for assembled cDNA sequences.
With new sequencing technologies, INSDC has faced many requests to accept assembled EST sequences. These sequence data have become more useful than used to be, although they may not be correctly assembled or exist in nature. Therefore, DDBJ/EMBL-Bank/GenBank decided to collect assembled EST sequences and classified them into the TSA division.
Note that it is required that the TSA submission with the original sequence data of primary transcripts (primary entries) are classified into the EST division of DDBJ/EMBL-Bank/GenBank, DDBJ Trace Archive or DDBJ Read Archive.
If a primary entry belonged to the submitter who is other than TSA submitter, the TSA entry is classified into TPA category.
You can submit TSA data to DDBJ through Mass Submission System (MSS).
Primary entries used to build a TSA sequence are mRNA sequences that have been experimentally determined by their submitters and are publicly available on INSDC, Trace Archive, or Sequence Read Archive.
It is possible that primary entries are not yet publicized at the TSA submission. However, the primary entries must be publicized by when the corresponding TSA entry is open to the public.
[1] the allowable percent of bases that are 'n' should be less than 5% and
[2] a TSA entry can have a stretch of no more than 5 n' s in a row
LOCUS FS000000 800 bp mRNA linear TSA 15-OCT-2008 DEFINITION TSA: Homo sapiens GAPD gene for glyceraldehyde-3-phosphate dehydrogenase, complete cds. ACCESSION FS000000 VERSION FS000000.1 DBLINK BioProject:PRJDA43210 KEYWORDS TSA; Transcriptome Shotgun Assembly. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 800) AUTHORS Mishima,H. and Shizuoka,T. TITLE Direct Submission JOURNAL Submitted (30-SEP-2008) to the DDBJ/EMBL/GenBank databases. Contact:Hanako Mishima National Institute of Genetics, DNA Data Bank of Japan; Yata 1111, Mishima, Shizuoka 411-8540, Japan REFERENCE 2 AUTHORS Mishima,H., Shizuoka,T. and Fuji,I. TITLE Glyceraldehyde-3-phosphate dehydrogenase of human JOURNAL TSA Biol 12, 61-70 (2008) COMMENT PRIMARY TSA_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP 1-599 ZZ000004.1 2-598 1-669 ZZ000005.1 11-679 2-596 ZZ000006.1 1-595 2-575 ZZ000007.1 1-574 5-676 ZZ000008.1 1-672 6-725 ZZ000009.1 1-720 59-369 ZZ000010.1 13-322 605-800 ZZ000011.1 1-196 c FEATURES Location/Qualifiers source 1..800 /db_xref="taxon:9606" /mol_type="transcribed RNA" /organism="Homo sapiens" CDS 73..669 /codon_start=1 /gene="GAPD" /product="glyceraldehyde-3-phosphate dehydrogenase" /protein_id="BAA00000.1" /transl_table=1 /translation="MWYQSLVIIEKLNLEANIGKLINTKDNINIRCRLSHTEEHSWHS -- The rest of amino acid sequence is omitted -- " BASE COUNT 199 a 203 c 198 g 200 t ORIGIN 1 attaatataa gctaaatatg tttttcaata tatattgata atagaatatc aacaatttgg : -- The rest of nucleotide sequence is omitted -- : //