If your annotation meets the requirements of TPA Submission Guidelines, DDBJ can accept it as TPA (Third Party Data).
Select from following two ways.
In general, we recommend to use DDBJ Nucleotide Sequence Submission System
In cases of, large number of sequences, many features, and/or long sequences, MSS is more useful.
See Organism qualifier.
For detail, see either of following cases;
2. In case of unidentified species names, proposing a new species etc.
3. environmental sample
In cases of sequences derived by direct molecular isolation from soil, sea water, etc. i.e. a bulk environmental DNA sample by PCR with or without subsequent cloning of the product, DGGE, or other anonymous methods, see What is ENV ? – environmental samples.
For description of organism qualifier, see 3. Environmental samples.
Though frequently confused, the term, 'environmental samples', does NOT mean "wild type". If sequences are derived from isolated or cultured organisms, the sequence data are not classified into environmental samples.
For description of organism qualifier, see 4. Artificially constructed sequences.
You can use experiment or inference qualifier to describe evidence of speculation in each feature.
For sequence data related to Barcode of Life project, please submit via DDBJ Nucleotide Sequence Submission System or Mass Submission System.
For chromatograms (traces), please submit to DDBJ Trace Archive
Regardless you are to publish academic paper or not, DDBJ accepts your submission of sequence data.
If you have no plan to paper publication, you have to fill following items of REFERENCE.
When you change your plan after sequence data submission, i.e. if you publish a paper, contact us from this form to send request with subject "Our paper was published".
DDBJ accepts updating requests only from the original submitter of the entry.
Basically, we strongly recommend to describe joint submitters more than two persons, e.g. at least a true worker and an adviser, to avoid lost communication in future.
In general, you can describe base substitutions by using variation feature with replace and note qualifiers.
In case of using DDBJ Nucleotide Sequence Submission System, select 'other' for template.
About format of feature annotation, see F01) polymorphism and variation at Example of Submission.
Though you can submit sequence data including SNP (Single Nucleotide Polymorphisms) to DDBJ, the data will not automatically reflect to dbSNP.
dbSNP is an independent database from INSDC, operated by NCBI.
For SNP data, we recommend you to submit to dbSNP.
In case of submission to DDBJ, see format of feature annotation at B13) polymorphism and variation on Example of Submission.
For instance, when the length of sequence is 199035 bp and a CDS feature is located in the range from 199001 to 100, you should describe the location of CDS feature as
join(199001..199035,1..100)
See also Description of Location in detail.
As feature annotation, we strongly recommend you to describe CDS (protein-coding sequence),rRNA,tRNA and so on.
Please inform us in detail, when you apply to Mass Submission System.
At first, please confirm whether The Genetic Code is appropriately selected or not.
Generally, if /transl_table qualifier is appropriately described with a number of the genetic code, the nucleotide sequence is automatically translated to amino acid sequence according to the genetic code.
In exceptional cases of specific codons (selenocysteine etc.) that is not followed the genetic codes, describe /transl_except qualifier, appropriately.
In cases of RNA editing,ribosomal frameshift,mitochondrial TAA stop codon, see Example of submission and describe with /exception and /translation, /ribosomal_slippage, /transl_except, respectively.
In case of rare initiation of translation, staring with an amino acid other than methionine, describe the location of CDS feature with starting from "<", operatively indicating 5'end not complete. And describe brief explanation about the translation mechanism in /note qualifier.
See Contact person.
If your affiliation was changed after sequencing or when you belong two or more institutes, please describe the most responsible one as a representative.
Since 2007, we have removed E-mail addresses and phone numbers from sequence data.
If you can find a related paper at REFERENCE on DDBJ flat file, contact information would be available on the paper.
When you wishes to contact to the submitter(s) of an entry of your interest, please contact us via Inquiry to the sequence submitters (submitted to DDBJ) with reasons briefly, then we will forward your message to the submitter(s).
It is the date of the last release of the data. See LOCUS of Explanation of DDBJ flat file format.
In general, you can find accept date in JOURNAL line of REFERENCE 1 on DDBJ flat file.
Please note that some old data do not have the description of accept date.
DDBJ releases sequence data submitted with a hold date according to Principle of “Hold-Until-Published” data release.
Please confirm if the ID on the paper is Accession Number Assigned by INSD or not.
If accession numbers on the paper, please contact us from contact form by selecting the item, "Updating Submitted Data" with following items.
DDBJ is functioning as one of the international nucleotide sequence databases, including EMBL-Bank/EBI in Europe and GenBank/NCBI in the USA as the two other members.
When DDBJ releases the submitted data, EMBL-Bank and GenBank will load the data into their own services, respectively.
See Sequence Data Transition.
Note that the data are converted into EMBL-Bank or GenBank format.
The amino acid sequence for CDS feature will be automatically translated from nucleotide sequence according to location and other items, and reflected into /translation qualifier. So, in general, do not enter it.
The rule to translate nucleotide sequence into amino acid sequence is specified in accordance with agreements of International Nucleotide Sequence Database Collaboration.
The codon table using a CDS feature is specified in the value of /transl_table qualifier as a number of The Genetic Codes.
There are three points frequently misunderstood.
There are some exceptional cases, represented by RNA editing and so on.
INSD; International Nucleotide Sequence Database are composed of DDBJ, ENA and NCBI, and collect experimentally determined nucleotide sequence data.
A unique accession number issued by INSD for each submitted sequence data is defined as the INSD accession number.
On DDBJ flat file, the accession number is described in ACCESSION line.
If multiple entries are united to an entry, or if an entry is extensively modified after the submission, the responsible data banks may assign a new accession number to it. In these cases, the new accession number is called the primary accession number, and the old accession number(s) is/are called the secondary accession number(s).
In the flat file, the primary accession number is indicated first, then the secondary accession number(s) follows.
example ACCESSION AB999999 AB888888 AB777777
You can find the same updated entry with both the primary and the secondary accession numbers, in general.
However, if the old entry with secondary accession number has previously been open to the public, the old one is not removed. So, you can find the old record by getentry.
The activity of the NIG Supercomputer System are evaluated by the acknowledgments of all of you.
Please acknowledge in your papers, presentations and other publications, the role of NIG Supercomputer System played in your research.
It is no problem to modify the following example for the connection of sentences.
Computations were partially performed on the NIG supercomputer at ROIS National Institute of Genetics.
When you use DDBJ services in your research, we would appreciate it if you would include a reference to DDBJ in your publications.
If you consider citation of DDBJ paper is unsuitable, please consider to acknowledge in your publications, the role of DDBJ services played in your research.
It is no problem to modify the following example for the connection of sentences.
This research was performed using "name of DDBJ Service, analytical tools".