Select from following two ways.
In general, we recommend to use DDBJ Nucleotide Sequence Submission System
In cases of, large number of sequences, many features, and/or long sequences, MSS is more useful.
See Organism qualifier.
For detail, see either of following cases;
2. In case of unidentified species names, proposing a new species etc.
3. environmental sample
In cases of sequences derived by direct molecular isolation from soil, sea water, etc. i.e. a bulk environmental DNA sample by PCR with or without subsequent cloning of the product, DGGE, or other anonymous methods, see What is ENV ? – environmental samples.
For description of organism qualifier, see 3. Environmental samples.
Though frequently confused, the term, 'environmental samples', does NOT mean "wild type". If sequences are derived from isolated or cultured organisms, the sequence data are not classified into environmental samples.
Regardless you are to publish academic paper or not, DDBJ accepts your submission of sequence data.
If you have no plan to paper publication, you have to fill following items of REFERENCE.
When you change your plan after sequence data submission, i.e. if you publish a paper, contact us from this form to send request with subject "Our paper was published".
DDBJ accepts updating requests only from the original submitter of the entry.
Basically, we strongly recommend to describe joint submitters more than two persons, e.g. at least a true worker and an adviser, to avoid lost communication in future.
In general, you can describe base substitutions by using variation feature with replace and note qualifiers.
In case of using DDBJ Nucleotide Sequence Submission System, select 'other' for template.
About format of feature annotation, see F01) polymorphism and variation at Example of Submission.
Though you can submit sequence data including SNP (Single Nucleotide Polymorphisms) to DDBJ, the data will not automatically reflect to dbSNP.
dbSNP is an independent database from INSDC, operated by NCBI.
For SNP data, we recommend you to submit to dbSNP.
For instance, when the length of sequence is 199035 bp and a CDS feature is located in the range from 199001 to 100, you should describe the location of CDS feature as
See also Description of Location in detail.
At first, please confirm whether The Genetic Code is appropriately selected or not.
Generally, if /transl_table qualifier is appropriately described with a number of the genetic code, the nucleotide sequence is automatically translated to amino acid sequence according to the genetic code.
In exceptional cases of specific codons (selenocysteine etc.) that is not followed the genetic codes, describe /transl_except qualifier, appropriately.
In case of rare initiation of translation, staring with an amino acid other than methionine, describe the location of CDS feature with starting from "<", operatively indicating 5'end not complete. And describe brief explanation about the translation mechanism in /note qualifier.
See Contact person.
If your affiliation was changed after sequencing or when you belong two or more institutes, please describe the most responsible one as a representative.
Since 2007, we have removed E-mail addresses and phone numbers from sequence data.
If you can find a related paper at REFERENCE on DDBJ flat file, contact information would be available on the paper.
When you wishes to contact to the submitter(s) of an entry of your interest, please contact us via Inquiry to the sequence submitters (submitted to DDBJ) with reasons briefly, then we will forward your message to the submitter(s).
Please confirm if the ID on the paper is Accession Number Assigned by INSD or not.
If accession numbers on the paper, please contact us from contact form by selecting the item, "Updating Submitted Data" with following items.
DDBJ is functioning as one of the international nucleotide sequence databases, including EMBL-Bank/EBI in Europe and GenBank/NCBI in the USA as the two other members.
When DDBJ releases the submitted data, EMBL-Bank and GenBank will load the data into their own services, respectively.
See Sequence Data Transition.
Note that the data are converted into EMBL-Bank or GenBank format.
The amino acid sequence for CDS feature will be automatically translated from nucleotide sequence according to location and other items, and reflected into /translation qualifier. So, in general, do not enter it.
The rule to translate nucleotide sequence into amino acid sequence is specified in accordance with agreements of International Nucleotide Sequence Database Collaboration.
The codon table using a CDS feature is specified in the value of /transl_table qualifier as a number of The Genetic Codes.
There are three points frequently misunderstood.
INSD; International Nucleotide Sequence Database are composed of DDBJ, ENA and NCBI, and collect experimentally determined nucleotide sequence data.
A unique accession number issued by INSD for each submitted sequence data is defined as the INSD accession number.
On DDBJ flat file, the accession number is described in ACCESSION line.
If multiple entries are united to an entry, or if an entry is extensively modified after the submission, the responsible data banks may assign a new accession number to it. In these cases, the new accession number is called the primary accession number, and the old accession number(s) is/are called the secondary accession number(s).
In the flat file, the primary accession number is indicated first, then the secondary accession number(s) follows.
You can find the same updated entry with both the primary and the secondary accession numbers, in general.
However, if the old entry with secondary accession number has previously been open to the public, the old one is not removed. So, you can find the old record by getentry.