DDBJ Annotated/Assembled Sequences
Submission of environmental sequences
This page describes the submission procedure for sequence data obtained from studies that do not identify individual organisms or species.
Such data consist of sequences and associated information from environmental samples.
Although the two are frequently confused, the term ‘environmental samples’ does NOT mean “wild type”.
Environmental samples are identified sequences derived by direct molecular isolation from
a bulk environmental DNA sample (by PCR with or without subsequent cloning of the product, DGGE, or other anonymous methods)
with no reliable identification of the source organism.
The ENV division contains (but is not limited to) the following types of data:
- sequences derived by direct molecular isolation from soil, sea water, etc.
- clinical samples, gut contents, and other sequences from anonymous organisms that may be associated with a particular host
- mixed culture derived from an environmental sample
Cases not treated as environmental samples
- Isolated and cultured microorganisms
- Highly reproducible samples that are difficult to culture, including but not limited to the following:
  - endosymbionts that can be reliably recovered from a particular host
  - organisms from a readily identifiable but uncultured field sample (e.g., many cyanobacteria)
  - phytoplasmas that can be reliably recovered from diseased plants (even though these cannot be grown in axenic culture)
Notes on environmental sample submission
- For most environmental samples, the organism name consists of the prefix “uncultured” followed by the lineage as far as the submitter could specify it.
See also Organism Qualifier 3. Environmental sample for further detail.
- The /environmental_sample qualifier is required in the source feature.
- The /isolation_source qualifier is required in the source feature to describe the origin of the sample.
- Either the /isolate, /clone or /submitter_seqid qualifier is required as an identifier.
See also Identifiers for further detail.
- Do not use the /strain qualifier.
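The qualifier rules in the notes above can be expressed as a small pre-submission check. The following is only a minimal sketch: the function name `check_env_source`, the dictionary layout, and the example values are assumptions made for illustration and are not part of any DDBJ tool or file format.

```python
# Hypothetical pre-submission check for the source-feature rules listed above.
# Qualifier names are written without the leading slash.
IDENTIFIER_QUALIFIERS = ("isolate", "clone", "submitter_seqid")


def check_env_source(qualifiers):
    """Return a list of problems found in a source feature.

    `qualifiers` maps qualifier name to value. This layout is an
    assumption for the sketch, not a DDBJ data structure.
    """
    problems = []
    # /environmental_sample carries no value, so only its presence is checked.
    if "environmental_sample" not in qualifiers:
        problems.append("missing /environmental_sample")
    # /isolation_source must describe the origin of the sample.
    if not qualifiers.get("isolation_source"):
        problems.append("missing /isolation_source")
    # At least one identifier qualifier is required.
    if not any(qualifiers.get(q) for q in IDENTIFIER_QUALIFIERS):
        problems.append("missing identifier: /isolate, /clone or /submitter_seqid")
    # /strain is not allowed for environmental samples.
    if "strain" in qualifiers:
        problems.append("/strain must not be used")
    return problems


# Illustrative values only.
example = {
    "organism": "uncultured bacterium",
    "environmental_sample": True,
    "isolation_source": "forest soil",
    "clone": "clone_001",
}
print(check_env_source(example))  # prints []
```

An empty list means that none of the four rules is violated; it does not guarantee that the entry is otherwise acceptable for submission.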
For small scale sequence data
Sequence data obtained from environmental samples are assigned to the ENV division.
For large scale sequence data
For large scale sequencing studies, particularly those derived from NGS, please submit a BioProject and a BioSample.
Large scale analyses of targeted loci are treated as a Targeted Locus Study.
When sequences are assembled for species or individuals, the data should be submitted as a Genome Project if they derive from genomes, or as a Transcriptome Project if they derive from RNA transcripts.
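The choice between the small scale and large scale cases can be summarized as a simple decision function. This is a rough sketch of the text on this page only: the function `suggest_route` and its parameters are hypothetical, and the actual category should always be confirmed against the relevant DDBJ submission pages.

```python
def suggest_route(large_scale, targeted_locus=False, assembled_from=None):
    """Suggest submission categories for environmental sequence data.

    All parameters are assumptions for this sketch:
      large_scale     True for large scale sequencing studies (e.g. NGS)
      targeted_locus  True for large scale analyses of targeted loci
      assembled_from  "genome" or "transcript" when sequences are
                      assembled for species or individuals, else None
    """
    if not large_scale:
        # Small scale data are handled as the ENV division.
        return ["ENV division"]
    # Large scale studies need a BioProject and a BioSample.
    route = ["BioProject", "BioSample"]
    if targeted_locus:
        route.append("Targeted Locus Study")
    elif assembled_from == "genome":
        route.append("Genome Project")
    elif assembled_from == "transcript":
        route.append("Transcriptome Project")
    return route


print(suggest_route(large_scale=False))
print(suggest_route(large_scale=True, targeted_locus=True))
```

Keeping the BioProject and BioSample entries in every large scale result mirrors the request above that both be submitted for such studies.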