• Entries from ENA and GenBank during a specific period are not being reflected in getentry

DDBJ Annotated/Assembled Sequences

  • Home
  • Submission
    • Before Submission
    • Web submission
    • Mass Submission
    • Data Update
  • Search
    • getentry
    • ARSA
  • Flat file
    • Feature Table
    • Feature key
    • Qualifier key
    • Nucleotide Sequences
    • Organism qualifier
    • Identifiers
    • Description of Location
    • Protein Coding Sequence
    • The Genetic Codes
    • Codes Used in Sequence Description
    • Description Examples of Sequence Data
  • Data categories
    • Data Submission from Genome Project
    • Pseudohaplotype
    • WGS
    • Finished level genomic sequences
    • Metagenome Assembly
    • Single amplified genome
    • HTG
    • Environmental sample
    • ENV
    • TLS
    • Data Submission from Transcriptome Project
    • TSA
    • EST
    • HTC
    • Third Party Data (TPA)
  • FAQ
  • Other
    • Patent
    • MGA
  • Home
  • ddbj
  • Sequence data derived from environmental samples

Sequence data derived from environmental samples

Sequence data derived from environmental samples

Here shows procedures for sequence data obtained from studies that do not specify individuals or species.

These kinds of data contain sequences and other information on environmental samples.
Though frequently confused, the term, ‘environmental samples’, does NOT mean “wild type”.
Environmental samples are identified sequences derived by direct molecular isolation from a bulk environmental DNA sample (by PCR with or without subsequent cloning of the product, DGGE, or other anonymous methods) with no reliable identification of the source organism.

The ENV division contains (but is not limited to) the following types of data:

  • sequences derived by direct molecular isolation from soil, sea water, etc.
  • clinical samples, gut contents, and other sequences from anonymous organisms that may be associated with a particular host
  • mixed culture derived from an environmental sample
  • see Metagenome Assembly

Cases not treated as environmental samples

  • Isoalted and cultured microorganisms
    • Though frequently confused, the term, ‘environmental samples’, does NOT mean “wild type”.
  • Highly reproducible samples though difficult to be cultured (not limited to follows)
    • endosymbionts that can be reliably recovered from a particular host
    • organisms from a readily identifiable but uncultured field sample (e.g., many cyanobacteria)
    • phytoplasmas that can be reliably recovered from diseased plants (even though these cannot be grown in axenic culture).

Notes on environmental sample submission

  • In many cases of environmental samples, we assign the lineage that as far as submitters could specify is used for the description of organism name with the header “uncultured”.
    See also Organism Qualifier 3. Environmental sample for further detail.
  • The /environmental_sample qualifier is required in source feature.
  • The /isolation_source qualifier is required in source feature to describe origin of the sample.
  • Either /isolate, /clone or /submitter_seqid qualifier is required as an identifier.
    See also Identifiers for further detail.
  • Do not use strain qualifier.

For small scale sequence data

Sequence data obtained from environmental samples are treated as ENV division.

For large scale sequence data

For large scale sequencing studies, particularly derived fro NGS, please submit BioProject and BioSample.

Large scale analyses of targeted loci are treated as Targeted Locus Study.

In cases of analyses for assembling sequences for species or individuals, depending on from genomes or from RNA transcripts, the data should be submitted as Genome Project or Transcriptome Project, respectively.

Related pages

  • Data Submission from Genome Project
  • Data Submission from Transcriptome Project
  • Third Party Data (TPA)