A DDBJ genome sequence should be linked to one BioProject and one BioSample. This constraint is set by INSDC to manage genome sequences.
Submit sample(s) used for genome assembly in following packages.
- In the case of isolated, cultured prokaryotes: Cultured Bacterial/Archaeal Genomic Sequences (MIGS.ba)
- In the case of eukaryotes: Eukaryotic Genomic Sequences (MIGS.eu)
Register a locus tag prefix necessary for an annotated genome submission by entering a prefix in the BioSample locus_tag_prefix attribute.
Genome assembled from multiple samples
When submitting a genome sequence assembled from reads of multiple samples, register a derived BioSample because a genome sequence cannot be linked to multiple BioSamples. For example, sequencing reads of male and female samples are submitted to DRA, and submit a genome sequence assembled from both samples, register a derived BioSample citing both BioSample accessions.
Enter the originating BioSample accessions in derived_from attribute of the derived BioSample in the format of “This sample group is the combination of the [number of samples] individual BioSamples: [BioSample accessions]”. For example, “This sample group is the combination of the 2 individual BioSamples: SAMN12623203 and SAMN12623206”
Real examples: A Genome sequence JAGDQO010000000 and a derived sample SAMN17974349.
There are four tiers of metagenome assembly (Metagenome-Assembled Genome, MAG) submission depending on the degree of assembly.
Submit a MAG from a particular organism to the ENV division of DDBJ.
Create a virtual BioSample for MAG submission.
Use the MIMAG package and use an organism name without uncultured (e.g., Methanosarcina thermophila).
Indicate deriving source sample as “derived_from: SAMD00000001”. Examples
A MAG also needs to be linked to a BioSample, create a derived BioSample for MAG.