Last updated:2013.6.13.


The definition briefly describes the information of gene(s). "DEFINITION" is constructed by each of the three data banks in accordance with standard rules in principle.However, in the case of EST or GSS submission using Mass Submission System, DDBJ will sometimes ask submitters to construct "DEFINITION".

case 1; complete sequence of maize catalase coding gene

DEFINITION  Zea mays Cat3 gene for catalase, complete cds.

Format: [organism name] [gene name] gene for [product name], complete cds.

organism name; The scientific name is indicated as the organism name, in principle.
gene name; the symbol of the gene
product name; the general name of product
complete cds; this coding sequence is complete

case 2; partial sequence of human glyceraldehyde-3-phosphate dehydrogenase coding cDNA

DEFINITION  Homo sapiens  mRNA for glyceraldehyde-3-phosphate
            dehydrogenase, partial cds.

Format: [organism name] mRNA for [product name], partial cds.

partial cds; this coding sequence is partial
The gene name is omitted, since the submitter does not report.

case 3; partial sequence of yeast 25S rRNA gene

DEFINITION  Saccharomyces cerevisiae gene for 25S rRNA, partial

Format: [organism name] gene for [product name], partial sequence.

partial sequence; this sequence is partial

case 4; multiple CDS of rat mitochondrial DNA

DEFINITION  Rattus norvegicus mitochondrial genes for cytochrome
            c oxidase subunit II, ATPase subunit 6, cytochrome c
            oxidase subunit III, partial and complete cds.

Format: [organism name] [gene name], ...... genes for [product name], ......., complete cds.

The gene names and/or product names are subsequently described from 5'to 3' end.
"partial, complete and partial cds" is abbreviated to "partial and complete cds".
If some genes have only gene names or product names, only gene name or product name is described principally.
If the "DEFINITION" is too long, some information, such as map position, is described instead of the gene or product names. Sometimes gene cluster or operon name is described, if it is considered reasonable.

case 5; EST data of human brain 3' end

DEFINITION  Homo sapiens cDNA, clone:ABC123, 3' end, expressed
            in brain.

Format: [organism name] cDNA, clone:[clone name], [other information].

The clone name is mandatory.

case 6; GSS data of mouse chromosome 1q

DEFINITION  Mus musculus DNA, clone:1H11A14, 1q region.

Format: [organism name] DNA, clone:[clone name], [other information].

The clone name is mandatory.

case 7; TPA (Third Party Annotation) data of human GAPD

DEFINITION  TPA_exp: Homo sapiens GAPD mRNA for
            glyceraldehyde-3-phosphate dehydrogenase, complete cds.

In the case of TPA (Third Party Annotation), either of "TPA_exp:" (for TPA:experimental) or "TPA_inf" (for TPA:inferential) is described at the beginning of DEFINITION.

case 8; MGA data about 1 month adult cerebellum of Mus musculus

DEFINITION  Mus musculus mRNA, 1 month adult cerebellum RIKEN Cap Analysis
            Gene Expression (CAGE) library.

Organism name, molecular type, developmental stage, tissue, cell, method of producing sequence and else are described.

In most cases of MGA, EST, GSS data, the contents of "DEFINITION" is prepared by the submitter, not by the databank.