Last updated:2013.6.13.

LOCUS

locus name, sequence length, molecule type, molecular form, division, the date of last release

example
LOCUS       AB000000         450 bp    mRNA    linear   HUM 01-JUN-2009

AB000000 -- locus name
450 bp -- length of sequence
mRNA -- molecule type
linear -- molecular form
HUM -- division
01-JUN-2009 -- the date of last release

locus name

Locus name is a unique ID of the entry in the database.

In DDBJ, since July 1996, the locus name has been assigned the same as accession number.

length of sequence

Notice: No information is available on the Master record of MGA data.

molecule type

According to the value of /mol_type qualifier for source feature, it is described as DNA, RNA, mRNA, rRNA, tRNA, or cRNA.

molecular form

This column indicates whether molecular form of nucleotide sequence is "linear" or "circular". If the entry is the full length of circular form, "circular" is appeared.

division

DDBJ classifies entries into 21 divisions as below;

a: taxonomic divisions

HUM human
PRI primates (other than human)
ROD rodents
MAM mammals (other than primates and rodents)
VRT vertebrates (other than mammals)
INV invertebrates (animals other than vertebrates)
PLN plants, fungi, plastids (eukaryotes other than animals)
BCT bacteria (including both Eubacteria and Archaea)
VRL viruses
PHG bacteriophages

b: other divisions

PAT sequence data related to patent application
The data those which Japan Patent Office (JPO), United States Patent and Trademark Office (USPTO),
European Patent Office (EPO), and Korean Intellectual Property Office (KIPO) collected, processed and released.
ENV sequences obtained via environmental sampling methods
SYN synthetic constructs; artificially constructed sequences
EST expressed sequence tags; short single pass cDNA sequences
TSA transcriptome shotgun assemblies; assembled mRNA sequences
GSS genome survey sequences; short single pass genomic sequences
HTC high throughput cDNA sequences;
The sequence submitted from cDNA sequencing projects except for EST. This division is to include unfinished high throughput cDNA sequences, each of which has 5'UTR and 3'UTR at both ends and part of a coding region. The sequence may also include introns. When the sequence becomes finished later, it moves to the corresponding taxonomic division.
HTG high throughput genomic sequences
The sequence submitted mainly from genome sequencing projects which regarded a clone as a sequencing unit.
STS sequence tagged sites
The tag site for genome sequencing. The information of chromosome, map, PCR_condition is necessary for this division.
UNA the data not annotated

The UNA division is not used recently.
CON Contig / Constructed
To conjugate a series of entries, such as those submitted from a genome project, each of the three data banks constructs an entry and assign an accession number to a large scale sequence dataset. Such entries are classified into the CON division. The entry in the CON division has the information of joined accession numbers instead of the sequence data. The corresponding entries of the CON entry have been submitted to other divisions.

the date of last release

The current publicized date is described. If the entry is updated and reopened to public site, this date will be changed.



 
 
 
 

 
 
 
 

 
 
 
 

 
 
 
 

 
 
 
 

 
 
 
 

 
 
 
 

ページの先頭へ戻る