ACCESSION

This line shows the accession number of the entry.

For general data

Each of the three data banks issues a unique accession number to the data submitter. An accession number consists of either 1 letter and 5 digits (e.g. A12345) or 2 letters and 6 digits (e.g. AB123456). The former style was used in the 1980s; the latter was introduced later because of the rapid growth in data.
The alphabetic part is called the "prefix". Please refer to the prefix list.
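
The two formats described above can be checked mechanically. The following Python sketch is illustrative only: the function name split_general_accession is hypothetical, uppercase letters are assumed from the examples, and only the two formats described in this section are covered.

```python
import re

# Either 1 letter + 5 digits, or 2 letters + 6 digits, as described above.
# Assumption: letters are uppercase, as in the examples A12345 and AB123456.
GENERAL_ACCESSION = re.compile(r"([A-Z])(\d{5})|([A-Z]{2})(\d{6})")


def split_general_accession(accession):
    """Return (prefix, digits), or None if neither format matches."""
    match = GENERAL_ACCESSION.fullmatch(accession)
    if match is None:
        return None
    # Exactly one of the two alternatives matched; pick its groups.
    prefix = match.group(1) or match.group(3)
    digits = match.group(2) or match.group(4)
    return prefix, digits


print(split_general_accession("A12345"))    # ('A', '12345')
print(split_general_accession("AB123456"))  # ('AB', '123456')
print(split_general_accession("AB12345"))   # None
```

The prefix returned here is the part that would be looked up in the prefix list.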

If multiple entries are merged into one entry, or if an entry is extensively modified after submission, the responsible data bank may assign a new accession number to it. In these cases, the new accession number is called the primary accession number, and the old accession number(s) are called the secondary accession number(s). In the flat file, the primary accession number is listed first, followed by the secondary accession number(s). The updated entry can be retrieved with either the primary or any of the secondary accession numbers.

example
ACCESSION   AB999999 AB888888 AB777777
AB999999 -- primary accession number
AB888888 AB777777 -- secondary accession numbers
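
As a minimal sketch of the ordering rule above (primary first, then secondaries), the ACCESSION line can be split as follows. The function name parse_accession_line is hypothetical, and the sketch assumes the whole ACCESSION field sits on a single, unwrapped line with whitespace-separated accession numbers.

```python
def parse_accession_line(line):
    """Split an ACCESSION line into (primary, [secondary, ...]).

    Assumption: the line is not wrapped and every token after the
    ACCESSION keyword is a single accession number.
    """
    fields = line.split()
    if not fields or fields[0] != "ACCESSION":
        raise ValueError("not an ACCESSION line")
    numbers = fields[1:]
    if not numbers:
        raise ValueError("ACCESSION line contains no accession number")
    # The first number is the primary; any others are secondary.
    return numbers[0], numbers[1:]


primary, secondary = parse_accession_line("ACCESSION   AB999999 AB888888 AB777777")
print(primary)    # AB999999
print(secondary)  # ['AB888888', 'AB777777']
```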

For WGS data

The accession number assigned to each entry of WGS data consists of 4 letters and 8 digits.
The alphabetic part is called the "prefix". Please refer to the WGS prefix list.

Example: ZZZZ01000001

4 letters -- project_id
2 digits -- set_version
6 digits -- contig_id

The set_version is incremented with every update of the dataset.
Example: ZZZZ02000001

example
ACCESSION   ZZZZ01000001 ZZZZ01000000
ZZZZ01000001 -- primary accession number
ZZZZ01000000 -- set ID
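
The WGS layout described above (4 letters, 2 digits, 6 digits) can be decomposed as in the following hedged sketch. The names WgsAccession, parse_wgs_accession and is_set_id are hypothetical. Treating an all-zero contig_id as the set ID is an assumption inferred from the example ZZZZ01000000, not a rule stated in this section.

```python
import re
from typing import NamedTuple, Optional


class WgsAccession(NamedTuple):
    project_id: str   # 4 letters
    set_version: int  # 2 digits
    contig_id: int    # 6 digits


# Assumption: letters are uppercase, as in the example ZZZZ01000001.
WGS_ACCESSION = re.compile(r"([A-Z]{4})(\d{2})(\d{6})")


def parse_wgs_accession(accession: str) -> Optional[WgsAccession]:
    """Return the three parts of a WGS accession, or None if it does not match."""
    match = WGS_ACCESSION.fullmatch(accession)
    if match is None:
        return None
    return WgsAccession(match.group(1), int(match.group(2)), int(match.group(3)))


def is_set_id(parsed: WgsAccession) -> bool:
    # Assumption inferred from the example: the set ID has contig_id 000000.
    return parsed.contig_id == 0


print(parse_wgs_accession("ZZZZ01000001"))
print(is_set_id(parse_wgs_accession("ZZZZ01000000")))  # True
```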

For MGA data

This (ACCESSION) line shows the number assigned by INSDC to a resource. The number consists of 5 letters and 7 digits (e.g. ZZZZZ0000000).
The accession numbers assigned to the entries of a resource unit are displayed in the MGA lines.

Example: ZZZZZ0000001

5 alphabetical characters -- project identifier
    first two characters -- identifier of each project (*1)
    third to fifth characters -- identifier of each resource (*2) within a project
7 digits -- number of each sequence entry in a resource

    *1 Information about each project ID is available on the project_index page.
    *2 "Resource" here means a unit of identical origin, such as a tissue or cells, from which sequences are obtained.

example
ACCESSION   ZZZZZ0000000

ZZZZZ0000000 -- number assigned to a resource unit
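
The MGA layout described above (2 letters for the project, 3 letters for the resource, 7 digits for the entry) can be split as in this illustrative sketch. The function name parse_mga_accession and the dictionary keys are hypothetical, and uppercase letters are assumed from the example.

```python
import re

# 2 letters (project) + 3 letters (resource) + 7 digits (entry number).
# Assumption: letters are uppercase, as in the example ZZZZZ0000001.
MGA_ACCESSION = re.compile(r"([A-Z]{2})([A-Z]{3})(\d{7})")


def parse_mga_accession(accession):
    """Return the parts of an MGA accession as a dict, or None if it does not match."""
    match = MGA_ACCESSION.fullmatch(accession)
    if match is None:
        return None
    return {
        "project": match.group(1),
        "resource": match.group(2),
        "entry_number": int(match.group(3)),
    }


print(parse_mga_accession("ZZZZZ0000001"))
# {'project': 'ZZ', 'resource': 'ZZZ', 'entry_number': 1}
```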