DDBJ Annotated/Assembled Sequences
配列の記載に用いる略号
塩基
Nucleotide Base Codes
国際塩基配列データベースで使用する核酸コードは以下の通りです。
全て小文字を使用します。大文字で登録された場合には、自動的に小文字に変換されます。
| シンボル | 意味 | 説明 |
|---|---|---|
| a | a | adenine |
| c | c | cytosine |
| g | g | guanine |
| t | t | thymine in DNA; uracil in RNA |
| m | a or c | amino |
| r | a or g | purine |
| w | a or t | |
| s | c or g | |
| y | c or t | pyrimidine |
| k | g or t | keto |
| v | a or c or g | not t |
| h | a or c or t | not g |
| d | a or g or t | not c |
| b | c or g or t | not a |
| n | a or c or g or t | any |
[参考文献]
- Cornish-Bowden, A. Nucl Acid Res 13, 3021-3030 (1985)
- Feature Table Definition: 7.4.1 Nucleotide base codes (IUPAC)
Modified Base Abbreviations
修飾塩基は、以下の例のように modified base を用いて記載します。
例
FEATURES Location/Qualifiers
modified_base 15
/mod_base="m2g"
| 省略形 | 修飾塩基 |
|---|---|
| ac4c | 4-acetylcytidine |
| chm5u | 5-(carboxyhydroxylmethyl)uridine |
| cm | 2’-O-methylcytidine |
| cmnm5s2u | 5-carboxymethylaminomethyl-2-thiouridine |
| cmnm5u | 5-carboxymethylaminomethyluridine |
| dhu | dihydrouridine |
| fm | 2’-O-methylpseudouridine |
| gal q | beta,D-galactosylqueuosine |
| gm | 2’-O-methylguanosine |
| i | inosine |
| i6a | N6-isopentenyladenosine |
| m1a | 1-methyladenosine |
| m1f | 1-methylpseudouridine |
| m1g | 1-methylguanosine |
| m1i | 1-methylinosine |
| m22g | 2,2-dimethylguanosine |
| m2a | 2-methyladenosine |
| m2g | 2-methylguanosine |
| m3c | 3-methylcytidine |
| m4c | N4-methylcytosine |
| m5c | 5-methylcytidine |
| m6a | N6-methyladenosine |
| m7g | 7-methylguanosine |
| mam5u | 5-methylaminomethyluridine |
| mam5s2u | 5-methoxyaminomethyl-2-thiouridine |
| man q | beta,D-mannosylqueuosine |
| mcm5s2u | 5-methoxycarbonylmethyl-2-thiouridine |
| mcm5u | 5-methoxycarbonylmethyluridine |
| mo5u | 5-methoxyuridine |
| ms2i6a | 2-methylthio-N6-isopentenyladenosine |
| ms2t6a | N-((9-beta-D-ribofuranosyl-2-methyltiopurin-6-yl)carbamoyl)threonine |
| mt6a | N-((9-beta-D-ribofuranosylpurine-6-yl)N-methyl-carbamoyl)threonine |
| mv | uridine-5-oxyacetic acid methylester |
| o5u | uridine-5-oxyacetic acid (v) |
| osyw | wybutoxosine |
| p | pseudouridine |
| q | queuosine |
| s2c | 2-thiocytidine |
| s2t | 5-methyl-2-thiouridine |
| s2u | 2-thiouridine |
| s4u | 4-thiouridine |
| m5u | 5-methyluridine |
| t6a | N-((9-beta-D-ribofuranosylpurine-6-yl)carbamoyl)threonine |
| tm | 2’-O-methyl-5-methyluridine |
| um | 2’-O-methyluridine |
| yw | wybutosine |
| x | 3-(3-amino-3-carboxypropyl)uridine, (acp3)u |
| OTHER | Other (*) |
(*) このリストにない修飾塩基は /note qualifier に記載します。
[参考文献]
- Sprinzl, M. and Gauss, D.H. Nucl Acid Res 10, r1 (1982). (note that in Cornish_Bowden, A. Nucl Acid Res 13, 3021-3030 (1985) the IUPAC-IUB declined to recommend a set of abbreviations for modified nucleotides)
- Feature Table Definition: 7.4.2 Modified base abbreviations
アミノ酸
Amino Acid Codes
国際塩基配列データベースで使用するアミノ酸コードは以下の通りです。
CDS feature の /translation には以下の一文字表記で表されます。
/transl_except, /anticodon に記載するアミノ酸は、以下の省略形を使用します。
これ以外のアミノ酸を記載する場合には、Modified and unusual Amino Acidsを参照して下さい。
| Abbreviation | 1 letter abbreviation | Amino acid name |
|---|---|---|
| Ala | A | Alanine |
| Arg | R | Arginine |
| Asn | N | Asparagine |
| Asp | D | Aspartic acid |
| Cys | C | Cysteine |
| Gln | Q | Glutamine |
| Glu | E | Glutamic acid |
| Gly | G | Glycine |
| His | H | Histidine |
| Ile | I | Isoleucine |
| Leu | L | Leucine |
| Lys | K | Lysine |
| Met | M | Methionine |
| Phe | F | Phenylalanine |
| Pro | P | Proline |
| Pyl | O | Pyrrolysine |
| Ser | S | Serine |
| Sec | U | Selenocysteine |
| Thr | T | Threonine |
| Trp | W | Tryptophan |
| Tyr | Y | Tyrosine |
| Val | V | Valine |
| Asx | B | Aspartic acid or Asparagine |
| Glx | Z | Glutamic acid or Glutamine |
| Xaa | X | Any amino acid |
| Xle | J | Leucine or Isoleucine |
| TERM | termination codon |
[参考文献]
- IUPAC-IUB Joint Commission on Biochemical Nomenclature.Nomenclature and Symbolism for Amino Acids and Peptides. Eur. J. Biochem. 138: 9-37 (1984).
- Feature Table Definition: 7.4.3 Amino acid abbreviations
Modified and Unusual Amino Acids
Amino Acid Codes にないアミノ酸を記載する場合には以下の省略形を使用します。
CDS feature の /translation にはいずれも “X” で表されます。
| 省略形 | アミノ酸 |
|---|---|
| Aad | 2-Aminoadipic acid |
| bAad | 3-Aminoadipic acid |
| bAla | beta-Alanine, beta-Aminoproprionic acid |
| Abu | 2-Aminobutyric acid |
| 4Abu | 4-Aminobutyric acid, piperidinic acid |
| Acp | 6-Aminocaproic acid |
| Ahe | 2-Aminoheptanoic acid |
| Aib | 2-Aminoisobutyric acid |
| bAib | 3-Aminoisobutyric acid |
| Apm | 2-Aminopimelic acid |
| Dbu | 2,4-Diaminobutyric acid |
| Des | Desmosine |
| Dpm | 2,2’-Diaminopimelic acid |
| Dpr | 2,3-Diaminoproprionic acid |
| EtGly | N-Ethylglycine |
| EtAsn | N-Ethylasparagine |
| Hyl | Hydroxylysine |
| aHyl | allo-Hydroxylysine |
| 3Hyp | 3-Hydroxyproline |
| 4Hyp | 4-Hydroxyproline |
| Ide | Isodesmosine |
| aIle | allo-Isoleucine |
| MeGly | N-Methylglycine, sarcosine |
| MeIle | N-Methylisoleucine |
| MeLys | 6-N-Methyllysine |
| MeVal | N-Methylvaline |
| Nva | Norvaline |
| Nle | Norleucine |
| Orn | Ornithine |
| OTHER | Other (*) |
(*) このリストにないアミノ酸は /note qualifier に記載します。
[参考文献]