Example of Submission
Please refer to following sample data list to annotate your sequences for DDBJ submission.
A: Ribosomal RNA, ITS, IGS
B: Protein-coding genes
- A01) 16S rRNA gene
- A02) 18S rRNA gene including intron
- A03) ITS1, ITS2
- A04) intergenic spacer, IGS
B: Protein-coding genes
- B01) CDS (mRNA)
- B02) CDS (DNA)
- B03) premature mRNA
- B04) promoter region
- B05) 5' flanking region
- B06) pseudogene
- B07) alternative splicing (mRNA)
- B08) alternative splicing (DNA)
- B09) RNA editing
- B10) ribosomal frameshift in HIV1 complete genome
- B11) partial TAA stop codon in mitochondrial genome
- B12) Major Histocompatibility Complex (MHC)
- B13) trans_splicing
C: EST, GSS, STS
D: Repeat
E: Particular resources
F: Variation
G: Sequencing gap
D: Repeat
- D01.1) microsatellite, recommended annotation
- D01.2) microsatellite, minimal annotation
- D02) transposon
- D03) insertion sequence
E: Particular resources
- E01) environmental sample
- E02) organism name with sp. (unidentified species)
- E03) natural plasmid
- E04) cloning vector
- E05) synthetic construct
F: Variation
- F01) polymorphism and variation
- F02) candidates of polymorphism found by direct sequencing of diploid genome
G: Sequencing gap
A: Ribosomal RNA, ITS, IGS
A01) 16S rRNA gene
DEFINITION Vibrio halioticoli gene for 16S rRNA, partial sequence, strain: IAM
14597.
FEATURES Location/Qualifiers
source 1..1471
/mol_type="genomic DNA"
/organism="Vibrio halioticoli"
/strain="IAM 14597"
rRNA <1..>1471
/product="16S rRNA"
A02) 18S rRNA gene including intron
DEFINITION Sporobolomyces ruber gene for 18S rRNA, partial sequence.
FEATURES Location/Qualifiers
source 1..2072
/mol_type="genomic DNA"
/organism="Sporobolomyces ruber"
/strain="JCM 6884"
rRNA join(<1..1144,1469..>2072)
/product="18S rRNA"
intron 1145..1468
A03) ITS1, ITS2
DEFINITION Microsphaera trifolii var. trifolii genes for 18S rRNA, ITS1,
5.8S rRNA, ITS2, 28S rRNA, partial and complete sequence.
FEATURES Location/Qualifiers
source 1..597
/isolate="MUMH29s"
/mol_type="genomic DNA"
/organism="Microsphaera trifolii var. trifolii"
/specific_host="Trifolium vulgaris"
/tissue_type="conidia"
/variety="trifolii"
rRNA <1..4
/product="18S rRNA"
misc_RNA 5..224
/note="internal transcribed spacer 1"
rRNA 225..378
/product="5.8S rRNA"
misc_RNA 379..562
/note="internal transcribed spacer 2"
rRNA 563..>597
/product="28S rRNA"
If each feature location is not clear, following annotation can be described.
FEATURES Location/Qualifiers
source 1..597
/isolate="MUMH29s"
/mol_type="genomic DNA"
/organism="Microsphaera trifolii var. trifo lii"
/specific_host="Trifolium vulgaris"
/tissue_type="conidia"
/variety="trifolii"
misc_RNA <1..>597
/note="contains 18S rRNA, ITS1, 5.8S rRNA, ITS2 and
28S RNA"
A04) intergenic spacer, IGS
DEFINITION Setaria italica genes for 25S rRNA, IGS, 17S rRNA, partial and
complete sequence.
FEATURES Location/Qualifiers
source 1..2646
/clone="pSIR012"
/country="Japan"
/cultivar="Shimokatsugi"
/db_xref="taxon:4555"
/mol_type="genomic DNA"
/organism="Setaria italica"
rRNA <1..493
/product="25S rRNA"
repeat_region 840..1432
/rpt_unit_range=840..1073
misc_feature 494..2484
/note="intergenic spacer, IGS"
rRNA 2485..>2646
/product="17S rRNA"
B: Protein-coding genes
B01) CDS (mRNA)
DEFINITION Homo sapiens AQP9 mRNA for aquaporin 9, complete cds.
FEATURES Location/Qualifiers
source 1..2878
/mol_type="mRNA"
/organism="Homo sapiens"
/tissue_type="liver"
CDS 217..1104
/codon_start=1
/gene="AQP9"
/product="aquaporin 9"
/translation="--- omitted ---"
/transl_table=1
3'UTR 1105..2878
polyA_signal 2857..2862
polyA_site 2878
B02) CDS (DNA)
DEFINITION Aspergillus oryzae tglA gene for triacylglycerol lipase, complete
cds.
FEATURES Location/Qualifiers
source 1..1025
/mol_type="genomic DNA"
/organism="Aspergillus oryzae"
/strain="RIB128"
CDS join(34..330,378..523,607..764,827..990)
/codon_start=1
/gene="tglA"
/product="triacylglycerol lipase"
/transl_table=1
/translation="--- omitted ---"
B03) premature mRNA
DEFINITION Cynops pyrrhogaster CpTbx3 premature mRNA, partial cds.
FEATURES Location/Qualifiers
source 1..5170
/dev_stage="gastrula"
/mol_type="transcribed RNA"
/organism="Cynops pyrrhogaster"
/tissue_type="embryo"
intron <1..359
CDS <360..2135
/codon_start=1
/gene="CpTbx3"
/note="T-box family member; T-box domain"
/transl_table=1
/translation="--- omitted ---"
B04) promoter region
DEFINITION Mus musculus mNB-3 gene for neural recognition molecule NB-3, 5'
flanking region.
FEATURES Location/Qualifiers
source 1..1300
/dev_stage="4-8 weeks old"
/mol_type="genomic DNA"
/organism="Mus musculus"
/sex="female"
/strain="129SVJ"
promoter <1..1197
prim_transcript 1198..>1300
/gene="mNB-3"
/note="neural recognition molecule NB-3"
B05) 5' flanking region
DEFINITION Mus musculus gene for membrane cofactor protein CD46, 5' flanking
region.
FEATURES Location/Qualifiers
source 1..1614
/mol_type="genomic DNA"
/organism="Mus musculus"
/strain="129SV"
misc_feature 1..1614
/note="5' flanking region"
/product="membrane cofactor protein CD46"
B06) pseudogene
DEFINITION Homo sapiens pseudogene, necdin.
FEATURES Location/Qualifiers
source 1..2285
/mol_type="genomic DNA"
/organism="Homo sapiens"
CDS <1..1319
/note="necdin"
/pseudo
B07) alternative splicing (mRNA)
[isoform 1]
DEFINITION Homo sapiens BAP2 mRNA for BAI-associated protein 2 alpha,
complete cds.
ACCESSION XA000001
FEATURES Location/Qualifiers
source 1..3168
/mol_type="mRNA"
/organism="Homo sapiens"
CDS 94..1659
/codon_start=1
/gene="BAP2"
/note="alternative splicing: see also Acc# XA000002.1"
/product="BAI-associated protein 2 alpha"
/transl_table=1
/translation="--- omitted ---"
[isoform 2]
DEFINITION Homo sapiens BAP2 mRNA for BAI-associated protein 2 beta,
complete cds.
ACCESSION XA000002
FEATURES Location/Qualifiers
source 1..2129
/mol_type="mRNA"
/organism="Homo sapiens"
CDS 94..1659
/codon_start=1
/gene="BAP2"
/note="alternative splicing: see also Acc# XA000001.1"
/product="BAI-associated protein 2 beta"
/transl_table=1
/translation="--- omitted ---"
B08) alternative splicing (DNA)
DEFINITION Homo sapiens KNP-I gene for KNP-I alpha protein, KNP-I beta
protein, partial cds, alternative splicing.
FEATURES Location/Qualifiers
source 1..13051
/cell_line="GM130B"
/cell_type="B-lymphoblastoid"
/chromosome="21"
/clone="D6B5"
/map="21q22.3"
/mol_type="genomic DNA"
/organism="Homo sapiens"
exon 2676..2884
/gene="KNP-I"
/number=1
5'UTR 2676..2743
/gene="KNP-I"
CDS join(2744..2884,3148..3200,5106..5219,6223..6342,9296..
9388,12251..>12407)
/codon_start=1
/gene="KNP-I"
/product="KNP-I alpha protein"
/translation="--- omitted ---"
CDS join(2744..2884,3148..3200,5106..5219,6223..6342,12251..
>12407)
/codon_start=1
/gene="KNP-I"
/note="alternative splicing"
/product="KNP-I beta protein"
/translation="--- omitted ---"
exon 3148..3200
/gene="KNP-I"
/number=2
exon 5106..5219
/gene="KNP-I"
/number=3
exon 6223..6342
/gene="KNP-I"
/number=4
exon 9296..9388
/gene="KNP-I"
/number=5
exon 12251..12407
/gene="KNP-I"
/number=6
B09) RNA editing
DEFINITION Beta vulgaris mitochondrial nad4L gene for NADH dehydrogenase
subunit 4L, complete cds.
FEATURES Location/Qualifiers
source 1..400
/mol_type="genomic DNA"
/organelle="mitochondrion"
/organism="Beta vulgaris"
/strain="TK81-O"
CDS 71..373
/codon_start=1
/exception="RNA editing"
/gene="nad4L"
/note="initiation codon is created by RNA editing"
/product="NADH dehydrogenase subunit 4L"
/transl_table=1
/translation="--- omitted ---"
misc_feature 72
/note="C to U RNA editing"
misc_feature 117
/note="C to U RNA editing"
misc_feature 125
/note="C to U RNA editing"
misc_feature 180
/note="C to U RNA editing"
misc_feature 201
/note="C to U RNA editing"
B10) ribosomal frameshift in HIV1 complete genome
DEFINITION Human immunodeficiency virus 1 proviral DNA, complete genome,
isolate: 95TNIH047.
FEATURES Location/Qualifiers
source 1..9430
/country="Japan"
/isolate="95TNIH047"
/isolation_source="peripheral blood mononuclear
cell of male patient"
/mol_type="genomic DNA"
/organism="Human immunodeficiency virus 1"
/proviral
CDS join(759..2049,2049..5059)
/codon_start=1
/gene="gag-pol"
/note="produced by ribosomal frameshift slip on tttttt
slippery site"
/product="gag-pol fusion polyprotein"
/ribosomal_slippage
/translation="--- omitted ---"
CDS 759..2255
/codon_start=1
/gene="gag"
/product="Gag polyprotein precursor"
/transl_table=1
/translation="--- omitted ---"
--- The rest is omitted ---
B11) partial TAA stop codon in mitochondrial genome
DEFINITION Mus musculus mitochondrial DNA, complete genome, strain: SAMP8.
FEATURES Location/Qualifiers
source 1..16299
/mol_type="genomic DNA"
/organelle="mitochondrion"
/organism="Mus musculus"
/strain="SAMP8"
/tissue_type="liver"
--- omitted ---
CDS 8607..9390
/codon_start=1
/gene="COX3"
/note="TAA stop codon is completed by the addition of 3'
A residues to the mRNA"
/product="cytochrome oxidase subunit 3"
/transl_except=(pos:9390,aa:TERM)
/transl_table=2
/translation="--- omitted ---"
tRNA 9391..9458
/anticodon=(pos:9421..9423,aa:Gly)
/note="codon recognized: GGA"
/product="tRNA-Gly"
CDS 9459..9806
/codon_start=1
/gene="ND3"
/product="NADH dehydrogenase subunit 3"
/transl_table=2
/translation="--- omitted ---"
tRNA 9808..9875
/anticodon=(pos:9838..9840,aa:Arg)
/note="codon recognized: CGA"
/product="tRNA-Arg"
--- The rest is omitted ---
B12) Major Histocompatibility Complex (MHC)
DEFINITION Homo sapiens HLA-A gene for MHC class I antigen, partial cds,
allele: HLA-A2(A*0201V3).
FEATURES Location/Qualifiers
source 1..546
/mol_type="genomic DNA"
/note="HLA-A2(A*0201V3)/24,B35/60,DRB1*1501/1202"
/organism="Homo sapiens"
CDS <1..>546
/allele="HLA-A2(A*0201V3)"
/codon_start=3
/gene="HLA-A"
/product="MHC class I antigen"
/transl_table=1
/translation="--- omitted ---"
B13) trans_splicing
DEFINITION Psilotum nudum chloroplast DNA, complete genome.
FEATURES Location/Qualifiers
source 1..138829
/cultivar="Kingyoku"
/mol_type="genomic DNA"
/organelle="plastid:chloroplast"
/organism="Psilotum nudum"
---- omitted ----
CDS join(complement(68922..69035),133880..134137)
/codon_start=1
/note="trans splicing of 5'rps12 exon and 3'rps 12 exon"
/product="ribosomal protein S12"
/protein_id="BAB84240.1"
/trans_splicing
/transl_table=11
/translation="MPTIQQLIRNARQPIRSRTKSPALRG CPQRRGVCIRVYTTTPK
KPNSALRKVARVRLTSGFEITAYIPGIGHNLQEHSVVSVRGGRVKDLPGVRYHIVRGT
LDAVGVKDRKQGRSRYGVKKPK"
exon complement(68922..69035)
/gene="rps12"
/note="5'rps12 exon, trans splicing"
/number=1
/product="ribosomal protein S12"
---- omitted ----
exon 133880..134137
/gene="rps12"
/note="3'rps12 exon, trans splicing"
/number=2
/product="ribosomal protein S12"
---- omitted ----
C: EST, GSS, STS
C01) EST (Expressed Sequence Tag)
DEFINITION Mus musculus cDNA, clone: 2310009B01, 3' end sequence, expressed
in tongue.
KEYWORDS EST (expressed sequence tag); 3'-end sequence (3'-EST).
FEATURES Location/Qualifiers
source 1..267
/clone="2310009B01"
/clone_lib="RIKEN full-length enriched mouse cDNA library"
/dev_stage="adult"
/mol_type="mRNA"
/organism="Mus musculus"
/sex="male"
/tissue_type="tongue"
C02) GSS (Genome Survey Sequence)
DEFINITION Arabidopsis thaliana DNA, YAC clone: CIC5D1, left end,
chromosome 1 between mi303 and mi259.
KEYWORDS GSS (genome survey sequence).
FEATURES Location/Qualifiers
source 1..532
/chromosome="1"
/clone="YAC clone CIC5D1"
/map="between mi303 and mi259"
/mol_type="genomic DNA"
/organism="Arabidopsis thaliana"
/ecotype="Columbia"
C03) STS (Sequence Tagged Site)
DEFINITION Sus scrofa DNA, STS on chromosome 1, clone:AA12345,
sequence tagged site.
KEYWORDS STS (sequence tagged site).
FEATURES Location/Qualifiers
source 1..200
/chromosome="1"
/clone="AA12345"
/mol_type="genomic DNA"
/organism="Sus scrofa"
primer_bind 1..20
/PCR_conditions="30 cycles 94degC 30 sec,
56degC 30 sec and 72degC 30 sec"
primer_bind complement(180..200)
D: Repeat
D01.1) microsatellite, recommended annotation
DEFINITION Bos taurus DNA, microsatellite locus 3456P.
FEATURES Location/Qualifiers
source 1..300
/chromosome="11"
/map="11p"
/mol_type="genomic DNA"
/organism="Bos taurus"
primer_bind 20..40
/PCR_conditions="denaturation 94degC 2 min; 30 cycles
94degC 30 sec, 56degC 1 min, 72degC 1 min; final
extention 72degC 1 min"
repeat_region 60..200
/rpt_type=tandem
/rpt_unit_seq="ta"
/satellite="microsatellite: 3456P"
primer_bind complement(210..230)
Microsatellite for mapping purpose is categorized into STS division.
D01.2) microsatellite, minimal annotation
DEFINITION Paralichthys olivaceus DNA, microsatellite marker Poli1TUF.
FEATURES Location/Qualifiers
source 1..222
/mol_type="genomic DNA"
/organism="Paralichthys olivaceus"
repeat_region 25..94
/rpt_type=tandem
/rpt_unit_seq="ca"
/satellite="microsatellite: Poli1TUF"
D02) transposon
DEFINITION Escherichia coli transposon Tn2000 DNA, complete sequence.
FEATURES Location/Qualifiers
source 1..5000
/mol_type="genomic DNA"
/organism="Escherichia coli"
repeat_region 1..5000
/mobile_element="transposon:Tn2000"
repeat_region 1..100
/rpt_type=inverted
CDS 146..1576
/codon_start=1
/product="transposase"
/transl_table=11
/translation="--- omitted ---"
CDS 2785..3585
/codon_start=1
/product="streptomycin phosphotransferase"
/transl_table=11
/translation="--- omitted ---"
repeat_region 4900..5000
/rpt_type=inverted
D03) insertion sequence
DEFINITION Streptomyces coelicolor insertion sequence IS123 DNA, complete
sequence.
FEATURES Location/Qualifiers
source 1..2000
/db_xref="taxon:1902"
/mol_type="genomic DNA"
/organism="Streptomyces coelicolor"
/strain="A456"
repeat_region 44..48
/note=target sequence
/rpt_type=flanking
repeat_region 49..1166
/mobile_element="insertion sequence:IS123"
repeat_region 49..78
/note="30nt perfect inverted repeat"
/rpt_type=inverted
CDS 271..1110
/codon_start=1
/gene="tnpA"
/product="transposase"
/transl_table=11
/translation="--- omitted ---"
repeat_region 1137..1166
/note="30nt perfect inverted repeat"
/rpt_type=inverted
repeat_region 1167..1171
/note="target sequence"
/rpt_type=flanking
E: Particular resources
E01) environmental sample sequence
DEFINITION Uncultured bacterium gene for 16S rRNA, partial sequence, clone:
4-11.
FEATURES Location/Qualifiers
source 1..1475
/clone="4-11"
/environmental_sample
/isolation_source="solid waste compost"
/mol_type="genomic DNA"
/organism="uncultured bacterium"
rRNA <1..>1475
/product="16S rRNA"
E02) organism name with sp. (unidentified species)
DEFINITION Eirene sp. EML1 GAPDH mRNA for glyceraldehyde-3-phosphate
dehydrogenase, partial cds.
FEATURES Location/Qualifiers
source 1..245
/mol_type="mRNA"
/organism="Eirene sp. EML1"
/strain="EML1"
CDS <1..>245
/codon_start=3
/gene="GAPDH"
/product="glyceraldehyde-3-phosphate dehydrogenase"
/translation="--- omitted ---"
E03) natural plasmid
DEFINITION Rhodococcus rhodochrous plasmid pRC4 repA, repB genes for
replication protein, DNA-binding replication protein, complete
cds.
FEATURES Location/Qualifiers
source 1..2582
/db_xref="IFO:3338"
/mol_type="genomic DNA"
/organism="Rhodococcus rhodochrous"
/plasmid="pRC4"
/strain="IFO 3338"
CDS 1142..2062
/codon_start=1
/gene="repA"
/product="replication protein"
/transl_table=11
/translation="--- omitted ---"
CDS 2052..2333
/codon_start=1
/gene="repB"
/product="DNA-binding replication protein"
/transl_table=11
/translation="--- omitted ---"
E04) cloning vector
DEFINITION Cloning vector pAP3neo DNA, complete sequence.
FEATURES Location/Qualifiers
source 1..5350
/mol_type="other DNA"
/organism="Cloning vector pAP3neo"
rep_origin 572..1452
/note="ColE1-derived plasmid replication origin"
CDS complement(1513..2373)
/codon_start=1
/gene="Amp"
/product="beta-lactamase"
/transl_table=11
/translation="--- omitted ---"
rep_origin 2504..2959
/note="phage f1 region"
promoter 3290..3485
/note="SV40 early promoter"
CDS 3596..4390
/codon_start=1
/gene="neo"
/product="neomycin phosphotransferase"
/transl_table=11
/translation="--- omitted ---"
promoter 4788..4983
/note="SV40 early promoter"
promoter 5188..5207
/note="T7 promoter"
promoter 5302..5321
/note="T3 promoter"
E05) synthetic construct
DEFINITION Synthetic construct gene for Rai-chu 101, complete cds.
FEATURES Location/Qualifiers
source 1..2223
/db_xref="taxon:32630"
/focus
/mol_type="other DNA"
/organism="synthetic construct"
CDS 1..2223
/codon_start=1
/gene="Rai-chu 101"
/note="fusion protein"
/product="Rai-chu 101"
/standard_name="Ras and interacting protein chimeric
unit 101"
/transl_table=11
/translation="--- omitted ---"
source 724..1239
/db_xref="taxon:9606"
/mol_type=other DNA""
/note="derived from human H-Ras cDNA"
/organism="Homo sapiens"
source 1258..1500
/db_xref="taxon:9606"
/mol_type=other DNA""
/note="derived from human Raf cDNA"
/organism="Homo sapiens"
/focus can be added by DDBJ.
F: Variation
F01) polymorphism and variation
See also representative submissions of identical sequences for variation studies.
DEFINITION Homo sapiens IGF1R mRNA for insulin-like growth factor I receptor,
partial cds.
FEATURES Location/Qualifiers
source 1..1000
/mol_type="mRNA"
/organism="Homo sapiens"
CDS <1..>1000
/codon_start=2
/gene="IGF1R"
/product="insulin-like growth factor I recep tor"
/translation="--- omitted ---"
variation 100
/note="polymorphism"
/note="This substitution causes an amino acid
substitution from Ala to Thr."
/compare="AB000000.1"
/replace="a"
F02) candidates of polymorphism found by direct sequencing of diploid genome
FEATURES Location/Qualifiers
source 1..426
/db_xref="taxon:9606"
/mol_type="mRNA"
/organism="Homo sapiens"
variation 260
/experiment="heterozygous SNP detection from direct
sequencing"
/note="putative SNP, T/C"
variation 342
/experiment="heterozygous SNP detection from direct
sequencing"
/note="putative SNP, T/C"
BASE COUNT 164 a 55 c 55 g 150 t
ORIGIN
1 gagccacatc aagatcacca tatttcagcg gatcatcagc tttctcccct actatagtat
61 cctcagataa agagaccatt gagattatag acctagctaa gaaagattta gagaagctga
121 aaagaaaaga aaagaagaag aaaaaaaggt agtgttataa aatcactttt atatgaatct
181 ctattcttct cacttttaag gatattttaa tttgcagttg caatgagtat atataacctc
241 atttttaaaa atgaaatgty atttaacata gattataatt atatattttt aaagatttat
301 ctcttcctta tatgtaaaaa attaaattaa ttacaataat ayggtagtga attttattta
361 gactatgata aggtaaaatt actattagaa attattttat tatttcttca actagctgcc
421 tttgca
//
G: Sequencing gap
G01) entry with unknown gaps
DEFINITION Homo sapiens HLA-A gene for MHC class I antigen, partial cds,
allele: HLA-A*2601V1.
FEATURES Location/Qualifiers
source 1..1022
/mol_type="genomic DNA"
/note="HLA-A*2601V1/2402,B*4001/4002,DRB1*0901/1401,
TBC191371"
/organism="Homo sapiens"
CDS join(<1..270,371..646,747..>1022)
/allele="HLA-A*2601V1"
/codon_start=3
/gene="HLA-A"
/product="MHC class I antigen"
/translation="--- omitted ---"
exon 1..270
/allele="HLA-A*2601V1"
/gene="HLA-A"
/number=2
gap 271..370
/estimated_length=unknown
exon 371..646
/allele="HLA-A*2601V1"
/gene="HLA-A"
/number=3
gap 647..746
/estimated_length=unknown
exon 747..1022
/allele="HLA-A*2601V1"
/gene="HLA-A"
/number=4
BASE COUNT 166 a 249 c 288 g 119 t
ORIGIN
1 gctcccactc catgaggtat ttctacacct ccgtgtcccg gcccggccgc ggggagcccc
61 gcttcatcgc cgtgggctac gtggacgaca cgcagttcgt gcggttcgac agcgacgccg
121 cgagccagag gatggagccg cgggcgccgt ggatagagca ggaggggccg gagtattggg
181 accggaacac acggaatgtg aaggcccact cacagactga ccgagcgaac ctggggaccc
241 tgcgcggcta ctacaaccag agcgaggacg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
301 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
361 nnnnnnnnnn gttctcacac cgtccagagg atgtatggct gcgacgtggg gccggacggg
421 cgcttcctcc gcgggtacca gcaggacgct tacgacggca aggattacat cgccctgaac
481 gaggacctgc gctcttggac cgcggcggac atggcggctc agatcaccca gcgcaagtgg
541 gagacggccc atgaggcgga gcagtggaga gcctacctgg agggccggtg cgtggagtgg
601 ctccgcagat acctggagaa cgggaaggag acgctgcagc gcacggnnnn nnnnnnnnnn
661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
721 nnnnnnnnnn nnnnnnnnnn nnnnnnacgc ccccaagacg catatgactc accacgctgt
781 ctctgaccat gaggccaccc tgaggtgctg ggccctgagc ttctaccctg cggagatcac
841 actgacctgg cagcgggatg gggaggacca gacccaggac acggagctcg tggagaccag
901 gcctgcaggg gatgggacct tccagaagtg ggcgtctgtg gtggtgcctt ctggacagga
961 gcagagatac acctgccatg tgcagcatga gggtctgccc aagcccctca ccctgagatg
1021 gg
//
