Following the extension of the accession number format, INSDC members have assigned accession numbers with six-letter prefixes to bulk sequence data. To provide these sequence data, we will change the directory structure of our anonymous FTP site as follows. We appreciate your understanding and cooperation.

See also our previous change.

1) Structure of directories for WGS data

Since February 6th, 2020, we have expanded the directory structure as follows.


ddbj_database/wgs/WGS_ORGANISM_LIST.html
                 /WGS_ORGANISM_LIST.txt
                 /AA/AA/AAAABA.gz
                 /AA/AA/AAAABB.gz
                 /AA/AA/AAAABC.gz
                 /:
                 /AA/AAAA.gz
                 /AA/AAAB.gz
                 /AA/AAAC.gz 
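
As an illustration only, the layout above can be expressed as a small Python sketch. The function name wgs_relative_path is hypothetical and not part of any DDBJ tool; it assumes that prefixes consist of four or six ASCII letters and that paths are relative to ddbj_database/wgs/.

```python
def wgs_relative_path(prefix: str) -> str:
    """Return the path of a WGS file relative to ddbj_database/wgs/.

    Hypothetical helper: assumes a four- or six-letter accession prefix.
    """
    prefix = prefix.upper()
    if not (prefix.isascii() and prefix.isalpha()) or len(prefix) not in (4, 6):
        raise ValueError(f"expected a four- or six-letter prefix, got {prefix!r}")

    # Every file sits under a directory named after the first two letters.
    parts = [prefix[:2]]
    # Six-letter prefixes get one more level, named after the 3rd and 4th letters.
    if len(prefix) == 6:
        parts.append(prefix[2:4])
    parts.append(prefix + ".gz")
    return "/".join(parts)


if __name__ == "__main__":
    for example in ("AAAABA", "AAAA"):
        print(example, "->", wgs_relative_path(example))
```

With the prefixes from the listing, AAAABA maps to AA/AA/AAAABA.gz and AAAA maps to AA/AAAA.gz.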

2) Structure of directories for TSA, TLS, and TPA-WGS data

From May 20th, 2020, we will divide the files of TSA, TLS and TPA-WGS data into sub-directories as follows.

Target directories:


ftp://ftp.ddbj.nig.ac.jp/ddbj_database/tsa/
ftp://ftp.ddbj.nig.ac.jp/ddbj_database/tls/
ftp://ftp.ddbj.nig.ac.jp/ddbj_database/tpa/wgs/

Before:
All data files, named after the prefixes of their accession numbers, were placed directly under the following directories:

ddbj_database/tsa/TSA_ORGANISM_LIST.html
                 /TSA_ORGANISM_LIST.txt
                 /GAAA.gz
                 /GAAB.gz
                 /GAAC.gz

ddbj_database/tls/TLS_ORGANISM_LIST.html
                 /TLS_ORGANISM_LIST.txt
                 /KAAA.gz
                 /KAAB.gz
                 /KAAC.gz

ddbj_database/tpa/wgs/TPAWGS_ORGANISM_LIST.html
                     /TPAWGS_ORGANISM_LIST.txt
                     /DAAAAB.gz
                     /DAAAAC.gz
                     /DAAAAD.gz
                     /:
                     /DAAB.gz
                     /DAAF.gz
                     /DAAI.gz 

After:
The files will be divided into sub-directories named after the first two letters of the prefixes of their accession numbers.
For six-letter prefixes, the files will be placed one level deeper, in sub-directories named after the 3rd and 4th letters of the prefix, under the directory named after the first two letters.

ddbj_database/tsa/TSA_ORGANISM_LIST.html
                 /TSA_ORGANISM_LIST.txt
                 /GA/GAAA.gz
                 /GA/GAAB.gz
                 /GA/GAAC.gz

ddbj_database/tls/TLS_ORGANISM_LIST.html
                 /TLS_ORGANISM_LIST.txt
                 /KA/KAAA.gz
                 /KA/KAAB.gz
                 /KA/KAAC.gz

ddbj_database/tpa/wgs/TPAWGS_ORGANISM_LIST.html
                     /TPAWGS_ORGANISM_LIST.txt
                     /DA/AA/DAAAAB.gz
                     /DA/AA/DAAAAC.gz
                     /DA/AA/DAAAAD.gz
                     /:
                     /DA/DAAB.gz
                     /DA/DAAF.gz
                     /DA/DAAI.gz 

Please note that the file lists for TSA, TLS and TPA-WGS data will remain in the following directories.


ddbj_database/tsa/TSA_ORGANISM_LIST.html
                 /TSA_ORGANISM_LIST.txt 
ddbj_database/tls/TLS_ORGANISM_LIST.html
                 /TLS_ORGANISM_LIST.txt 
ddbj_database/tpa/wgs/TPAWGS_ORGANISM_LIST.html
                     /TPAWGS_ORGANISM_LIST.txt
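
For users who keep scripts with the old flat paths, the following Python sketch shows one possible way to compute the new location of a file. The function name relocated_path and the constant LIST_FILE_MARKER are hypothetical; the sketch assumes that data files are named after a four- or six-letter prefix and that the file lists can be recognised by "_ORGANISM_LIST." in their names.

```python
import posixpath

# Assumption: the file lists are recognised by this substring in their names.
LIST_FILE_MARKER = "_ORGANISM_LIST."


def relocated_path(old_path: str) -> str:
    """Map an old flat TSA/TLS/TPA-WGS path to its sub-directory location."""
    directory, filename = posixpath.split(old_path)

    # The file lists stay where they are.
    if LIST_FILE_MARKER in filename:
        return old_path

    # The part before the first dot is the accession prefix, e.g. GAAA or DAAAAB.
    prefix = filename.split(".", 1)[0]
    if not (prefix.isascii() and prefix.isalpha()) or len(prefix) not in (4, 6):
        raise ValueError(f"unexpected file name: {filename!r}")

    subdirs = [prefix[:2]]             # first two letters
    if len(prefix) == 6:
        subdirs.append(prefix[2:4])    # 3rd and 4th letters for six-letter prefixes
    return posixpath.join(directory, *subdirs, filename)


if __name__ == "__main__":
    for old in (
        "ddbj_database/tsa/GAAA.gz",
        "ddbj_database/tpa/wgs/DAAAAB.gz",
        "ddbj_database/tls/TLS_ORGANISM_LIST.txt",
    ):
        print(old, "->", relocated_path(old))
```

The sketch only rewrites path strings; it does not contact the FTP site or check that a file exists.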