Following databases of RefSeq has been added to High Speed BLAST API in addition to DDBJ.

Furthermore, Reference proteins (sample program) has been added as a reference database of usual BLAST API.

Use case and execution time

It is possible to do a fast BLAST search against RefSeq databases as well as DDBJ. For example, the following case can be calculated only in a minute. It may change according to whether our server accepts a lot of jobs or not.

Case 1. Comparison between full length cDNA of human and multiple genomes
Sample program can be available here.
Condition:
  • Program: blastn
  • Reference database: Reference genomic sequences (2 million sequences, Total 140 billion bases)
  • Query: Full length cDNA of human gene ALDH2 (DDBJ Accession: BC002967, Sequence length: 2050 bp)
  • Parameter: -b 100 -v 100
Result:Human and a list of homologous species such as horse, dog, chimpanzee, orangutan and rhesus monkey can be obtained. Furthermore, chromosome number and the position on it of their genome or whole genome shotgun can also be obtained.
Case 2. Comparison between environmental DNA sequence and multiple genome
Sample program can be available here.
Condition:
  • Program: blastn
  • Reference database: Reference genomic sequences (2 million sequences, Total 140 billion bases)
  • Query: Environmental DNA sequence (DDBJ Accession: HQ188503, Sequence length: 508 bp)
  • Parameter: -b 100 -v 100 -W 28 -X 20 (equivalent to megablast)
Result: A list of the psychrophilic bacterial species of Shewanella genus can be obtained. Furthermore, the homologous region of their genomes can be also obtained. It is verified that those regions are 16S ribosomal RNA by checking annotations of genomes.Please refer to the page for details.