MetaboBank
Submission to MetaboBank
Submission flow
MetaboBank data are linked to BioProject and BioSample data so that the data are linked with related data such as sequencing and gene expression data in different databases. Metadata are prepared in the Excel format (IDF and SDRF in the MAGE-TAB style) for each submission type. Processed data (identified metabolites) are prepared as Metabolite assignment file(s) (MAF) with pre-defined fields.
1. Create submission account
- Create a D-way submission account (Manual)
- Register a public key to your account to enable data upload to MetaboBank.
2. Apply submission
Apply your submission from the MetaboBank submission form. MetaboBank team will inform you how to submit your data. If you cannot access the form, download the submission file, and send it to by email.
3. Register BioProject
Register your project to BioProject. An accession number with prefix PRJDB will be assigned to your project.
4. Register BioSample
Register your samples to BioSample by using the Omics package. For an organism name of non-living materials, choose an appropriate name from the NCBI Taxonomy metagenomes.
- Blank as blank sample
- Food as food metagenome
-
For others, see FAQ: Which organism name should I use for non-living sample?
Accession numbers with prefix SAMD will be assigned to samples.
5. Create metadata
Download a metadata excel for the experiment type you want to submit. The excel consists of two sheets, IDF and SDRF. Different experiments are prepared as separate studies, grouped by a BioProject. For example, group MTBKS1000 (LC-MS) and MTBKS1001 (GC-MS) by a BioProject PRJDB100.
- IDF (Investigation Description Format): study overview, experimental design, protocol, publication and submitter information.
- SDRF (Sample and Data Relationship Format): sample characteristics and relationship between sample, measurement instrument and data file.
6. Prepare data files
Prepare the following data files. Identified metabolites are prepared as Metabolite assignment file(s) (MAF). MAF registration is not mandatory but strongly recommended for data reuse.
Provide a list of filename and its md5 checksum for checking file integrity. Upload the output of md5sum command as a file with the extension “.md5” (e.g., my-study.md5). Alternatively, you may provide the md5 values in the columns of SDRF as Comment[Raw Data File md5], Comment[Processed Data File md5] and Comment[Metabolite Assignment File md5].
7. Upload files
Upload metadata and data files (raw, processed data files and MAF) to our file server by scp/sftp. Group all files for each study by using either tar or zip. Upload your files under the metabobank directory.
Regarding how to upload your data files, please see “Data upload”.
Accession number
MetaboBank assigns an accession number to a study (n is a serial number). Cite this accession number in your publication.
MTBKSn (e.g., MTBKS1)
Data release
The data can remain private until your paper publication. Submitters can designate the hold date within three years and extend the date. The private data are released according to the data release policy. Please note that public MetaboBank data are not exchanged with the EBI MetaboLights.
FAQ: How to request data release?
Reviewer access
If you want to provide your private data to journal reviewers via password-protected site, please contact the MetaboBank team.
Update
To update your data, please contact the MetaboBank team.
MD5 checksum value
See “MD5 checksum value” for how to obtain MD5 checksum values.