Adapted from GenBank statistics.

Data Repositories

Data repositories store user-submitted experimental data. Data may be submitted prior to publication so that reviewers can examine the data anonymously, and after publication the data are made publically available. Data can be witheld from the public for a short period to give submitters time to publish additional analyses of the data. Contributors must format their data according to repository guidelines.

  1. Examples of repositories with limited to no curation.
    1. NCBI GenBank - nucleotide sequence database with supporting bibliographic and biological annotation (Benson et al. 2005). GenBank is part of the International Nucleotide Sequence Database Collaboration (INSDC).
    2. Proteomics Identifications Database (PRIDE) - stores, disseminates and analyzes mass spectrometry proteomics data (Vizcaino et al. 2010).
    3. NCBI Sequence Read Archive (SRA) - repository for next generation sequencing data; included in the INSDC.
  2. Curated repositories
    1. International Molecular Exchange Consortium (IMEx) - network of molecular interaction databases that exchange data following the Human Proteome Organization Proteomics Standards Initiative (Orchard et al. 2012 ).