Data Repositories

Selection and Appraisal of Research Data

Repository Software

  • DSpace – DSpace is open source software for building open digital repositories. It preserves and enables access to all types of digital content, including text, images, and data sets.
  • DuraCloud – A hosted service and open technology developed by DuraSpace, DuraCloud leverages existing cloud infrastructure to keep digital content durable and accessible. It focuses particularly on preservation support and access services for academic libraries and academic research centers.
  • DuraSpace Blog – News and information from the Fedora Repository and DSpace communities.
  • DuraSpace.org web seminars – Links to slides and content from DuraSpace-hosted web seminars on repository systems and solutions.
  • Fedora – Fedora (Flexible Extensible Digital Object Repository Architecture) defines a set of abstractions for expressing digital objects, asserting relationships among digital objects, and linking "behaviors" (i.e., services) to digital objects.
  • Fedora-A Repository for the Future (2013) – Beginning in 2013, stakeholders in the Fedora community initiated Fedora Futures, a three-year overhaul of Fedora to improve scalability, data management support, and storage flexibility, among other features requested by users.
  • Islandora/Fedora Repository Software Survey (2010) – A survey by the Repositories Software Project that provides information on the functions, operability, metadata, and support of the Islandora and Fedora repository software.

Science Data Repositories

Compiled Lists of Research Data Repositories

Astronomy Repositories

Biosciences Repositories

  • Dryad – Dryad is an international repository of data underlying peer-reviewed articles in the basic and applied biosciences.

Genetics Repositories

  • BioGRID – The Biological General Repository for Interaction Datasets (BioGRID) is a public database that archives and disseminates genetic and protein interaction data from model organisms and humans.
  • Mouse Genome Informatics – An international database resource for the laboratory mouse, developed at the Jackson Laboratory to facilitate a better understanding of human health and disease.

Environmental Science/Geosciences Repositories

  • Data.gov – An initiative to increase public access to machine-readable datasets generated by the Executive Branch of the Federal Government. Data.gov provides metadata and access information for datasets that are useful in diverse contexts, and it includes a Geodata catalog.
  • ESA: Ecological Society of America Data Registry – A publicly accessible registry describing scientific data sets on ecology and the environment. Its data sets are associated with articles published in the journals of the Ecological Society of America.
  • Pangaea – An OAI-PMH-compliant repository for georeferenced data from earth system research. Sample data sets include oceanographic observations and sea ice physics.
  • RealClimate – A directory of climate data repositories and codes.

Last updated: Jun 2, 2014 11:03am