Thank you for visiting the “eScience Portal for Librarians.” The “eScience Portal” is no longer being maintained by the University of Massachusetts. This regional resource has been adapted by the National Network of Libraries of Medicine, and is sustained by the network of regional medical libraries across the country. Please visit for up-to-date data services and resources supported and vetted by the National Libraries of Medicine. We look forward to your continued involvement in the programming in the New England Region and beyond. If you have questions, please contact

eScience Tools

Data Tools

  • Bitbucket -- A platform to store, manage, and share code using Git and Mercurial version control technologies. Alternative to GitHub. Interfaces with Atlassian products like JIRA.
  • Colectica -- A software suite for integrating Data Documentation Initiative metadata standards into survey data, for open science.
  • CRAN -- A place to find and share R packages
  • CSV Fingerprints -- A tool that makes it easier to spot mistakes in CSV files.
  • Dash -- Open source software that interfaces with repositories, allowing researchers to easily deposit and share data.
  • Data analysis: R -- "A free software environment for statistical computing and graphics". Runs on Unix, Windows, and macOS.
  • Git for Scientists: a Tutorial -- "A tutorial is targeted at users who are new to version control systems, or just new to git"
  • GitHub -- A web-based hosting platform for Git repositories that makes Git easier to use. Also a social network for sharing open source code.
  • IPython Notebook/Jupyter -- A notebook tool for combining Python code, its output, and annotations in one document.
  • OpenRefine -- A tool for working with messy data: cleaning it; transforming it from one format into another; extending it with web services; and linking it to databases like Freebase.
  • PyPI: the Python package index -- A place to find and share Python packages
  • Python -- A popular programming language used in data science.
  • RMarkdown -- Code annotation tool for R.
  • rOpenSci -- A non-profit devoted to creating R packages to be used in open science
  • RStudio -- Desktop software for writing and running R code and for developing R packages.
  • SciPy -- "A Python-based ecosystem of open-source software for mathematics, science, and engineering." Includes the popular IPython and NumPy packages.
  • Subversion (SVN) -- Version control software, a slightly less popular alternative to Git
  • Tidy Data -- An R package for data cleaning/curation