Escience Tools

Data Tools

  • Bitbucket -- A platform to store, manage, and share code using Git and Mercurial version control technologies. Alternative to GitHub. Interfaces with Atlassian products like JIRA.
  • Colectica -- A software suite for integrating Data Documentation Initiative metadata standards into survey data, for open science.
  • CRAN -- A place to find and share R packages
  • CSV Fingerprints -- A tool that makes it easier to spot mistakes in CSV files.
  • Dash -- Open source software that interfaces with repositories, allowing researchers to easily deposit and share data.
  • Data analysis: R -- "A free software environment for statistical computing and graphics". Unix, Windows, MacOS
  • Git for Scientists: a Tutorial -- "A tutorial is targeted at users who are new to version control systems, or just new to git"
  • GitHub -- Software that makes it a lot easier to use Git. Also a social network for sharing open source code.
  • iPython Notebook/Jupyter -- Code annotation tool for Python
  • Open Refine -- A tool for working with messy data: cleaning it; transforming it from one format into another; extending it with web services; and linking it to databases like Freebase.
  • PyPI: the Python package index -- A place to find and share Python packages
  • Python -- A popular programming language used in data science.
  • RMarkdown -- Code annotation tool for R.
  • rOpenSci -- A non-profit devoted to creating R packages to be used in open science
  • RStudio -- Desktop software to use when editing R packages
  • SciPy.org -- "A Python-based ecosystem of open-source software for mathematics, science, and engineering." Includes the popular iPython and NumPy packages.
  • Subversion (SVN) -- Version control software, a slightly less popular alternative to Git
  • Tidy Data -- An R package for data cleaning/curation