- Bitbucket -- A platform to store, manage, and share code using Git and Mercurial version control technologies. Alternative to GitHub. Interfaces with Atlassian products like JIRA.
- Colectica -- A software suite for integrating Data Documentation Initiative metadata standards into survey data, for open science.
- CRAN -- A place to find and share R packages
- CSV Fingerprints -- A tool that makes it easier to spot mistakes in CSV files.
- Dash -- Open source software that interfaces with repositories, allowing researchers to easily deposit and share data.
- Data analysis: R -- "A free software environment for statistical computing and graphics". Unix, Windows, MacOS
- Git for Scientists: a Tutorial -- "A tutorial is targeted at users who are new to version control systems, or just new to git"
- GitHub -- Software that makes it a lot easier to use Git. Also a social network for sharing open source code.
- iPython Notebook/Jupyter -- Code annotation tool for Python
- Open Refine -- A tool for working with messy data: cleaning it; transforming it from one format into another; extending it with web services; and linking it to databases like Freebase.
- PyPI: the Python package index -- A place to find and share Python packages
- Python -- A popular programming language used in data science.
- RMarkdown -- Code annotation tool for R.
- rOpenSci -- A non-profit devoted to creating R packages to be used in open science
- RStudio -- Desktop software to use when editing R packages
- SciPy.org -- "A Python-based ecosystem of open-source software for mathematics, science, and engineering." Includes the popular iPython and NumPy packages.
- Subversion (SVN) -- Version control software, a slightly less popular alternative to Git
- Tidy Data -- An R package for data cleaning/curation