Sharing your data and making it open available is easy. You can do any of the following:
- Deposit in an appropriate data repository or archive (see list below) (ie. Subject-based)
- Deposit to an institutional repository, like the UMN Digital Conservancy.
- Post online via a project or institutional web site, for example Cedar Creek Ecosystem Science Reserve.
- Submit data to a journal that support data supplements.
- Cite your data in your work so others can find and access it (via a permanent URL or DOI).
Search Data Repositories
DataBib is a tool that searches across hundreds of data repositories available for data deposit. This vetted source helps answers questions such as who can deposit and access the data.
Popular Data Repositories
- EarthStat - hosts global agricultural data for 175 crops, globally, among other data sets. This is the product of a collaboration between the U of MN's Institute on the Environment and McGill University.
- USDA Economic Research Service Data Sets - wide range of data from food safety to farm economy.
- Digital Archaeological Record - tDAR is an international digital archive and repository that houses data about archaeological investigations, research, resources, and scholarship. tDAR provides researchers new avenues to discover and integrate information relevant to topics they are studying.
- Open Context - Use Open Context to discover, reference, and publish primary data and documentation collected in archaeology and other field sciences by professional researchers.
- HEASARC - NASA's High Energy Astrophysics Science Archive Research Center
- Infrared Science Archive - NASA's science and data center for infrared astronomy
- Extragalactic Database - NASA's archive of data for over 3 million extragalactic objects
- National Virtual Observatory - Astronomical data from ground and space-based telescopes. Includes data analysis tools
- National Space Science Data Center - Archive for NASA space mission data
- Sloan Digital Sky Survey - Download optical images of the sky. See also, SkyServer for educational portal to the data.
Biological and Life Sciences
- DigiMorph - Digital Morphology library is a dynamic archive of information on 3D scans, animations, and high-resolution X-ray computed tomography of biological specimens.
- Dryad - Dryad is an international repository of data underlying peer-reviewed articles in the basic and applied biosciences. See their data submission page for instructions.
- PLEXdb - Gene expression data for plants and plant pathogens. It contains smaller databases for specific plants (e.g., BarleyBase) as well as a variety of related tools.
- Protein DataBank - Experimentally determined structures for macromolecules (protein and nucleic acids). The site includes search and visualization tools
- The Cell: An Image Library - Images of all cell types from all organisms, including intracellular structures and movies or animations demonstrating functions. This project relies upon the cell biology community to populate the library.
- UniProt - Free protein sequences
- Cambridge Structural Database - small molecule crystal structures
- eCrystals - x-ray crystallographic data
- PubChem - NCBI's repository of bioactivy/bioassay data and information for "small" molecules (i.e. not macromolecular). Both text-based and structure-based search tools are provided
Computer Science and Source Code
- CodePlex - provided by Microsoft
- GitHub - Hosts developer libraries such as Ruby on Rails, IronRuby, jQuery, Perl
- Google Code Project hosting - open APIs and Google projects like Google Gears, Android, Chromium.
- Launchpad - includes projects such as Ubuntu, MySQL (code hosting)
- SourceForge - the most popular open source code hosting facility according to this Wikipedia comparison
Earth, Environmental and Geosciences
- Goddard Earth Sciences Data and Information Services Center
- IRI/LDEO Climate Data Library - Climate-related datasets from the International Research Institute for Climate and Society at Columbia University
- Marine Geoscience Data System (MGDS) - A data portal, hosted at the Lamont-Doherty Earth Observatory
(Columbia University), for a number of NSF-supported marine research programs
- National Center for Atmospheric Research Computational & Information Systems Library
- National Climatic Data Center (NCDC) - Meteorology and paleoclimatology
- NCAR/UCAR Community Data Portal - Climate and weather datasets and visualization software from the National Center for Atmospheric Research and the University Corporation for Atmospheric Research
- National Oceanographic Data Center (NODC) - World-wide marine environmental and ecosystem data
- National Snow and Ice Data Center (NSIDC) - NSIDC offers hundreds of scientific data sets for research, focusing on the cryosphere and its interactions. Data are from satellites and field observations. All data are free of charge.
- Network for Earthquake Engineering Simulation (NEES) - shared national network of 14 experimental facilities, collaborative tools, a centralized data repository, and earthquake simulation software.
- USGS National Satellite Land Remote Sensing Data Archive - Note that some data access is fee-based
- GEON - Portal for datasets and visualization tools
GIS and Geography
- Geodata.gov - One-stop for federal, state and local geographic data
- GeoCommons.com GIS file repository and finding tool
- Federal Geographic Data Committee - Provides access to the National Spatial Data Infrastructure (NSDI) Clearing House Network and the geodata.gov portal
- National Geographic Data Center - Archive of datasets
Health and Medical Sciences
- Biological Magnetic Resonance DataBank - MRI data
- National Center for Biotechnology Information (NCBI) - Numerous databases with a genomic/proteomic focus
- Neuroimaging Informatics Tools and Resources Clearinghouse (NITRC)
- Neuroscience Information Framework - A virtual community of data, materials, and web-based neuroscience resources with the goal of enabling discovery and access to public research data and tools worldwide through an open source, networked environment.
- The PhysioBank archives of PhysioNet
- HEP Data - high-energy physics reaction database of Numerical HEP scattering cross sections
- NIST Physical Standards Laboratory - physical reference data and property tables
- National Nuclear Data Center - includes nuclear structure, reaction and decay databases
- ICPSR, the Inter-University Consortium for Political and Social Research is a non-profit, membership-based data archive located at the University of Michigan. The University of Minnesota - Twin Cities (UM-TC) membership allows students, staff, and faculty to access ICPSR data files and documentation for the purpose of academic research. The data available through ICPSR are raw, unanalyzed data which require statistical analysis. Individuals who are prepared to analyze raw data can download data themselves from the ICPSR web site. All other users should contact the Data Services Librarian for assistance.
- Social Sciences Data - Library finding aid
- Guide to Social Science Data - at the University of Minnesota (pdf)