The NNLM data thesaurus defines a data repository as a: "place that holds data, makes data available to use, and organizes data in a logical manner."
The NIH has guidance for selecting a repository for data resulting from NIH-supported research.
Directories to Locate Repositories
Although not comprehensive, these directories are a good place to start searching for a data repository.
re3dataRegistry of Research Data Repositories. Searchable interface but can also browse by subject, content type and country.
FAIRsharing.orgThe collections link can be used to find both domain and generalist repositories as well as those recommended by specific journals, funders, and organizations.
Repository FinderThis resource can limit to the repositories in re3data that meet the criteria of the Enabling FAIR Data Project and FAIRsFAIR Project.
Data PortalsA browsable list of open data portals from around the world (not searchable).
Generalist Repositories
These repositories host data regardless of type, format, content, or subject matter. Note: the below is not an exhaustive list; see the generalist repositories listed on the NIH page for additional suggestions.
This comparison chart may be helpful when considering more than one repository.
DRYADNonprofit data repository. Dryad has a team of curators who check every submission to ensure the validity of files and metadata. A data publishing charge of $120 may apply (additional fees may apply to submissions in excess of 50GB). There is a limit of 300GB per data publication uploaded through the web interface (larger submissions are accepted but require technical assistance).
figshareFree account allows upload of files up to 5GB space and 20 GB of free private space. Blinded links can be created for peer review.
NIH figshare ArchiveThis figshare instance was a yearlong pilot project that ended in 2020, but the uploaded research is still searchable.
Harvard DataverseAll researchers from any discipline, both inside and out of the Harvard Community, can deposit files of up to 2.5GB, and store up to 1TB of data.
Mendeley DataPosted datasets are currently moderated to ensure the content constitutes research data, is scientific in nature, and doesn’t solely contain a previously published research article. Personal accounts have a maximum limit of 10 GB per dataset.
ZenodoDeveloped by CERN under the EU FP7 project OpenAIREplus. Currently accept up to 50GB per dataset and users may deposit restricted files with the ability to share access with others if certain requirements are met.
Domain-Specific Repositories
This is a selective list of domain-specific resources, including genomics and clinical. To locate more, please use the directories listed above or reach out to a librarian.
NIH-supported domain-specific data repositoriesRepositories in this list include both those funded by NIH and those with no NIH funding. Filters are available to limit repositories by such properties as ICO and access (controlled, open, registered).