Reference notes
Coverage
NIH DMS Policy · 11 funder mandates · 14 repositories
Sources
NIH DMS Policy + publisher guidelines
Last reviewed
February 2026
Prepared by the Manusights editorial team.
Compliance-and-repository guide
Data Sharing Requirements for Biomedical Research
Data sharing in biomedicine went from optional to expected to mandatory over the last decade. The NIH Data Management and Sharing Policy, which took effect January 25, 2023, now requires every NIH-funded researcher to submit a data management plan, and actually share their data.
This guide covers what the NIH policy requires, what major journals ask for in data availability statements, where to deposit different types of biomedical data, and what FAIR principles mean in practice.
Quick orientation
Use this page when the manuscript is getting close to submission and the data-sharing plan needs to be concrete, not aspirational.
This guide helps translate policy language into operational choices: what the NIH expects, what journals usually require in the statement itself, which repository fits the data type, and what “FAIR” means at the moment of deposit.
Best used with
Open access guide
Use it when funder policy is also shaping the publication route and APC decision.
Author rights guide
Pair repository planning with the rights and embargo terms that govern manuscript sharing.
Reporting guidelines
Move there when the data plan also depends on protocol, checklist, or transparency reporting requirements.
The NIH Data Management and Sharing Policy (2023)
The NIH DMS Policy applies to all NIH-funded research that generates scientific data: including grants, contracts, and intramural research. It applies to all applications and proposals submitted on or after January 25, 2023.
The policy requires:
- 1.Submit a Data Management and Sharing (DMS) Plan with your grant application
- 2.Data must be deposited in an established repository as soon as possible: no later than publication or end of award
- 3.Data Management costs are allowable as direct costs (up to ~$30,000/year without special justification)
- 4.Cite the shared dataset in any resulting publications
Other Major Funder Data Sharing Mandates
Wellcome Trust
Since 2017 (updated 2021)Strong data sharing mandate. Requires a data management plan, data sharing in an appropriate repository, and a data availability statement in all publications. All publications must be OA (CC BY), and the underlying data must be available.
UKRI (all councils)
Since 2015 (RCUK policy, now UKRI)Research outputs including data must be made available as openly as possible. Data underlying publications must be deposited in an appropriate repository. Applies to BBSRC, MRC, ESRC, EPSRC, and other UKRI councils.
Gates Foundation
Since 2017One of the strictest. It requires CC BY for all publications and immediate open access to underlying data. Data sharing plan required with grant application.
European Research Council (ERC)
Since 2017 (expanded Horizon Europe)Open Research Data pilot is now the default for all funded projects. Requires a Data Management Plan and deposition in a trusted repository where possible.
FAIR Principles in Practice
FAIR data principles (Findable, Accessible, Interoperable, Reusable) are now the standard framework for data sharing. NIH, Wellcome, UKRI, and most major journals reference FAIR compliance as the goal. Here's what each means operationally:
Findable
Deposit in a repository that assigns a persistent identifier (DOI or accession number). Include rich metadata so search engines can index it. Don't just put data on a lab website that disappears.
Accessible
Data should be retrievable via open protocols. For controlled-access data (e.g., dbGaP), the access procedure itself must be publicly documented. The process for requesting access counts as 'accessible' even if the data aren't open.
Interoperable
Use standard file formats (CSV over Excel, FASTQ over proprietary formats, TSV over custom delimiters). Include a data dictionary. Make it so someone without your specific software can use the data.
Reusable
Attach a license (CC BY for open data). Include enough provenance (how the data were collected, processed, and what quality checks were applied) so that someone else can reproduce your decisions.
Practical note
Three data-sharing mistakes that create late-stage submission problems
Most data-sharing problems surface late, when the manuscript is already in editorial review. Run a pre-submission check on your data-availability statement to catch a missing repository, accession, or FAIR gap before an editor does.
References
- National Institutes of Health. (2023). NIH Data Management and Sharing Policy. U.S. Department of Health and Human Services. [sharing.nih.gov ↗]
- Wilkinson MD, Dumontier M, Aalbersberg IJ, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3:160018. [doi.org/10.1038/sdata.2016.18 ↗]
- UK Research and Innovation (UKRI). Research data management and sharing policy. Retrieved February 2026. [ukri.org ↗]
- Zenodo. About Zenodo: open repository for research. CERN. Retrieved February 2026. [zenodo.org ↗]
- Figshare. About figshare: open data repository. Digital Science. Retrieved February 2026. [figshare.com ↗]
- Dryad. About Dryad: open data repository. Dryad Digital Repository. Retrieved February 2026. [datadryad.org ↗]
Ready to apply this to a real draft?
Move from reference guidance to a manuscript-specific check
Use the public submission-readiness path when you already have a manuscript and need a draft-specific signal, not just a general guide.
Best for researchers who want a fast readiness read before deciding whether to revise, retarget, or submit.
Related guides in this collection
Open Access Guide
Use this when repository and data-sharing decisions also depend on funder OA mandates or APC planning.
Author Rights Guide
Pair data-sharing planning with the rights and embargo rules that govern how manuscripts and accepted versions can be shared.
Reference Library
Return to the broader reference library for submission specs, timelines, and checklists.
Frequently Asked Questions
Are data sharing requirements mandatory for all research types?
Requirements vary by journal, publisher, and research type. Nature Portfolio journals, Cell Press, PLOS journals, and BMJ require data sharing for all original research by default, with exceptions for privacy-restricted data. Most journals with data sharing policies require a Data Availability Statement in the manuscript regardless of whether data can be fully shared. If data cannot be shared, you must explain why - privacy, legal restrictions, or third-party ownership are all accepted reasons at most journals.
Where should I deposit research data before submitting a manuscript?
The right repository depends on your data type. Genomics data goes to NCBI (GEO for gene expression, SRA for raw sequencing, dbGaP for controlled human genomics data). Protein structures go to the RCSB Protein Data Bank. Clinical trial data goes to ClinicalTrials.gov. For general research data without a domain-specific repository, Zenodo, Figshare, or Dryad are widely accepted. Most journals provide a list of recommended repositories in their data sharing policy.
What should a Data Availability Statement include?
A Data Availability Statement should specify: (1) where the data can be accessed (repository name and URL or DOI), (2) any access restrictions and the reason for them, and (3) the accession number or identifier for the deposited dataset. If all data are in the manuscript itself (in figures and tables), state that explicitly. If data cannot be shared due to privacy or ethical restrictions, state the restriction and whether data can be requested from the corresponding author. Many journals provide a template - use it, since reviewers and editors check the statement during review.