When writing a Data Management Plan, you will be expected to answer a number of questions related to how you will share your data and / or make it easy to reuse.
Below are several examples of the types of questions you will be asked. Additional information about many of the topics is provided via links to our Best Data Handling Practices section.
What data will you be sharing and in what form?
How will you share the data?
Consider where, how, and to whom data with acknowledged long-term value should be made available.
Note: If you need help managing intellectual property
|Are any restrictions on data sharing required?
Outline any expected difficulties in sharing data with
acknowledged long-term value, along with causes and
possible measures to overcome these.
Restrictions may be due to confidentiality, lack of consent
agreements or IPR, for example.
Consider whether a non- disclosure agreement would
give sufficient protection for confidential data.
The Tri-Agency’s Research Data Management Policy States:
“Grant recipients are not required to share their data. However, the agencies expect researchers to provide appropriate access to the data where ethical, cultural, legal and commercial requirements allow, and in accordance with the FAIR principles and the standards of their disciplines. Whenever possible, these data, metadata and code should be linked to the publication with a persistent digital identifier”.
This statement does not mitigate the need to (1) create DMPs, (2) manage data appropriately and (3) preserve data.
It does indicates that where possible (I.e. appropriate access as allowed) data should be shared in a manner makes the most sense.
When deciding How to Share your research, an excellent place to start is Brandon University’s institutional instance of Dataverse. It:
Dataverse Does NOT accept content that contains confidential or sensitive information.
Dataverse at Brandon University Does Not accept file sets over 3GB. For massive file sets see FRDR.
Scholars at Brandon University:
FRDR provides many data services - like Dataverse. It does, however have one major advantage over our institutional instance of Dataverse: it can accommodate massive data sets. Our institutional instance of Dataverse is limited to 3 GB.
For additional information about using FRDR consult their Preparing Your Data site and view their Getting Started with FRDR: Upload your Dataset or Getting Started with FRDR: Download a Data Set videos on YouTube.
A variety of additional repositories that archive research data are available.
Researchers who wish to locate a specific repository are encouraged to start by using Re3data.org which provides a comprehensive listings of disciplinary and institutional repositories to host and share research data.
You can use Re3Data to browse for data repositories by:
Other places to find relevant repositories include a few publisher recommended lists such as
....as well as a few, reputable general data repositories:
Note: Funders or publishers may have specific recommendations around where you share your data. A good practice is to make use of our instance of dataverse to house / backup data - AS WELL AS sharing it via a subject repository recommended by a funder or publisher. The latter will make your data highly visible to researchers in a discipline. The former ensures the data is protected.
Datasets are increasingly being recognized as scholarly products in their own right, and as such, are now being submitted for standalone publication. In many cases, the greatest value of a dataset lies in sharing it, not necessarily in providing interpretation or analysis. For example, this paper presents a global database of the abundance, biomass, and nitrogen fixation rates of marine diazotrophs. This benchmark dataset, which will continue to evolve over time, is a valuable standalone research product that has intrinsic value. Under traditional publication models, this dataset would not be considered "publishable" because it doesn't present novel research or interpretation of results. Data papers facilitate the sharing of data in a standardized framework that provides value, impact, and recognition for authors. Data papers also provide much more thorough context and description than datasets that are simply deposited to a repository (which may have very minimal metadata requirements).
What is a data paper?
Data papers thoroughly describe datasets, and do not usually include any interpretation or discussion (an exception may be discussion of different methods to collect the data, e.g.). Some data papers are published in a distinct “Data Papers” section of a well-established journal (see this article in Ecology, for example). It is becoming more common, however, to see journals that exclusively focus on the publication of datasets. The purpose of a data journal is to provide quick access to high-quality datasets that are of broad interest to the scientific community. They are intended to facilitate reuse of the dataset, which increases its original value and impact, and speeds the pace of research by avoiding unintentional duplication of effort.
Are data papers peer-reviewed?
Data papers typically go through a peer review process in the same manner as articles, but being new to scientific practice, the quality and scope of the process is variable across publishers. A good example of a peer reviewed data journal is Earth System Science Data (ESSD). Their review guidelines are well described and aren't all that different from manuscript review guidelines that we are all already familiar with.
You might wonder, What is the difference between a 'data paper' and a 'regular article + dataset published in a public repository'? The answer to that isn’t always clear. Some data papers necessitate just as much preparation as, and are of equal quality to, ‘typical’ journal articles. Some data papers are brief, and only present enough metadata and descriptive content to make the dataset understandable and reusable. In most cases however, the datasets or databases presented in data papers include much more description than datasets deposited to a repository, even if those datasets were deposited to support a manuscript. Common practices and standards are evolving in the realm of data papers and data journals, but for now, they are the Wild West of data sharing.
Where do the data from data papers live?
Data preservation is a corollary of data papers, not their main purpose. Most data journals do not archive data in-house. Instead, they generally require that authors submit the dataset to a repository. These repositories archive the data, provide persistent access, and assign the dataset a unique identifier (DOI). Repositories do not always require that the dataset(s) be linked with a publication (data paper or ‘typical’ paper; Dryad does require one), but if you’re going to the trouble of submitting a dataset to a repository, consider exploring the option of publishing a data paper to support it.
How can I find data journals?
Additional information on data journals
Oregon State Libraries. Research Data Services. Data Paper & Data Journals. Accessed March 1st, 2021.
https://guides.library.oregonstate.edu/research-data-services/data-management-data-papers-journals This work is licensed under a Creative Commons Attribution NonCommercial 4.0 International License
A persistent identifier is a long-lasting reference to a digital resource. Unlike URLs, which may break, a persistent identifier reliably points to a digital entity.
It is important to have a persistent identifier for your data set(s) as they:
Before uploading your data into your preferred repository, make sure of the following:
1. You have addressed all confidentiality and privacy issues pertaining to the data you intend to share such as::
2. The data is in a useful format that others can easily make use of and that will remain useful over time.
3. You have created information about the data that will accompany the data in the repository. This should include:
4. You have addressed all Intellectual Property issues such as:
5. You have identified relevant permissions for accessing data in the repository if the data cannot be made openly available.
6. You have identified who to contact for permission to access data.
7. You have either assigned a DOI to your data - OR - have verified that the repository you are placing your data in will assign it a DOI.
Deciding whether you can share you data was recently made easier to understand with Portage’s Can I Share My Data Decision Tree. It will walk you through many of the issues discussed and help you make an informed decision.