In order to promote open access to research data, many funding agencies require that research data produced as part of a funded project be made publicly available and/or have instituted requirements for formal data management plans.
National Institutes of Health
From the NIH Website regarding public access:
"The NIH Public Access Policy implements Division G, Title II, Section 218 of PL 110-161 (Consolidated Appropriations Act, 2008). The law states:
The Director of the National Institutes of Health shall require that all investigators funded by the NIH submit or have submitted for them to the National Library of Medicine’s PubMed Central an electronic version of their final, peer-reviewed manuscripts upon acceptance for publication, to be made publicly available no later than 12 months after the official date of publication: Provided, That the NIH shall implement the public access policy in a manner consistent with copyright law."
To learn more about the policy visit http://publicaccess.nih.gov/policy.htm
National Science Foundation
From the NSF Website regarding Public Access:
"Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants. Grantees are expected to encourage and facilitate such sharing." See Award & Administration Guide (AAG) Chapter VI.D.4.
Regardless of whether research is funded by a public agency or not, data curation covers a range of important data management activities to ensure your data is preserved and available for future research.
“Data curation is the active and ongoing management of data through its lifecycle
of interest and usefulness to scholarship, science, and education. Data curation
enables data discovery and retrieval, maintains data quality, adds value, and
provides for re-use over time through activities including authentication, archiving,
management, preservation, and representation.”
-Graduate
School of Library and Information Science, University of Illinois at Urbana
Champaign
The digital curation lifecycle model developed by the Digital Curation Centre is one of the most widely used models and it covers the following curation actions:
A Data Curation Profile is a document about the origin of a dataset or a collection and its lifecycle within a research project. It describes the data generated and used in research that may be published, shared and preserved for future reuse and repurposing. The Data Curation Profile records requirements for specific data generated by a single scientist, scholar or research group based on their needs and requirements. It can be created by librarians, archivists, IT professionals and/or data managers through interviewing the researcher(s) and documenting the results.
The DCP Toolkit sponsored by Institute of Museum and Library Services (IMLS) can be used as a tool to conduct data curation interview.
"Good data management is the foundation for good research. Today, more and more publishers and funding agencies are requiring researchers to share their data. Having a data managment plan fullfills agency requirements and makes your data easier to share."
-Retrieved from http://www2.lib.virginia.edu/brown/data/whymanage.html
Benefits of data management include:
A formal data management plan may be required by your funding agency and different types of data and different data formats require different types of data management plans.
For information on getting started with data management, data management plan templates, and more visit UCF Libraries Data Management Guide
As stated in NSF’s “Information about the Data Management Plan Required for all Proposals” for Biological Sciences, the Federal government defines data (OMB Circular A-110) as: “…the recorded factual material commonly accepted in the scientific community as necessary to validate research findings.” This definition includes both original data (observations, measurements etc.) as well as metadata (e.g., experimental protocols, software code for statistical analysis etc.).
In the funder requirement document, it normally includes metadata related sections and questions. Funder requirements for NSF, GBMF, IMLS, NEH, NIH and NOAA can be found at https://dmp.cdlib.org/pages/funder_requirements
The NSF Grant Proposal Guide recommends the inclusion of a “data management plan” that explains how your proposal will comply with NSF’s data sharing policies. The description of your data management plan may include “the standards to be used for data and metadata format and content.” See NSF's Grant Proposal Guide for more information.
For researchers, there are common metadata standards and domain specific metadata standards to consider before you start to document and manage your data. Some of the common metadata standards are: Dublin Core (DC) for general e-resources, Directory Interchange Format (DIF) for scientific datasets, Data Documentation Initiative (DDI) for social sciences data and Text Encoding Initiative (TEI) for humanities, social sciences and linguistics data. Domain specific metadata standards are available for different disciplines such as agriculture, astronomy, biological and ecological sciences, e-commerce, engineering, music and arts. At the same time, there are some general fields across all domains you may consider so that you will be able to back up your data in an institutional data repository and most digital repositories. Please refer to Basic Metadata Fields for detailed information.
Metadata is data about data, data associated with an object, a document, or a dataset for purposes of description, administration, technical functionality and preservation.
There are three important aspects about metadata: framework, encoding and content. Framework is the general structure of the documents, encoding refers to syntactic scheme to write down the content, and content is specific information conveyed by the metadata.
There are three main types of metadata: descriptive, structural and administrative metadata. Please refer to Metadata Introduction in the library’s Metadata LibGuide for details.
Common Metadata Standards can meet your basic needs and at the same time allow greater interoperability for data harvesting and sharing due to their generic nature. Domain Specific Metadata Standards are designed for different disciplines and allow more granular presentations of your data. You can choose a common or specific metadata standard to document, archive and preserve your data based on your needs, your discipline and the funder requirements. In either case, local fields can be added to address your unique needs.
Depositing your data in a repository will facilitate its discovery and preservation. There are several discipline specific data repositories available but not all repositories will ensure long-term preservation. Contact the individual repository for more details.
You can search for an appropriate repository using the following resources:
Last updated February 27, 2013 2:37:55 PM