Checklist items
| Digital Data | Software | Samples |
| Curricular Materials | Physical Collections | Other |
Observational data captured around the time of the event
Examples: Sensor readings, telemetry, survey results, neuroimages
Usually irreplaceable
Experimental data from lab equipment
Examples: gene sequences, chromatograms, toroid magnetic field readings
Often reproducible, but can be lengthy and expensive
Simulation data generated from test models
Examples: climate models, economic models
Models and metadata (inputs) more important than output data.
Reproducible, but possibly expensive
Examples: text and data mining, compiled database, 3D models
Reproducible, but possibly expensive
Samples and other non-digital data forms
Samples, physical collections, notebooks
All may be considered data for the purposes of presenting a dmp
How are you generating the data?
| Examples | Software | Equipment |
| Drilling | Imaging |
| Text | e.g. ASCII, Word, PDF |
| Numerical | e.g. ASCII, SAS, Stata, Excel, netCDF, HDF |
| Database | e.g. MySQL, MS Access, Oracle |
| Multimedia | e.g. JPEG, TIFF, Dicom, MPEG, Quicktime |
| Models | e.g. 3D VRML, X3D |
| Software | e.g. Java, C |
| Domain Specific | e.g. FITS in Astronomy, CIF in Chemistry |
| Vendor Specific | e.g. Varian NMR data format, LeCroy digital oscilloscope format. |
Are you using a sustainable digital format - one that is compatible, for the foreseeable future, with software needed to open and read it?
Will these file types be long-lived?
Are there tools or software you will need to process or view the data that need to be archived along with the data?
How fast will the data be growing?
Where will the data be stored?
Personal computer
Cloud storage
Lab server
ThayerFS
Webserver
rSTor
What about backups (or copies of) your data?
Frequency - How often?
Location(s) of backups or file copies - Office, building, off-site
What kind of system or software - College backup (NetBackup), Retrospect, Online: Mozy or Carbonite
Testing procedures - will you test the restore process to make sure backups are working correctly.
Raw data -> Cleaned data -> Processed data -> Summary Level data -> Publication data
Metadata. Information about the data.
What will you keep?
How long will you keep the data?
What are the procedures envisioned for long-term archiving and preservation of the data, including succession plans for the data should the expected archiving entity go out of existence.
Retention?
Destruction?
How will you document your data?
Is there good project and data documentation?
What directory and file naming convention will be used?
Will you be using versioning controls?
Is there an ontology or other community standard for data sharing/integration?
Who will assign the metadata?
Are there any data quality issues?
How will the (technical) quality of the data be assured?
How will adherence to this data management plan will be checked or demonstrated?
Audience: Who are the potential secondary users of the data?
How will data be made available for public use and secondary uses?
What are my options for sharing?
Self-dissemination
Discipline based repositories
Institutional repositories
Websites - www.dartmouth.edu account, departmental server, hosted server space
Cloud (Amazon, RackShare, Google, etc)
Restricted use collections
Are there any embargo periods?
What kind of rights will be granted to different user groups?
Who will decide on access to the data?
Include explanations about how data may be re-used and how the source of the data should be acknowledged
What are the plans for preserving data in accessible form?
Any sharing requirements? e.g. funder data sharing policy
If there are partnerships, how will data be shared and managed with partners?
Protected personal information: medical (HIPPA), student information (FERPA)?, other?
National security?
Patent related
Other confidentiality concerns
Informed consent
Will you have physical restrictions, like firewalls or off-network devices?
Will you or the institution have policies that restrict access or enforce security measures?
Who will make decisions about data security?
How the data management plan will maximize the value of the data?
IMPACT: What is the possible impact of the data within the immediate field, in other fields, and any broader, societal impact?
What about transfer of people or data?

