Research data refers to any factual information that is collected, observed, or generated through systematic investigation and analysis for the purpose of producing original research findings. This data can take various forms, including numerical data, textual data, images, audio recordings, or any other format relevant to the research inquiry. Research data serves as the foundation for scientific inquiry, enabling researchers to test hypotheses, validate theories, and draw conclusions. It may be gathered through primary data collection methods (such as surveys, experiments, or observations) or obtained from secondary sources (such as existing datasets, literature reviews, or archival records). Proper management, documentation, and sharing of research data are essential for transparency, reproducibility, and the advancement of knowledge within the scientific community.
Primary vs. Secondary Data
Primary data refers to information collected directly from its original source through methods such as surveys, interviews, experiments, or observations. This data is newly gathered for a specific research purpose and has not been previously collected or analyzed by others. It offers researchers the advantage of obtaining data tailored to their specific research questions but may require time and resources to collect.
Primary data can manifest in diverse forms, here are some examples below and these are things you create as the researcher.
Secondary data, on the other hand, refers to information that has already been collected, processed, and analyzed by others for purposes, but it still may be useful for your research project depending on the topic. This data can include sources such as government statistics, academic research papers, industry reports, or data shared through public repositories.
Below are links to websites where you can access secondary data for your project.
Social Sciences, Education, and Humanities
Natural, Environmental, and Biological Sciences:
Computer Science, Engineering, and Information Technology:
Business and Economics:
Health and Human Sciences
Understanding the Research Data Lifecycle:
The research data lifecycle delineates the stages encompassing the collection, recording, processing, publication, sharing, and preservation of research data.
Employing the research data lifecycle as a navigational compass in data management is invaluable. It not only fosters meticulous planning but also ensures comprehensive coverage across all facets of the research data journey, including creation, processing, analysis, preservation, sharing, and potential reuse. You can find a description of each phase below:
Planning Phase:
Data Collection:
Data Processing and Analysis:
Data Sharing and Preservation:
Publication and Dissemination:
Reuse and Repurposing:
Evaluation and Feedback:
The requirements for the information to be included in your Data Management Plan (DMP) can vary by funding agency and institution. The required length and structure of your plan will also differ. The best way to start is to check your funder’s specific expectations, which can be found on the agencies website. Following a comprehensive approach, will help ensure your DMP is thorough, compliant, and useful throughout and beyond the duration of your research project.
Online Resources
DMPTool: This free, interactive online tool assists researchers in creating data management plans. It provides step-by-step guidance, funder-specific templates, and sample data management plans.
DMPonline: DMPonline helps you to create, review, and share data management plans that meet institutional and funder requirements. It is provided by the Digital Curation Centre (DCC).
Public DMPs: Public DMPs are sample plans created using the DMPTool and shared publicly by their authors. Note that these plans are not vetted for quality, completeness, or adherence to funder guidelines.
Data Production and Collection:
Data Organization and Description:
Data Backup and Storage:
Access and Security Management:
Communication with Collaborators:
Data Sharing Plans:
Sensitive or Restricted Data:
Copyright and Intellectual Property Rights: