Sharing routes for individual-level research data and code |
|
Coordinator 1 | Dr Aida Sanchez-Galvez (Centre for Longitudinal Studies, UCL Social Research Institute ) |
Coordinator 2 | Ms Cristina Magder (UK Data Service, UK Data Archive, University of Essex) |
Sharing individual-level research data is a core activity of many research projects, which often collect and manage a variety of data types, such as survey data, biomarkers, complex sensor data, genomics, linked administrative data, and geographical data. Some researchers are also generating synthetic data for teaching purposes and preliminary development of analysis code. Each of these data types presents its own challenges when it comes to sharing and dissemination for future research purposes. Additionally, the sharing of programming code is fundamental in ensuring reproducibility and transparency.
Data releases are internally managed by the studies themselves, and/or externally by national archives or Trusted Research Environments. Balancing the wide sharing of detailed research data with the need to maintain confidentiality and security, while also ensuring easy and swift access without significant barriers or delays, is a complex challenge. This balance becomes further challenging when dealing with sensitive and/or potentially disclosive data. Data that fall under the GDPR definition of “special category data” require additional protection and a higher degree of security and governance measures often involving Data Access Committees oversight and dedicated legal and sharing frameworks.
The aim of this session is to provide a platform for colleagues to discuss their experience and approaches to sharing individual-level research data, sensitive and non-sensitive, original or synthetic. Participants are encouraged to share their techniques to assess and manage disclosure risk, and best practices and challenges of code sharing. We invite colleagues to submit ideas relating to, but not restricted to:
- Sharing routes for individual-level research data
- Publication of programming code or syntax
- Management and sharing of synthetic data
- Methods of risk assessment of disclosivity and sensitivity
- Research data classification or data tiers
- Technical tools used to generate bespoke datasets
- Data access via Trusted Research Environments
- International data sharing