This document provides guidance on organizing and documenting datasets that contain synthetic data, to simplify their publication in a research data repository. Unlike datasets collected from the "real world", synthetic data often require additional details to facilitate reproduction and reuse. The document summarizes the essential information you should provide when sharing synthetic data so that others can easily understand and efficiently reuse them. Synthetic data based on personal data must in many cases be handled differently, and a section specifically addressing synthetic personal data is included.
Funding
Swedish Research Council: Swedish National Data Service (SND) 2021-00165_VR
Linköping University: Verifiering för nyttiggörande (VFN)