Sample data for each dataset

till.hoffmann · 15 January 2021 10:39

What do you want to be able to do?

Access a data sample for each dataset in the catalogue. These data need not be (and shouldn’t be) real data, but they should conform to the schema that the real dataset has.

Why is this important?

This would allow researchers to get a good understanding of the data they will have access to before going through the data access request process.

Any suggestions for how we could solve this?

The attributes associated with different records could be permuted and a small subset be made available for variables that are not particularly sensitive. This randomised dataset would have minimal privacy risks (although they should be evaluated on a case by case basis, of course).

tanika.patel · 20 January 2021 14:36

Thanks for the feedback! We have added your suggestion to our backlog.

You can track the progress of your suggestion here

Read more about what will happen next https://www.notion.so/Feature-requests-and-feedback-d26dd1c0b14c40a985982c1269d7aeca