Community forum
This is an open space to discuss health research topics, give feedback on Gateway functionality, and comment on resources such as datasets. Everyone is welcome to join existing discussions or start a new topic.

Sample data for each dataset

What do you want to be able to do?

Access a data sample for each dataset in the catalogue. These data need not be (and shouldn’t be) real data, but they should conform to the schema that the real dataset has.

Why is this important?

This would allow researchers to get a good understanding of the data they will have access to before going through the data access request process.

Any suggestions for how we could solve this?

The values of each attribute could be permuted across records, and a small subset of the result could be made available for variables that are not particularly sensitive. This randomised dataset would carry minimal privacy risk (although that risk should be evaluated on a case-by-case basis, of course).
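To make the idea concrete, here is a rough sketch in Python of one way the permutation could work. It is only an illustration, not a description of how the Gateway does or would do it: the function name `permuted_sample`, the column names, and the toy records are all made up for this example. Each non-sensitive column is shuffled independently, so a row in the sample no longer corresponds to any single real record, and only a handful of rows are returned.

```python
import random


def permuted_sample(records, safe_columns, sample_size, seed=None):
    """Return a small sample whose columns were shuffled independently.

    records      -- list of dicts, one per record (hypothetical input format)
    safe_columns -- names of the columns judged safe to publish
    sample_size  -- maximum number of rows to return
    """
    rng = random.Random(seed)

    # Shuffle each safe column on its own, which breaks the link
    # between the attributes of any one original record.
    shuffled = {}
    for name in safe_columns:
        values = [row[name] for row in records]
        rng.shuffle(values)
        shuffled[name] = values

    # Publish only a small subset of the shuffled rows.
    n = min(sample_size, len(records))
    return [{name: shuffled[name][i] for name in safe_columns} for i in range(n)]


# Toy records, invented purely for illustration.
records = [
    {"patient_id": 1, "age_band": "30-39", "sex": "F", "region": "North"},
    {"patient_id": 2, "age_band": "40-49", "sex": "M", "region": "South"},
    {"patient_id": 3, "age_band": "50-59", "sex": "F", "region": "East"},
    {"patient_id": 4, "age_band": "60-69", "sex": "M", "region": "West"},
]

sample = permuted_sample(
    records, ["age_band", "sex", "region"], sample_size=3, seed=0
)
for row in sample:
    print(row)
```

The sample keeps the same column names and value types as the source, so it still conforms to the schema, while the identifier column is left out entirely. The individual values are still real ones, though, so rare values could remain identifying, which is why the case-by-case evaluation would still be needed.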

Thanks for the feedback! We have added your suggestion to our backlog.

You can track the progress of your suggestion here

Read more about what will happen next: https://www.notion.so/Feature-requests-and-feedback-d26dd1c0b14c40a985982c1269d7aeca