How the datasets interlink

We have had fun showing how datasets interlink - with the more serious aspect that these Venn diagrams show that not all participants have all data; and that some are no longer contactable for follow-up.  Perhaps the 2nd hits the sweet spot between meaningful diagrams and pattern-making for its own sake?

 The 2nd image is revised, while the others are new, reflecting newly curated data.

Venn Diagram of data linkage between core NIHR Datasets featuing Eligible, Demographics and Participant Contact Details for each of the three circles. Venn diagram of Data Linkage between core NIHR Datasets, featuring eligible for data release (consented), Case Report Form, Demographics, Health and Lifestyle Questionnaire and SNP Chip and Imputation (Genotype) Venn diagram of Data Linkage between core NIHR Datasets of Eligible (consent), Participant Contact Details, Case Report Form, Demographics, Health and Lifestyle Questionnaire and SNP Chip and Imputation (Genotype) Venn diagram of Data Linkage between core NIHR Datasets of Eligible (consent), Participant Contact Details, Case Report Form, Demographics, Health and Lifestyle Questionnaire and SNP Chip and Imputation (Genotype)

NHS Trust data is harder to represent, as it itself covers so many domains - this is our attempt to visualise this, showing the variety of data available from across Trusts.

Bar chart showing records per Data subtype for Chrons Form, Clinical Info, Clinical notes, Commissioning, Demographics, Diagnostic, Imaging, Patient Summary, Prescription, Request, Surgery and UK Form

We should also publish the counts in a table, for those who prefer tables to graphics!

Cohort discovery tool

Screenshots and commentary required

In terms of the HDR UK Data Utility Framework, showing how datasets interlink and how they may be queried together, brings this improvement:

  • Pathway Coverage (bronze to silver/gold)
    • Action: Publish a linkage matrix will bring them to gold.