Data dictionaries, schema definitions, and metadata for research datasets from Nottingham University Hospitals NHS Trust (NUH), one of the largest acute NHS trusts in England, serving a population of approximately 2.5 million across Nottinghamshire and surrounding counties.
This repository describes the structure and content of each dataset: field names, types, permitted values, and coding standards. All datasets are pseudonymised research views; access to the data itself requires a formal data access request.
Full documentation is at health-informatics-uon.github.io/nuh-data.
Pydantic v2 models for each dataset are published directly from this repository. Install with pip:
pip install git+https://github.com/Health-Informatics-UoN/nuh-data.git
Then validate or annotate your data against the schema:
from nuh_data.datasets.sact import SACTResearchView
from nuh_data.datasets.cosd import COSDResearchView
# Validate a record; Pydantic raises a ValidationError on type or enum mismatches
record = SACTResearchView(
    pseudo_nhs_number="abc123",
    administration_date="2023-06-01",
    drug_name="CARBOPLATIN",
    sact_administration_route="01",  # Intravenous
    intent_of_treatment="P",  # Palliative
)
Models are generated directly from the LinkML schemas, so field names, types, and permitted values stay in sync with the data dictionaries published in this repository.
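When checking many rows at once, it can be more useful to collect validation errors per row than to stop at the first failure. The following is a minimal sketch of that pattern, not part of the published package. To stay self-contained it uses a hypothetical stand-in model, `SACTRecordSketch`, that declares only the five fields and the two codes shown in the example above; the real `SACTResearchView` is assumed to have more fields and larger enumerations. The helper `validate_rows` and the sample rows are also illustrative. In practice you would substitute the imported model for the stand-in.

```python
from datetime import date
from enum import Enum

from pydantic import BaseModel, ValidationError


class RouteSketch(str, Enum):
    # Only the code shown in the README example; the real enumeration is larger.
    INTRAVENOUS = "01"


class IntentSketch(str, Enum):
    # Only the code shown in the README example; the real enumeration is larger.
    PALLIATIVE = "P"


class SACTRecordSketch(BaseModel):
    """Hypothetical stand-in for SACTResearchView, not the published model."""

    pseudo_nhs_number: str
    administration_date: date
    drug_name: str
    sact_administration_route: RouteSketch
    intent_of_treatment: IntentSketch


def validate_rows(rows, model=SACTRecordSketch):
    """Split raw dicts into validated records and (row index, errors) pairs."""
    valid, rejected = [], []
    for index, row in enumerate(rows):
        try:
            valid.append(model.model_validate(row))
        except ValidationError as exc:
            # exc.errors() is a list of dicts with keys such as "loc" and "msg"
            rejected.append((index, exc.errors()))
    return valid, rejected


# Illustrative rows: the second has a non-ISO date and an undeclared route code.
rows = [
    {
        "pseudo_nhs_number": "abc123",
        "administration_date": "2023-06-01",
        "drug_name": "CARBOPLATIN",
        "sact_administration_route": "01",
        "intent_of_treatment": "P",
    },
    {
        "pseudo_nhs_number": "def456",
        "administration_date": "01/06/2023",
        "drug_name": "CARBOPLATIN",
        "sact_administration_route": "99",
        "intent_of_treatment": "P",
    },
]

valid, rejected = validate_rows(rows)
for index, errors in rejected:
    for err in errors:
        print(f"row {index}: {err['loc']}: {err['msg']}")
```

Passing the model as a parameter means the same helper could be reused with any of the dataset models, for example `validate_rows(rows, model=SACTResearchView)`, assuming the published classes are ordinary Pydantic v2 models as described above.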