Our unique biomedical database is the largest, most detailed and most widely accessible of its kind.
Whole genome sequences for 500,000 people
The largest, most detailed, most widely accessible biomedical database of its kind.
50,000 proteomes
85,000 people imaged
(And there will be 100,000 before you know it!)
15,000,000 bio samples, available as data.
Longitudinal data collected since 2006
Types of data:
- Activity monitor
- Biomarkers
- Body scans
- Cognition and hearing
- Covid
- Genetics
- Health conditions
- Lifestyle
- Links to health records
- Physical measures
- Proteomics
- Specialist questionnaires
Our participants
Information about how our participants were selected and continue to help us.
Biological samples
Information on the samples etc here.
Data timeline
The timeline of past data releases and planned future releases.
Protecting the data
Find out who uses our data, how we verify data users and projects, and how we keep data secure.
Data browser
Information on the data browser before linking to the third party site.
Discoveries
See how our data is impacting healthcare >
12 July 2024
Revealing the impact of COVID-19 on the brain
12 July 2024
Predicting Parkinson’s disease earlier
We have a huge amount of data.
We have collected more genetic, proteomic, imaging data
than anyone else, anywhere.
