About our data

Our unique biomedical database is the largest, most detailed and most widely accessible of its kind.

Whole genome sequences for 500,000 people

The largest, most detailed, most widely accessible biomedical database of its kind.

50,000 proteomes

85,000 people imaged

(And there will be 100,000 before you know it!)

15,000,000 bio samples, available as data.

Longitudinal data collected since 2006

Types of data:

  • Activity monitor
  • Biomarkers
  • Body scans
  • Cognition and hearing
  • Covid
  • Genetics
  • Health conditions
  • Lifestyle
  • Links to health records
  • Physical measures
  • Proteomics
  • Specialist questionnaires

Types of data >>

Our participants

Information about how our participants were selected and continue to help us.

Biological samples

Information on the samples etc here.

Data timeline

The timeline of past data releases and planned future releases.

Protecting the data

Find out who uses our data, how we verify data users and projects, and how we keep data secure.

Data browser

Information on the data browser before linking to the third party site.


Discoveries

See how our data is impacting healthcare >