About the dataset

Primary description: Large-scale population biobank

UK Biobank is a major national and international health resource, and a registered charity in its own right. Its aim is to improve the prevention, diagnosis and treatment of a wide range of serious and life-threatening illnesses, including cancer, heart diseases, stroke, diabetes, arthritis, osteoporosis, eye disorders, depression and forms of dementia. UK Biobank recruited 500,000 people aged 40-69 years from across the country between 2006 and 2010. Participants have undergone measurements, provided blood, urine and saliva samples for future analysis, given detailed information about themselves, and agreed to have their health followed. Over many years this will build into a powerful resource to help scientists discover why some people develop a disease and others do not. The UK Biobank dataset is available to researchers, subject to an approved access application, via https://www.ukbiobank.ac.uk/.
Study UK Biobank
Study Website UKBIOBANK
Data Descriptor Paper https://doi.org/10.1016/j.neuroimage.2017.10.034
DUA https://www.ukbiobank.ac.uk/use-our-data/apply-for-access/
Country/Region UK
Disease Tag multi_disease
Age (mean ± SD) (Baseline) 64.1 ± 7.5
Age Range (Baseline) 45-81
% Female 52.9
CUBIC Project /cbica/projects/UKBB_Processed
PI for CUBIC Access Approval Christos Davatzikos
N Subjects 22308
N MR Sessions 42081
Diagnoses
AD
Value V01 V02
CN 39562 1408
nan 10
AD 4
DLMUSE (v1.0.7)
(41067/41067 complete)
/cbica/projects/UKBB_Processed/Pipelines/UKBB_DLMUSE_2025/Results/DLMUSE_Volumes.csv
RAVENS (DRAMMS-1.4.1)
(40973/40973 complete)
/cbica/projects/UKBB_Processed/Pipelines/UKBB_3.5D_2020/Batches/Batch*/Protocols/RAVENS/ (* = any integer from 0 through 17)
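The batch wildcard in the RAVENS path can be expanded programmatically. The following is a minimal sketch, assuming the batch directories are named Batch0 through Batch17 as listed; the helper name ravens_batch_dirs is hypothetical, and the code only builds the paths without checking that they exist.

```python
from pathlib import PurePosixPath

# Root directory taken from the RAVENS path listed above.
RAVENS_ROOT = PurePosixPath(
    "/cbica/projects/UKBB_Processed/Pipelines/UKBB_3.5D_2020/Batches"
)


def ravens_batch_dirs(n_batches=18):
    """Return the RAVENS directory for Batch0 .. Batch{n_batches - 1}.

    Hypothetical helper: it assumes the Batch<i>/Protocols/RAVENS layout
    described above and does not touch the filesystem.
    """
    return [
        RAVENS_ROOT / f"Batch{i}" / "Protocols" / "RAVENS"
        for i in range(n_batches)
    ]


# Print every candidate directory, one per line.
for path in ravens_batch_dirs():
    print(path)
```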
DLWMLS (v0.1.0)
(38701/38701 complete)
/cbica/projects/UKBB_Processed/Pipelines/UKBB_DLWMLS_2025/Results/UKBB_DLWMLS_DLMUSE_Segmented_Volumes.csv
XCP-D (0.7.1rc6)
(2365/42081 complete)
/cbica/projects/UKBB_Processed/Pipelines/UKBB_fMRI_2024/Protocols/XCP_D
/usr/local/miniconda/bin/xcp_d \
  /cbica/projects/UKBB_Processed/Pipelines/UKBB_fMRI_2024/Input/Data/ \
  /cbica/projects/UKBB_Processed/Pipelines/UKBB_fMRI_2024/Protocols/XCP_D/ \
  participant \
  --participant_label 1000038 \
  --input-type ukb \
  -p gsr_only \
  -w /cbica/projects/UKBB_Processed/Pipelines/UKBB_fMRI_2024/Protocols/ \
  --motion-filter-type notch \
  --band-stop-min 12 \
  --band-stop-max 18 \
  --motion-filter-order 4 \
  --despike \
  --fd-thresh 0 \
  --lower-bpf 0.01 \
  --upper-bpf 0.08 \
  --bpf-order 2 \
  --head-radius auto \
  --dummy-scans auto \
  --min-coverage 0.5 \
  --smoothing 0 \
  --fs-license-file /cbica/projects/UKBB_Processed/Pipelines/UKBB_fMRI_2024/license.txt
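The XCP-D call above is shown for a single participant (1000038). As a sketch of how the same call could be parameterized for other participants, the snippet below rebuilds the argument list with only the participant label varying. The helper name build_xcpd_command is hypothetical; every flag and path is copied from the command above, and the code prints the command rather than running it.

```python
import shlex

# Base directory copied from the XCP-D command above.
BASE = "/cbica/projects/UKBB_Processed/Pipelines/UKBB_fMRI_2024"


def build_xcpd_command(participant_label):
    """Return the XCP-D argument list for one participant (hypothetical helper)."""
    return [
        "/usr/local/miniconda/bin/xcp_d",
        f"{BASE}/Input/Data/",          # input dataset
        f"{BASE}/Protocols/XCP_D/",     # output directory
        "participant",
        "--participant_label", str(participant_label),
        "--input-type", "ukb",
        "-p", "gsr_only",
        "-w", f"{BASE}/Protocols/",     # working directory
        "--motion-filter-type", "notch",
        "--band-stop-min", "12",
        "--band-stop-max", "18",
        "--motion-filter-order", "4",
        "--despike",
        "--fd-thresh", "0",
        "--lower-bpf", "0.01",
        "--upper-bpf", "0.08",
        "--bpf-order", "2",
        "--head-radius", "auto",
        "--dummy-scans", "auto",
        "--min-coverage", "0.5",
        "--smoothing", "0",
        "--fs-license-file", f"{BASE}/license.txt",
    ]


# Show the shell-quoted command for the participant used in the example above.
print(shlex.join(build_xcpd_command(1000038)))
```

To execute instead of print, the returned list could be passed to subprocess.run with check=True in an environment where xcp_d is installed at that path.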
Non-Imaging Data
/cbica/projects/ISTAGING/Pipelines/ClinicalDataConsolidation_201911/Data/External_Data/UKBiobank

Raw data downloaded from source.

/cbica/projects/ISTAGING/Pipelines/ClinicalDataConsolidation_201911/dictionaries/UKBiobank

Data dictionary of raw data downloaded from source.

/cbica/projects/ISTAGING/Pipelines/ISTAGING_Data_Consolidation_2020/v2.0/istaging.csv

Data consolidated and harmonized in 2020.

This study: subset with df[df.Study == 'UKBIOBANK']

README: /cbica/projects/ISTAGING/Pipelines/ISTAGING_Data_Consolidation_2020/v2.0/README.md

Release Notes: /cbica/projects/ISTAGING/Pipelines/ISTAGING_Data_Consolidation_2020/v2.0/Release_Notes.md
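The subsetting expression df[df.Study == 'UKBIOBANK'] can be wrapped as follows. This is a minimal sketch, assuming pandas is available and that the consolidated table has a Study column as the expression implies; the helper name subset_study and the toy rows are made up for illustration and are not taken from istaging.csv.

```python
import pandas as pd

# Path of the 2020 consolidated table, as listed above.
ISTAGING_CSV = (
    "/cbica/projects/ISTAGING/Pipelines/"
    "ISTAGING_Data_Consolidation_2020/v2.0/istaging.csv"
)


def subset_study(df, study="UKBIOBANK"):
    """Return a copy of the rows whose Study column equals `study`."""
    return df[df["Study"] == study].copy()


# Toy frame standing in for the real table; "OTHER_STUDY" and the
# "Value" column are placeholders, not real contents of istaging.csv.
toy = pd.DataFrame(
    {
        "Study": ["UKBIOBANK", "OTHER_STUDY", "UKBIOBANK"],
        "Value": [1, 2, 3],
    }
)
ukbb = subset_study(toy)
print(ukbb)
```

On the cluster, the real table would be loaded first, for example with pd.read_csv(ISTAGING_CSV, low_memory=False), and then passed to subset_study.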

/cbica/home/harmang/for_others/final_combined_istaging_withMUSE.csv

Data consolidated and harmonized in 2026.

This study: subset with df[df.Study == 'UKBIOBANK']


Funding
UK Biobank’s core funding is provided by a collaboration of Wellcome, MRC, Cancer Research UK, the British Heart Foundation and the National Institute for Health and Care Research (NIHR).