Status
Completed
Title
Dynamic evaluation and updating of Covid-19 risk prediction models over time: Methods and pipeline
What were the objectives of the study?
Since the first cases of COVID-19 were confirmed in the UK in early 2020, the pandemic has impacted all of our daily lives. It became clear that some people were at greater risk of hospitalisation or death if they became infected. Researchers developed a prediction model called QCOVID to help identify those most at risk of getting infected and then being hospitalised or dying due to COVID based on their individual characteristics such as age, sex, ethnicity and longstanding illnesses. The QCOVID model was used to prioritise people for vaccination and to inform the government’s shielding list. The model was developed using data from a GP records database called QResearch and it was extended in 2021 to account for people having one or two doses of covid-19 vaccination. The QResearch data is anonymous; there is no information that would allow an individual person to be identified.
The QCOVID models were developed in a rapidly changing environment in terms of availability of vaccines, vaccine uptake, infection prevalence, levels of immunity in the population, new variants and availability of treatments. As the pandemic continues to evolve, it is important that the QCOVID models are kept up to date both to provide accurate predictions of risk and to enable their use, if needed, to prioritise groups in the population for new treatments or vaccine boosters.
Our objective is to find out how best to update a clinical prediction model such as QCOVID to account for changes over time. The updating method must be flexible because changes related to COVID-19 can happen quickly and are not always easy to foresee. An example is the speed with which the Omicron variant became dominant in the UK at the end of 2021. The updating method must also be able to include new treatments or vaccines (e.g. the booster dose) , and for their effects to possibly wane over time.
Our study will compare a variety of statistical techniques for updating prediction models over time, taking into account the specific factors that influence risk in the COVID-19 context. To do this, we will use computer-generated data to represent hypothetical future scenarios and also actual data from QResearch, the database of health information used for the development of the original QCOVID models. Based on the performance of different model updating methods, we will make a recommendation for how to update QCOVID going forward.
Chief Investigator
Professor Julia Hippisley-Cox
Lead Applicant Organisation Name
Sponsor
University of Oxford
Location of research
Oxford
Date on which research approved
08-Jun-2022
Project reference ID
OX318
Generic ethics approval reference
18/EM/0400
Are all data accessed are in anonymised form?
Yes
Brief summary of the dataset to be released (including any sensitive data)
(These are the same data points as were used in the final QCOVID3 model)
Data regarding:
Demographic variables:
Age (continuous variable)
Sex
Townsend deprivation score
Ethnicity (9 categories)
Lives in a care home (nursing home or residential care)
Homelessness
+++
Lifestyle factors :
Body mass index (continuous variable)
+++
Vaccination: first, second, booster dose
Record of a SARS-CoV-2 positive test result
+++
Co-morbidities (includes variables in the QCovid models)
Chronic kidney disease
Chemotherapy in previous 12 months
Type 1 or type 2 diabetes (with glycated haemoglobin (HbA1c) levels <59 or ≥59 mmol/mol)
Blood cancer
Bone marrow transplantation in past six months
Respiratory cancer
Radiotherapy in past six months
Solid organ transplantation
Chronic obstructive pulmonary disease
Asthma
Rare lung diseases (cystic fibrosis, bronchiectasis, or alveolitis)
Pulmonary hypertension or pulmonary fibrosis
Coronary heart disease
Stroke
Atrial fibrillation
Heart failure
Venous thromboembolism
Peripheral vascular disease
Congenital heart disease
Dementia
Parkinson’s disease
Epilepsy
Rare neurological conditions (motor neurone disease, multiple sclerosis, myasthenia gravis, or Huntington’s chorea)
Cerebral palsy
Osteoporotic fracture
Rheumatoid arthritis or systemic lupus erythematosus
Liver cirrhosis
Bipolar disorder or schizophrenia
Inflammatory bowel disease
Sickle cell disease
HIV/AIDS
Severe combined immunodeficiency
Data regarding admission and treatment for the above conditions.
For example, OPCS and ICD10 codes for transplantation, chemotherapy, radiotherapy, operative treatment for congenital heart disease.
Linkage to mortality records to enable ascertainment of deaths from COVID-19 during the study period.
There will also be linkage to SGSS data regarding the results of COVID-19 testing.
systemic anticancer and radiotherapy dataset will be requested to identify patients on these treatments.
Funding Source
NIHR Program Grant
Public Benefit Statement
Research Team
Professor Carol Coupland - University of Nottingham
Professor Ruth Keogh - London School of Hygiene & Tropical Medicine
Professor Karla Diaz-Ordaz - London School of Hygiene & Tropical Medicine
Dr Kamryn Tanner - London School of Hygiene & Tropical Medicine
Approval Letter
Publications
-
Dynamic updating of clinical survival prediction models in a changing environment
Authors: Tanner KT, Keogh RH, Coupland CAC, Hippisley-Cox J & Diaz-Ordaz K
Ref:
https://diagnprognres.biomedcentral.com/articles/10.1186/s41512-023-00163-z
Access Type
Trusted Research Environment (TRE)