Article
Open Access
Expand
Development and validation of a machine learning model elucidating risk factors in severe COVID-19
1 AI/ML, Quantitative and Digital Sciences (AQDS), Global Biometrics and Data Management (GBDM), Pfizer Inc, Cambridge, MA, USA
2 Data Science and Advanced Analytics, Pfizer Inc, Collegeville, PA, USA
3 Global Biometrics and Data Management (GBDM), Pfizer Inc, Cambridge, MA, USA
4 Anti-Infective Research Unit, Pfizer Inc, Pearl River, NY, USA
5 Global Product Development, Pfizer Inc, Collegeville, PA, USA
Abstract

Objectives: COVID-19 remains a significant healthcare burden. Leveraging the combined power of clinical trial data and big data from the real world, this study elucidated baseline factors predictive of subsequent outcomes relating to severe COVID-19 disease (SD) and the effect of nirmatrelvir/ritonavir (Tx), a protease inhibitor, on disease progression. Methods: We retrospectively analyzed data from the Evaluation of Protease Inhibition for COVID-19 in High-Risk Patients (EPIC-HR) clinical trial (NCT04960202) to discern observational associations between baseline factors and subsequent SD outcome. Baseline factors, including demographics, clinical laboratory results, symptoms, medical history, vital signs, and electrocardiogram features, were studied using machine learning (ML) for their importance in predicting hospitalization or death through Day 28, with Tx effects analyzed statistically. Generalizability of results was evaluated using real-world data (RWD) Optum Electronic Health Records. Results: Modeling indicated Tx was the greatest predictor of whether a patient progressed to SD. The most important baseline factors associated with increased risk of SD were elevated baseline (1) viral load (VL; > ~4 log10 copies/mL), (2) hsCRP (> ~1 mg/dL), (3) ferritin (> ~280 ug/L), (4) haptoglobin (> ~210 mg/dL), and (5) increased age (> ~48 years). Tx reduced VL and abnormally high hsCRP and haptoglobin to greater extents than placebo at the measured time points. RWD validation supported findings on increased risk with elevated hsCRP and ferritin and increased age (no data were available on VL and haptoglobin). Conclusion: ML analysis identified critical baseline factors immediately before or at the beginning of COVID-19 infection predictive of progression to SD in adults that are common to a heterogeneous population. This study provides insights on multivariate signatures of COVID-19 disease progression and Tx effects, which may aid future studies and inform treatment decision making.

Keywords

big data; real-world evidence; machine learning; precision medicine; COVID-19; EPIC-HR

Preview
References
  • [1] Cucinotta D, Vanelli M. WHO Declares COVID-19 a Pandemic. Acta Biomed. 2020, 91(1):157–160.
  • [2]Centers for Disease Control and Prevention. Long COVID or Post-COVID Conditions. Available at: https://www.cdc.gov/covid/long-term-effects/?CDC_AAref_Val=https://www.cdc.gov/coronavirus/2019-ncov/long-term-effects/index.html. (accessed May 7, 2024).
  • [3]World Health Organization. WHO Coronavirus (COVID-19) Dashboard. Available at: https://covid19.who.int/. (accessed March 12, 2024).
  • [4]US Centers for Disease Control and Prevention. COVID Data Tracker. Available at: https://covid.cdc.gov/covid-data-tracker/#datatracker-home. (accessed December 5, 2024).
  • [5] Vetter P, Vu DL, L’Huillier AG, Schibler M, Kaiser L, et al. Clinical features of covid-19. BMJ. 2020, 369:m1470.
  • [6]Siordia JA, Jr. Epidemiology and clinical features of COVID-19: A review of current literature. J. Clin. Virol. 2020, 127:104357.
  • [7] Huang C, Wang Y, Li X, Ren L, Zhao J, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020, 395(10223):497–506.
  • [8]World Health Organization. Weekly epidemiological update on COVID-19 - 22 March 2023. Available at: https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---22-march-2023. (accessed January 10, 2024).
  • [9] Sanyaolu A, Okorie C, Marinkovic A, Patidar R, Younis K, et al. Comorbidity and its impact on patients with COVID-1 SN Compr. Clin. Med. 2020, 2(8):1069–1076.
  • [10]Wang B, Li R, Lu Z, Huang Y. Does comorbidity increase the risk of patients with COVID-19: evidence from meta-analysis. Aging (Albany N. Y.) 2020, 12(7):6049–6057.
  • [11] Biswas M, Rahaman S, Biswas TK, Haque Z, Ibrahim B. Association of sex, age, and comorbidities with mortality in COVID-19 patients: a systematic review and meta-analysis. Intervirology. 2020:1–12.
  • [12] Hammond J, Leister-Tebbe H, Gardner A, Abreu P, Bao W, et al. Oral nirmatrelvir for high-risk, nonhospitalized adults with COVID-19. N. Engl. J. Med. 2022, 386(15):1397–1408.
  • [13]Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. 2020, 36(10):3093–3098.
  • [14]Centers for Disease Control and Prevention. CDC Museum COVID-19 Timeline. Available at: https://www.cdc.gov/museum/timeline/covid19.html. (accessed January 10, 2024).
  • [15]Ali N. Elevated level of C-reactive protein may be an early marker to predict risk for severity of COVID-19. J. Med. Virol. 2020, 92(11):2409–2411.
  • [16]Luan YY, Yin CH, Yao YM. Update advances on C-reactive protein in COVID-19 and other viral infections. Front. Immunol. 2021, 12:720363.
  • [17] Tan C, Huang Y, Shi F, Tan K, Ma Q, et al. C-reactive protein correlates with computed tomographic findings and predicts severe COVID-19 early. J. Med. Virol. 2020, 92(7):856–862.
  • [18]Cavezzi A, Troiani E, Corrao S. COVID-19: hemoglobin, iron, and hypoxia beyond inflammation. A narrative review. Clin. Pract. 2020, 10(2):1271.
  • [19]Vargas-Vargas M, Cortes-Rojo C. Ferritin levels and COVID- Rev. Panam. Salud Publica. 2020, 44:e72.
  • [20]Ceciliani F, Giordano A, Spagnolo V. The systemic reaction during inflammation: the acute-phase proteins. Protein Pept. Lett. 2002, 9(3):211–223.
  • [21] Beimdiek J, Janciauskiene S, Wrenger S, Volland S, Rozy A, et al. Plasma markers of COVID-19 severity: a pilot study. Respir. Res. 2022, 23(1):343.
  • [22] Chrostek L, Gan K, Kazberuk M, Kralisz M, Gruszewska E, et al. Acute-phase proteins as indicators of disease severity and mortality in COVID-19 patients. Sci. Rep. 2024, 14(1):20360.
  • [23] Yağcı S, Serin E, Acicbe Ö, Zeren Mİ, Odabaşı MS. The relationship between serum erythropoietin, hepcidin, and haptoglobin levels with disease severity and other biochemical values in patients with COVID-19. Int. J. Lab. Hematol. 2021, 43(suppl 1):142–151.
  • [24] Henkens MTHM, Raafs AG, Verdonschot JAJ, Linschoten M, van Smeden M, et al. Age is the main determinant of COVID-19 related in-hospital mortality with minimal impact of pre-existing comorbidities, a retrospective cohort study. BMC Geriatr. 2022, 22(1):184.
  • [25] Romero Starke K, Reissig D, Petereit-Haack G, Schmauder S, Nienhaus A, et al. The isolated effect of age on the risk of COVID-19 severe outcomes: a systematic review with meta-analysis. BMJ Glob. Health. 2021, 6(12):e006434.
  • [26] Layden JE, Ghinai I, Pray I, Kimball A, Layer M, et al. Pulmonary illness related to E-Cigarette use in Illinois and Wisconsin - final report. N. Engl. J. Med. 2020, 382(10):903–916.
  • [27] Magleby R, Westblade LF, Trzebucki A, Simon MS, Rajan M, et al. Impact of severe acute respiratory syndrome coronavirus 2 viral load on risk of intubation and mortality among hospitalized patients with coronavirus disease 2019. Clin. Infect. Dis. 2021, 73(11):e4197–e4205.
  • [28] Dadras O, Afsahi AM, Pashaei Z, Mojdeganlou H, Karimi A, et al. The relationship between COVID-19 viral load and disease severity: a systematic review. Immun. Inflamm. Dis. 2022, 10(3):e580.
  • [29]Toussi SS, Hammond JL, Gerstenberger BS, Anderson AS. Therapeutics for COVID-19. Nat. Microbiol. 2023, 8(5):771–786.