Validation of an internationally derived patient severity phenotype to support COVID-19 analytics from electronic health record data
Articolo
Data di Pubblicazione:
2021
Abstract:
Objective: The Consortium for Clinical Characterization of COVID-19 by EHR (4CE) is an international collaboration addressing coronavirus disease 2019 (COVID-19) with federated analyses of electronic health record (EHR) data. We sought to develop and validate a computable phenotype for COVID-19 severity. Materials and Methods: Twelve 4CE sites participated. First, we developed an EHR-based severity phenotype consisting of 6 code classes, and we validated it on patient hospitalization data from the 12 4CE clinical sites against the outcomes of intensive care unit (ICU) admission and/or death. We also piloted an alternative machine learning approach and compared selected predictors of severity with the 4CE phenotype at 1 site. Results: The full 4CE severity phenotype had pooled sensitivity of 0.73 and specificity 0.83 for the combined outcome of ICU admission and/or death. The sensitivity of individual code categories for acuity had high variability-up to 0.65 across sites. At one pilot site, the expert-derived phenotype had mean area under the curve of 0.903 (95% confidence interval, 0.886-0.921), compared with an area under the curve of 0.956 (95% confidence interval, 0.952-0.959) for the machine learning approach. Billing codes were poor proxies of ICU admission, with as low as 49% precision and recall compared with chart review. Discussion: We developed a severity phenotype using 6 code classes that proved resilient to coding variability across international institutions. In contrast, machine learning approaches may overfit hospital-specific orders. Manual chart review revealed discrepancies even in the gold-standard outcomes, possibly owing to heterogeneous pandemic conditions. Conclusions: We developed an EHR-based severity phenotype for COVID-19 in hospitalized patients and validated it at 12 international sites.
Tipologia CRIS:
1.1 Articolo in rivista
Keywords:
computable phenotype; data interoperability; data networking; disease severity; medical informatics; novel coronavirus; Hospitalization; Humans; Machine Learning; Prognosis; ROC Curve; Sensitivity and Specificity; COVID-19; Electronic Health Records; Severity of Illness Index
Elenco autori:
Klann, J. G.; Estiri, H.; Weber, G. M.; Moal, B.; Avillach, P.; Hong, C.; Tan, A. L. M.; Beaulieu-Jones, B. K.; Castro, V.; Maulhardt, T.; Geva, A.; Malovini, A.; South, A. M.; Visweswaran, S.; Morris, M.; Samayamuthu, M. J.; Omenn, G. S.; Ngiam, K. Y.; Mandl, K. D.; Boeker, M.; Olson, K. L.; Mowery, D. L.; Follett, R. W.; Hanauer, D. A.; Bellazzi, R.; Moore, J. H.; Loh, N. -H. W.; Bell, D. S.; Wagholikar, K. B.; Chiovato, L.; Tibollo, V.; Rieg, S.; Li, A. L. L. J.; Jouhet, V.; Schriver, E.; Xia, Z.; Hutch, M.; Luo, Y.; Kohane, I. S.; Brat, G. A.; Murphy, S. N.
Link alla scheda completa:
Pubblicato in: