descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 01 Apr 2022 English Publisher:SAGE PublicationsJournal:Multiple Sclerosis Journal - Experimental, Translational and Clinical, volume 8 (issn: 2055-2173, eissn: 2055-2173,

Authors: Pedro Alves; Eric Green; Michelle Leavy; Haley Friedler; Gary Curhan; Carl Marci; Costas Boussios;

doi: 10.1177/20552173221108635

pmid: 35755008

pmc: PMC9228644

Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis

- Summary
- Subjects
- Related research
  (8)
- Metrics

Abstract

Background Disability assessment using the Expanded Disability Status Scale (EDSS) is important to inform treatment decisions and monitor the progression of multiple sclerosis. Yet, EDSS scores are documented infrequently in electronic medical records. Objective To validate a machine learning model to estimate EDSS scores for multiple sclerosis patients using clinical notes from neurologists. Methods A machine learning model was developed to estimate EDSS scores on specific encounter dates using clinical notes from neurologist visits. The OM1 MS Registry data were used to create a training cohort of 2632 encounters and a separate validation cohort of 857 encounters, all with clinician-recorded EDSS scores. Model performance was assessed using the area under the receiver-operating-characteristic curve (AUC), positive predictive value (PPV), and negative predictive value (NPV), calculated using a binarized version of the outcome. The Spearman R and Pearson R values were calculated. The model was then applied to encounters without clinician-recorded EDSS scores in the MS Registry. Results The model had a PPV of 0.85, NPV of 0.85, and AUC of 0.91. The model had a Spearman R value of 0.75 and Pearson R value of 0.74 when evaluating performance using the continuous estimated EDSS and clinician-recorded EDSS scores. Application of the model to eligible encounters resulted in the generation of eEDSS scores for an additional 190,282 encounters from 13,249 patients. Conclusion EDSS scores can be estimated with very good performance using a machine learning model applied to clinical notes, thus increasing the utility of real-world data sources for research purposes.

Keywords

Original Research Article

8 Research products, page 1 of 1

Validation of a machine learning approach to estimate Systemic Lupus Erythematosus Disease Activity Index score categories and application in a real-world dataset
2021IsAmongTopNSimilarDocuments
Accuracy of time to treatment estimates in the CRASH-3 clinical trial: impact on the trial results
2020IsAmongTopNSimilarDocuments
sj-docx-2-mso-10.1177_20552173221108635 - Supplemental material for Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis
2022IsSupplementedBy
Are commonly ordered lab tests useful screens for alcohol disorders in older male veterans receiving primary care?.
2007IsAmongTopNSimilarDocuments
sj-docx-3-mso-10.1177_20552173221108635 - Supplemental material for Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis
2022IsSupplementedBy
Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis
2022IsSupplementedBy
sj-docx-1-mso-10.1177_20552173221108635 - Supplemental material for Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis
2022IsSupplementedBy
Concordance between Patients and Clinicians for Reporting Symptoms Associated with Treatment for Chronic Hepatitis C during a Pragmatic Clinical Trial
2022IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	9
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%