Comparing Predictive Models of Pain Reliever Misuse and Abuse

Name: Comparing Predictive Models of Pain Reliever Misuse and Abuse
Creator: Shiverick, Sean M.
Keywords: 03 medical and health sciences, 0302 clinical medicine, 0202 electrical engineering, electronic engineering, information engineering, Classification Models, 02 engineering and technology, Predictive Modeling, Supervised Learning, 3. Good health

Shiverick, Sean M.

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Article . 2018

License: CC BY

Data sources: Datacite

ZENODO

Article . 2018

License: CC BY

Data sources: ZENODO

ZENODO

Article . 2018

License: CC BY

Data sources: Datacite

Comparing Predictive Models of Pain Reliever Misuse and Abuse

descriptionPublicationkeyboard_double_arrow_right Article 15 Dec 2018 English Publisher:Zenodo

Authors: Shiverick, Sean M.;

doi: 10.5281/zenodo.2301844 , 10.5281/zenodo.2301843

Comparing Predictive Models of Pain Reliever Misuse and Abuse

- Summary
- Subjects
- Metrics

Abstract

The misuse and abuse of prescription opioids (MUPO) has become a major health crisis in the U.S. Predictive modeling provides a useful approach for analyzing pain reliever misuse and abuse and identifying features that contribute to MUPO. This study compared ten classification models using four performance metrics: accuracy, sensitivity (i.e. recall), precision, and f1-score. Data from three years of the National Survey on Drug Use and Health (2015-2017) was combined into a sample of N = 170317 observations. Twenty-six percent of respondents reported using prescription opioid medications in the past year, 11% reported misusing or abusing pain relievers, and 2% reported heroin use. The classifier models were fit to a training set using sixteen features that included demographic variables, medications, and illicit drugs. The binary target variable was pain reliever misuse and abuse. Model performance was evaluated on the testing set. The f1-score was used as the performance metric due to unbalanced classes in the data. Logistic regression, random forests, and decision trees had the highest f1-scores compared to other models. All three models identified cocaine use as the most informative feature for predicting pain reliever misuse and abuse. Amphetamine use was selected as the second most important variable by logistic regression and random forests. The models differed in the importance they assigned to heroin use, tranquilizer use, age category, and mental health as predictors of pain reliever misuse and abuse. Advantages and limitations of the classifier models are considered and the tradeoff between model complexity and interpretability is discussed.

This project was completed in partial fulfillment of requirements for the M.S. in Data Science from the School of Informatics and Computing at IU-Bloomington completed in May, 2018. The manuscript was completed November 23, 2018 and submitted December 15, 2018.

Keywords

Classification Models, Predictive Modeling, Supervised Learning

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average