Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ International Journa...arrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
International Journal of Population Data Science
Article . 2024 . Peer-reviewed
License: CC BY
Data sources: Crossref
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
versions View all 2 versions
addClaim

This Research product is the result of merged Research products in OpenAIRE.

You have already added 0 works in your ORCID record related to the merged Research product.

Model-based algorithms to ascertain smoking in administrative health data: a registry-based validation study

Authors: Md Ashiqul Haque; Nathan C Nickel; Maxime Turgeon; Lisa M Lix;

Model-based algorithms to ascertain smoking in administrative health data: a registry-based validation study

Abstract

Objectives We developed a machine-learning model-based algorithm (MBA) for smoking in Administrative Health Data (AHD). The validity of this MBA was compared to a rule-based algorithm (RBA). Approaches The study included adults (≥18 years) from a clinical registry containing self-reported current smoking from 2017 to 2020 in Manitoba, Canada. Clinical data were linked to up to five years of hospitalization, physician billing claims, and prescription medication records. The RBA was based on diagnosis codes for tobacco use and nicotine dependence medication. MBAs, constructed using random forest (RF) models, included these indicators in addition to comorbid condition diagnoses and sociodemographic factors. Sensitivity, specificity, positive and negative predictive values (PPV, NPV), and 95% confidence intervals (CIs) were estimated. Results The cohort comprised 24,718 individuals (88.6% female); prevalence of current smokers was 10.0%. The RBA had sensitivity of 27.3% (95% CI: 24.2-30.7), specificity of 96.6% (95% CI: 96.1-97.0), and PPV of 47.2% (95% CI: 42.9-51.5). The MBA had sensitivity of 68.6% (95% CI: 65.1-71.9), specificity of 76.3% (95% CI: 75.2-77.3), and PPV of 24.3% (95% CI: 23.2-25.6). NPV was high irrespective of algorithms. Stratified analyses revealed similar estimates for males and females, and the number of years of AHD did not affect the MBA results. ConclusionsAn RF-based MBA for smoking ascertainment in linked AHD sources improved sensitivity compared to the RBA. However, the RBA excelled in specificity and PPV. ImplicationBalancing accurate smoker identification with the risk of false positives is crucial when choosing an algorithm to ascertain current smokers using AHD.

Related Organizations
Keywords

Demography. Population. Vital events, HB848-3697

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
gold