
The Danish National Patient Register (DNPR) is an important data source for research providing detailed information on all hospital contacts in Denmark. With the transition from the second version of the DNPR (DNPR2) to the third version (DNPR3) in early 2019, the patient type variable (inpatient, elective outpatient, acute outpatient) was removed. This study proposes and evaluates algorithms to classify hospital contacts into these categories in DNPR3, aiming for consensus in data interpretation for researchers using Danish registries.We analyzed somatic public hospital contacts in Denmark from 2017 to 2020, with 20,882,018 unique contacts in DNPR2 and 27,694,584 in DNPR3. Several classification algorithms were developed and assessed, including department-based, contact-based, and hybrid methods, to infer patient types in DNPR3 based on contact features, such as duration and admission type. In DNPR3, where the true patient type is unknown, proxy labels were used to train classification algorithms.Compared to the true patient type variable in DNPR2, our department-based classifier showed high positive predictive values (PPVs) and sensitivities in DNPR2 with PPVs ranging from 95.6 to 99.5 and sensitivities ranging from 94.1 to 99.6 across patient types. The hybrid approach showed improved PPVs and sensitivities for acute (PPV = 97.3, sensitivity = 96.8) and elective (PPV = 99.8, sensitivity = 99.9) outpatients. In both DNPR2 and DNPR3 high agreement between contact-based classification algorithms was obtained indicating robustness in our classification methods which suggests the presence of inherent patterns in the data.Our study shows that all presented classification methods are suitable for categorizing patient types in DNPR2 depending on the available data and furthermore demonstrated robustness, supporting their suitability for classification in DNPR3. Future research should explore advanced techniques and comprehensive department classification for enhanced accuracy and applicability.
Original Research
Original Research
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
