DeepDeath: Learning to Predict the Underlying Cause of Death with Big Data

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object , Other literature type 06 May 2017Embargo end date: 01 Jan 2017Publisher:openRxivJournal:2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Authors: Hassanzadeh Student Me, Hamid Reza; Sha, Ying; Wang Senior Mem, May D.;

doi: 10.1101/134965 , 10.1109/embc.2017.8037579 , 10.48550/arxiv.1705.03508

pmid: 29060620

pmc: PMC7324297

arXiv: 1705.03508

DeepDeath: Learning to Predict the Underlying Cause of Death with Big Data

- Summary
- Subjects
- Metrics

Abstract

Abstract Multiple cause-of-death data provides a valuable source of information that can be used to enhance health standards by predicting health related trajectories in societies with large populations. These data are often available in large quantities across U.S. states and require Big Data techniques to uncover complex hidden patterns. We design two different classes of models suitable for large-scale analysis of mortality data, a Hadoop-based ensemble of random forests trained over N-grams, and the DeepDeath, a deep classifier based on the recurrent neural network (RNN). We apply both classes to the mortality data provided by the National Center for Health Statistics and show that while both perform significantly better than the random classifier, the deep model that utilizes long short-term memory networks (LSTMs), surpasses the N-gram based models and is capable of learning the temporal aspect of the data without a need for building ad-hoc, expert-driven features.

Related Organizations

Georgia Institute of Technology
United States
Microsoft (United States)
United States
Emory University
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Machine Learning (stat.ML), Machine Learning (cs.LG), Statistics - Machine Learning, Cause of Death, Humans, Neural Networks, Computer, Computation and Language (cs.CL)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average