Biomedical time series clustering based on non-negative sparse coding and probabilistic topic model

descriptionPublicationkeyboard_double_arrow_right Article 01 Sep 2013 English Publisher:Elsevier BVJournal:Computer Methods and Programs in Biomedicine, volume 111, pages 629-641 (issn: 0169-2607,

Copyright policy )

Authors: Wang, J.; Liu, P.; F. H. She, M.; Nahavandi, S.; Kouzani, A.;

doi: 10.1016/j.cmpb.2013.05.022

pmid: 23846155

handle: 1959.3/472497

Biomedical time series clustering based on non-negative sparse coding and probabilistic topic model

- Summary
- Subjects
- Metrics

Abstract

Biomedical time series clustering that groups a set of unlabelled temporal signals according to their underlying similarity is very useful for biomedical records management and analysis such as biosignals archiving and diagnosis. In this paper, a new framework for clustering of long-term biomedical time series such as electrocardiography (ECG) and electroencephalography (EEG) signals is proposed. Specifically, local segments extracted from the time series are projected as a combination of a small number of basis elements in a trained dictionary by non-negative sparse coding. A Bag-of-Words (BoW) representation is then constructed by summing up all the sparse coefficients of local segments in a time series. Based on the BoW representation, a probabilistic topic model that was originally developed for text document analysis is extended to discover the underlying similarity of a collection of time series. The underlying similarity of biomedical time series is well captured attributing to the statistic nature of the probabilistic topic model. Experiments on three datasets constructed from publicly available EEG and ECG signals demonstrates that the proposed approach achieves better accuracy than existing state-of-the-art methods, and is insensitive to model parameters such as length of local segments and dictionary size.

Related Organizations

University of South Carolina System
United States
Swinburne University of Technology
Australia
Deakin University
Australia
University of South Carolina
United States
Australian Catholic University
Australia

Keywords

Models, Statistical, sparse coding, probabilistic topic model, 006, Electroencephalography, Signal-To-Noise Ratio, unsupervised learning, Electrocardiography, Cluster Analysis, Humans, bag-of-words, Probability

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	17
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%