Stat-DSM: Statistically Discriminative Sub-Trajectory Mining With Multiple Testing Correction

descriptionPublicationkeyboard_double_arrow_right Article 01 Mar 2022Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Knowledge and Data Engineering, volume 34, pages 1,477-1,488 (issn: 1041-4347, eissn: 2326-3865,

Copyright policy )

Authors: Vo Nguyen Le Duy; Takuto Sakuma; Taiju Ishiyama; Hiroki Toda; Kazuya Arai; Masayuki Karasuyama; Yuta Okubo; +4 Authors

doi: 10.1109/tkde.2020.2994344

Stat-DSM: Statistically Discriminative Sub-Trajectory Mining With Multiple Testing Correction

- Summary
- Metrics

Abstract

We propose a novel statistical approach to evaluate the statistical significance (reliability) of the results from discriminative sub-trajectory mining, which we call Statistically Discriminative Sub-trajectory Mining (Stat-DSM). Given two groups of trajectories, the goal of Stat-DSM is to extract moving patterns in the form of sub-trajectories that occur statistically significantly more often in one group than in the other. An advantage of the proposed method is that the statistical significance of the extracted sub-trajectories are properly controlled in the sense that the probability of finding a false discriminative sub-trajectory is smaller than a specified significance threshold α(e.g. 0.05), which is crucial when the method is used in scientific or social science studies under noisy environments. Finding such statistically discriminative sub-trajectories from a massive trajectory dataset is both computationally and statistically challenging. In the Stat-DSM method, we address these difficulties by introducing a tree representation of sub-trajectories, and applying a permutation-based statistical inference method to the tree. To the best of our knowledge, Stat-DSM is the first method that provides a statistical approach to quantify the reliability of discriminative sub-trajectory mining results. We illustrate the effectiveness and scalability of the Stat-DSM method by applying it to a real-world dataset containing 1,000,000 trajectories.

Related Organizations

RIKEN
Japan
Nagoya Institute of Technology
Japan

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

2

Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now