Supervised Anomaly Detection in Uncertain Pseudoperiodic Data Streams

Article, Preprint English OPEN
Ma, Jiangang ; Sun, Le ; Wang, Hua ; Zhang, Yanchun ; Aickelin, Uwe (2016)
  • Publisher: Association for Computing Machinery
  • Related identifiers: doi: 10.1145/2806890
  • Subject: Computer Science - Artificial Intelligence

Uncertain data streams have been widely generated in many Web applications. The uncertainty in data streams makes anomaly detection from sensor data streams far more challenging. In this paper, we present a novel framework that supports anomaly detection in uncertain data streams. The proposed framework adopts an efficient uncertainty pre-processing procedure to identify and eliminate uncertainties in data streams. Based on the corrected data streams, we develop effective period pattern recognition and feature extraction techniques to improve the computational efficiency. We use classification methods for anomaly detection in the corrected data stream. We also empirically show that the proposed approach shows a high accuracy of anomaly detection on a number of real datasets.
  • References (24)
    24 references, page 1 of 3

    Charu C. Aggarwal. 2009. On high dimensional projected clustering of uncertain data streams. In IEEE 25th International Conference on Data Engineering (ICDE'09). IEEE, Shanghai, China, 1152-1154. DOI:

    Charu C. Aggarwal and Philip S. Yu. 2008. A framework for clustering uncertain data streams. In IEEE 24th International Conference on Data Engineering (ICDE'08). IEEE, Cancun, Mexico, 150-159. DOI:

    Ian F. Akyildiz, Dario Pompili, and Tommaso Melodia. 2005. Underwater acoustic sensor networks: research challenges. Ad Hoc Netw. 3, 3 (2005), 257-279. DOI:

    Lv an Tang, Bin Cui, Hongyan Li, Gaoshan Miao, Dongqing Yang, and Xinbiao Zhou. 2007. Effective variation management for pseudo periodical streams. In Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data (SIGMOD'07). ACM, New York, NY, USA, 257-268. DOI:

    Arvind Arasu, Shivnath Babu, and Jennifer Widom. 2003. The CQL continuous query language: semantic foundations and query execution. Technical Report 2003-67. Stanford InfoLab. http://ilpubs.stanford. edu:8090/758/

    David Arthur and Sergei Vassilvitskii. 2007. K-means++: the advantages of careful seeding. In Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA'07). Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 1027-1035. id=1283383.1283494

    Johannes A falg, Hans-Peter Kriegel, Peer Kro┬Ęger, and Matthias Renz. 2009. Probabilistic similarity search for uncertain time series. In Scientific and Statistical Database Management, Marianne Winslett (Ed.). Lecture Notes in Computer Science, Vol. 5566. Springer Berlin Heidelberg, New Orleans, LA, USA, 435-443. DOI: 31

    Jianjun Chen, David J. DeWitt, Feng Tian, and Yuan Wang. 2000. NiagaraCQ: a scalable continuous query system for internet databases. In Proceedings of ACM SIGMOD International Conference on Management of Data (SIGMOD'00). 379-390.

    CSIRO. 2011. Sensors and Sensor Networks 2010-2011 Year in Review. (2011). news/sensors-and-sensor-networks-2010-2011-year-in-review

    Michele Dallachiesa, Besmira Nushi, Katsiaryna Mirylenka, and Themis Palpanas. 2012. Uncertain time-series similarity: return to the basics. Proc. VLDB Endow. 5, 11 (July 2012), 1662-1673. DOI:

  • Metrics
    No metrics available