An Effective and Efficient Similarity-Matrix-Based Algorithm for Clustering Big Mobile Social Data

Publication: Article, Conference object; 01 Dec 2016; Italy; Publisher: IEEE; Journal: 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Authors: Bordogna, Gloria; Cuzzocrea, Alfredo; Frigerio, Luca; Psaila, Giuseppe

doi: 10.1109/icmla.2016.0091

handle: 11368/2898231, 20.500.14243/333936, 10446/82824, 20.500.11770/312768

Abstract

Nowadays a great deal of attention is devoted to supporting big data analytics over big mobile social data. These data are generated by modern social systems such as Twitter, Facebook, Instagram, and so forth. Mining big mobile social data is of great interest, as analyzing such data is critical for a wide spectrum of big data applications (e.g., smart cities). Among several proposals, clustering is a well-known solution for extracting interesting and actionable knowledge from massive amounts of big mobile (geo-located) social data. Inspired by this thesis, this paper proposes an effective and efficient similarity-matrix-based algorithm for clustering big mobile social data, called TourMiner, which specifically targets the clustering of trips extracted from tweets in order to mine the most popular tours. The main characteristic of TourMiner is that it applies clustering over a well-suited similarity matrix computed on top of trips. A comprehensive experimental assessment and analysis over Twitter data confirms the benefits of our proposal.
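
The abstract does not specify how TourMiner builds its similarity matrix or which clustering step it applies, so the following is only a minimal sketch of the general idea of clustering trips over a pairwise similarity matrix. Everything in it is an assumption made for illustration: trips are modeled as lists of place names, similarity is Jaccard overlap of the visited places, clusters are the connected components of the graph linking trips whose similarity reaches a threshold, and the function names (`jaccard`, `similarity_matrix`, `cluster_by_threshold`) and sample data are hypothetical rather than taken from the paper.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard overlap of the places visited by two trips (assumed measure)."""
    sa, sb = set(a), set(b)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 1.0


def similarity_matrix(trips):
    """Build a symmetric n x n matrix of pairwise trip similarities."""
    n = len(trips)
    sim = [[1.0] * n for _ in range(n)]  # diagonal stays 1.0
    for i, j in combinations(range(n), 2):
        sim[i][j] = sim[j][i] = jaccard(trips[i], trips[j])
    return sim


def cluster_by_threshold(sim, threshold):
    """Group trips into connected components of the 'similar enough' graph."""
    n = len(sim)
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in combinations(range(n), 2):
        if sim[i][j] >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical trips, each a sequence of visited places.
trips = [
    ["Duomo", "Castello", "Navigli"],
    ["Duomo", "Castello", "Brera"],
    ["Colosseo", "Pantheon"],
    ["Colosseo", "Pantheon", "Trevi"],
]
sim = similarity_matrix(trips)
clusters = cluster_by_threshold(sim, threshold=0.5)
print(clusters)
```

With this sample data and a threshold of 0.5, trips 0 and 1 fall into one cluster and trips 2 and 3 into another. A real system would likely use a similarity measure that also accounts for visiting order, time, and geographic distance.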

Country

Italy

Related Organizations

National Research Council
Sri Lanka
National Research Council
Italy
University of Bergamo
Italy
University of Calabria
Italy
Istituto per il Rilevamento Elettromagnetico dell'Ambiente
Italy

Keywords

Machine Learning; Big Data; Clustering algorithms; Big data analytics; Big data clustering; Big mobile social data; Artificial Intelligence; Computer Networks and Communications; Computer Science Applications; Computer Vision and Pattern Recognition; Trajectory; Social network analytics

Impact by BIP!

Selected citations: 8. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.