Cost-effective online trending topic detection and popularity prediction in microblogging

Article English OPEN
Miao, Zhongchen ; Chen, Kai ; Fang, Yi ; He, Jianhua ; Zhou, Yi ; Zhang, Wenjun ; Zha, Hongyuan (2017)

Identifying topic trends on microblogging services such as Twitter and estimating those topics’ future popularity have great academic and business value, especially when the operations can be done in real time. For any third party, however, capturing and processing such huge volumes of real-time data in microblogs are almost infeasible tasks, as there always exist API (Application Program Interface) request limits, monitoring and computing budgets, as well as timeliness requirements. To deal with these challenges, we propose a cost-effective system framework with algorithms that can automatically select a subset of representative users in microblogging networks in offline, under given cost constraints. Then the proposed system can online monitor and utilize only these selected users’ real-time microposts to detect the overall trending topics and predict their future popularity among the whole microblogging network. Therefore, our proposed system framework is practical for real-time usage as it avoids the high cost in capturing and processing full real-time data, while not compromising detection and prediction performance under given cost constraints. Experiments with real microblogs dataset show that by tracking only 500 users out of 0.6 million users and processing no more than 30,000 microposts daily, about 92% trending topics could be detected and predicted by the proposed system and, on average, more than 10 hours earlier than they appear in official trends lists.
  • References (48)
    48 references, page 1 of 5

    Mohamed Ahmed, Stella Spagna, Felipe Huici, and Saverio Niccolini. 2013. A Peek into the Future: Predicting the Evolution of Popularity in User Generated Content. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining (WSDM '13). ACM, New York, NY, USA, 607-616. DOI:http://dx.doi.org/10.1145/2433396.2433473

    James Allan (Ed.). 2002. Topic Detection and Tracking: Event-based Information Organization. Kluwer Academic Publishers, Norwell, MA, USA. http://dl.acm.org/citation.cfm?id=772260

    Foteini Alvanaki, Sebastian Michel, Krithi Ramamritham, and Gerhard Weikum. 2012. See What's enBlogue: Real-time Emergent Topic Identification in Social Media. In Proceedings of the 15th International Conference on Extending Database Technology (EDBT '12). ACM, New York, NY, USA, 336-347. DOI:http://dx.doi.org/10.1145/2247596.2247636

    Sitaram Asur, Bernardo A. Huberman, Gabor Szabo, and Chunyan Wang. 2011. Trends in Social Media: Persistence and Decay. SSRN Electronic Journal (Feb. 2011). DOI:http://dx.doi.org/10.2139/ssrn.1755748

    Roja Bandari, Sitaram Asur, and Bernardo A. Huberman. 2012. The Pulse of News in Social Media: Forecasting Popularity. In Proceedings of the Sixth International Conference on Weblogs and Social Media (ICWSM '12). http://www.aaai.org/ocs/index.php/ICWSM/ICWSM12/paper/view/4646

    Livio Bertacco. 2006. Exact and Heuristic Methods for Mixed Integer Linear Programs. Ph.D. Dissertation. Ph. D. thesis, Universita` degli Studi di Padova.

    Bin Bi, Yuanyuan Tian, Yannis Sismanis, Andrey Balmin, and Junghoo Cho. 2014. Scalable Topicspecific Influence Analysis on Microblogs. In Proceedings of the 7th ACM International Conference on Web Search and Data Mining (WSDM '14). ACM, New York, NY, USA, 513-522. DOI:http://dx.doi.org/10.1145/2556195.2556229

    Petko Bogdanov, Michael Busch, Jeff Moehlis, Ambuj K. Singh, and Boleslaw K. Szymanski. 2013. The Social Media Genome: Modeling Individual Topic-specific Behavior in Social Media. In Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM '13). ACM, New York, NY, USA, 236-242. DOI:http://dx.doi.org/10.1145/2492517.2492621

    Sergey Brin and Lawrence Page. 2012. Reprint of: The anatomy of a large-scale hypertextual web search engine. Computer Networks 56, 18 (2012), 3825-3833. DOI:http://dx.doi.org/10.1016/j.comnet.2012.10.007

    Mario Cataldi, Luigi Di Caro, and Claudio Schifanella. 2010. Emerging Topic Detection on Twitter Based on Temporal and Social Terms Evaluation. In Proceedings of the Tenth International Workshop on Multimedia Data Mining (MDMKDD '10). ACM, New York, NY, USA, Article 4, 10 pages. DOI:http://dx.doi.org/10.1145/1814245.1814249

  • Software (1)
  • Metrics
    0
    views in OpenAIRE
    0
    views in local repository
    64
    downloads in local repository

    The information is available from the following content providers:

    From Number Of Views Number Of Downloads
    Aston Publications Explorer - IRUS-UK 0 64
Share - Bookmark