On the role of message broker middleware for many-task computing on a big-data platform

descriptionPublicationkeyboard_double_arrow_right Article 31 Mar 2018 English Publisher:Springer Science and Business Media LLCJournal:Cluster Computing, volume 22, pages 2,527-2,540 (issn: 1386-7857, eissn: 1573-7543,

Authors: Cao Ngoc Nguyen; Jaehwan Lee; Soonwook Hwang; Jik-Soo Kim;

doi: 10.1007/s10586-018-2634-9

On the role of message broker middleware for many-task computing on a big-data platform

- Summary
- Metrics

Abstract

We have designed and implemented a new data processing framework called “Many-task computing On HAdoop” (MOHA) which aims to effectively support fine-grained many-task applications that can show another type of data-intensive workloads in the YARN-based Hadoop 2.0 platform. MOHA is developed as one of Hadoop YARN applications so that it can transparently co-host existing many-task computing (MTC) applications with other data processing workflows such as MapReduce in a single Hadoop cluster. In this paper, we investigate main characteristics of two well-known open-source message broker middleware systems (Apache ActiveMQ and Kafka) and their implications on a many-task management scheme in our MOHA framework. Through our extensive experiments with a real MTC application, we demonstrate and discuss trade-offs between parallelism and load balancing of data access patterns in message broker middleware systems for Many-Task Computing on Hadoop.

Related Organizations

Korean Association Of Science and Technology Studies
Korea (Republic of)
Korea Institute of Science & Technology Information
Korea (Republic of)
Korea Aerospace University
Korea (Republic of)
Myongji University
Korea (Republic of)
Korea University of Science and Technology
Korea (Republic of)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	12
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Top 10%

Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now