descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Mar 2014Publisher:IEEEJournal:2014 IEEE 30th International Conference on Data Engineering WorkshopsFunded by:SNSF | The Datacenter Observator...

Authors: Christian Tinnefeld; Donald Kossmann; Joos-Hendrik Boese; Hasso Plattner;

doi: 10.1109/icdew.2014.6818325

Parallel join executions in RAMCloud

- Summary
- Related research
  (10)
- Metrics

Abstract

Modern large-scale storage systems provide not only storage capacity, but also processing power. When such a storage system serves as persistence for a database application, it is desirable to utilize its processing power for supporting query execution. In this paper, we evaluate the parallel execution of join operations in Stanford's RAMCloud which is a DRAM-based storage system connected via RDMA-enabled network adapters. We a) provide a system model to derive the execution costs for the Grace Join, the Distributed Block Nested Loop Join, and the Cyclo Join algorithm and their corresponding implementations in RAMCloud. We describe b) how the execution time for a single join operation depends on factors such as relation sizes, numbers of nodes used for a join, and the chosen algorithm. We finally introduce and evaluate c) a set of heuristics for parameterizing the execution of many join operations in parallel with the goal of maximizing the throughput.

Related Organizations

10 Research products, page 1 of 1

An Empirical Evaluation of How the Network Impacts the Performance and Energy Efficiency in RAMCloud
2017IsAmongTopNSimilarDocuments
Making Large Transfers Fast for in-Memory Databases in Modern Networks
2019IsAmongTopNSimilarDocuments
High-Availability Evaluation
2015IsAmongTopNSimilarDocuments
Fast crash recovery in RAMCloud
2011IsAmongTopNSimilarDocuments
Elastic online analytical processing on RAMCloud
2013IsAmongTopNSimilarDocuments
The case for RAMClouds
2010IsAmongTopNSimilarDocuments
Characterizing Performance and Energy-Efficiency of the RAMCloud Storage System
2017IsAmongTopNSimilarDocuments
Building a columnar database on shared main memory-based storage
2014IsAmongTopNSimilarDocuments
High Throughput Log-Based Replication for Many Small In-Memory Objects
2016IsAmongTopNSimilarDocuments
Implementing linearizability at large scale and low latency
2015IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	6
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%