
We consider the problem of improving the performance of OLAP applications in a database cluster (DBC), which is a low cost and effective parallel solution for query processing. Current DBC solutions for OLAP query processing provide for intra-query parallelism only, at the cost of full replication of the database. In this paper, we proposemore efficient distributed database design alternatives which combine physical/virtual partitioning with partial replication.We also propose a new load balancing strategy that takes advantage of an adaptive virtual partitioning to redistribute the load to the replicas. Our experimental validation is based on the implementation of our solution on the SmaQSS DBC middleware prototype. Our experimental results using the TPC-H benchmark and a 32-node cluster show very good speedup.
[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Dynamic load balancing, OLAP query processing · Partial replication, [INFO.INFO-DC] Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], ACM: H.: Information Systems/H.2: DATABASE MANAGEMENT/H.2.6: Database Machines, [INFO.INFO-DB] Computer Science [cs]/Databases [cs.DB], ACM: E.: Data, Database clusters, Virtual partitioning, ACM: H.: Information Systems/H.2: DATABASE MANAGEMENT/H.2.4: Systems, Parallel databases
[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Dynamic load balancing, OLAP query processing · Partial replication, [INFO.INFO-DC] Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], ACM: H.: Information Systems/H.2: DATABASE MANAGEMENT/H.2.6: Database Machines, [INFO.INFO-DB] Computer Science [cs]/Databases [cs.DB], ACM: E.: Data, Database clusters, Virtual partitioning, ACM: H.: Information Systems/H.2: DATABASE MANAGEMENT/H.2.4: Systems, Parallel databases
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 36 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
