publication . Doctoral thesis . 2018

Enhancing Apache AsterixDB for Efficient Big Data Search and Analytics

Kim, Taewoo
Open Access English
  • Published: 01 Jan 2018
  • Publisher: eScholarship, University of California
  • Country: United States
Abstract
In a typical minute of a day in 2018, the Internet generates 3,138 terabytes of traffic, Twitter users send 473,000 tweets, and two million snaps are sent on Snapchat. By 2020, it is estimated that 1.7 MB of data will be created every second, on average, for each person on Earth. Given the large volumes of Big Data, efficient search methods and analytics are required to explore such data. Thus, there is a clear need for Big Data management systems, such as Apache AsterixDB, that enable users and applications to search and explore Big Data. Initiated in 2009, the AsterixDB project integrated ideas from three distinct areas - semi-structured data, parallel databas...
Subjects
free text keywords: Computer science, big data search and analytics, index-only plan, memory management, similarity query, sort, hash-based group by and join, text search