Estimating Query Timings in Elasticsearch

Sikha Bagui; Evorell Fridge

Found an issue? Give us feedback

Transactions on Netw...arrow_drop_down

Transactions on Networks and Communications

Article . 2021 . Peer-reviewed

License: CC BY

Data sources: Crossref

Estimating Query Timings in Elasticsearch

descriptionPublicationkeyboard_double_arrow_right Article 23 Apr 2021Publisher:Scholar PublishingJournal:Transactions on Networks and Communications, volume 9, pages 15-36 (eissn: 2054-7420,

Copyright policy )

Authors: Sikha Bagui; Evorell Fridge;

doi: 10.14738/tnc.92.9887

Estimating Query Timings in Elasticsearch

- Summary
- Metrics

Abstract

In a shared Elasticsearch environment it can be useful to know how long a particular query will take to execute. This information can be used to enforce rate limiting or distribute requests equitably among multiple clusters. Elasticsearch uses multiple Lucene instances on multiple hosts as an underlying search engine implementation, but this abstraction makes it difficult to predict execution with previously known predictors such as the number of postings. This research investigates the ability of different pre-retrieval statistics, available through Elasticsearch, to accurately predict the execution time of queries on a typical Elasticsearch cluster. The number of terms in a query and the Total Term Frequency (TTF) from Elasticsearch’s API are found to significantly predict execution time. Regression models are then built and compared to find the most accurate method for predicting query time.

Related Organizations

University of West Florida
United States

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

hybrid