Capturing Analytical Intents from Text

Name: Capturing Analytical Intents from Text
Keywords: Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació, Large language models, Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural, Analytical intents, Data science

Pons Recasens, Gerard; Dimic, Miona; Bilalli, Besim

Found an issue? Give us feedback

downloadFull-Text

UPCommons. Portal de...arrow_drop_down

UPCommons. Portal del coneixement obert de la UPC

Conference object . 2024 . Peer-reviewed

Full-Text: https://upcommons.upc.edu/bitstreams/d87e5b22-acde-411a-8a5a-4379dca3b380/download

Data sources: UPCommons. Portal del coneixement obert de la UPC

Recolector de Ciencia Abierta, RECOLECTA

Conference object . 2024 . Peer-reviewed

Data sources: Recolector de Ciencia Abierta, RECOLECTA

https://doi.org/10.1007/978-3-...

Part of book or chapter of book . 2024 . Peer-reviewed

License: Springer Nature TDM

Data sources: Crossref

Capturing Analytical Intents from Text

descriptionPublicationkeyboard_double_arrow_right Part of book or chapter of book , Conference object 14 Nov 2024 English Publisher:Springer Nature Switzerland

Authors: Pons Recasens, Gerard; Dimic, Miona; Bilalli, Besim;

doi: 10.1007/978-3-031-70421-5_8

handle: 2117/426742

Capturing Analytical Intents from Text

- Summary
- Subjects
- Metrics

Abstract

The ability to extract valuable information from data is crucial for organizations and individuals who want to remain competitive in a constantly evolving data-driven environment. However, some of them lack the skills required to appropriately leverage the existing data analytics tools and methods. This problem is aggravated when the users are domain-experts but completely unfamiliar with data analytics terminology, as existing assistant tools, such as AutoML or Intelligent Discovery Assistants, require them to state their analytical intent (i.e., the type of data analysis they want to perform). To address this problem, we propose to capture the underlying analytical intent from textual problem descriptions by leveraging Large Language Models (LLMs). To this end, we propose a hierarchical categorization of analytical intents, along with a data collection methodology to obtain analytical problem descriptions for all of them in order to validate different approaches that aim to extract such intents from text. Next, we compare the performance of state-of-the-art approaches with LLMs, and then study the performance of different LLMs based on their characteristics and the impact of the source of validation data. Finally, we develop a prototype to showcase how our method could interact with existing AutoML systems.

Gerard Pons is supported by the EU’s Horizon Programme call, under Grant Agreement No. 101093164 (ExtremeXP), and Besim Bilalli is partially supported by the DOGO4ML project, funded by the Spanish Ministerio de Ciencia i Innovación under the funding scheme PID2020-117191RB-I00/AEI/10.13039/501100011033.

Peer Reviewed

Related Organizations

Universitat Polite`cnica de Catalunya
Spain

Keywords

Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació, Large language models, Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural, Analytical intents, Data science

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average