Leveraging Large Language Model for Enhanced Text-to-SQL Parsing

Name: Leveraging Large Language Model for Enhanced Text-to-SQL Parsing
Keywords: LLM, SQL generation, deep learning, Electrical engineering. Electronics. Nuclear engineering, Semantic parsing, TK1-9971

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2025Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Access, volume 13, pages 30,497-30,504 (eissn: 2169-3536,

Authors: Zecheng Zhan; E. Haihong; Meina Song;

doi: 10.1109/access.2025.3540072

Leveraging Large Language Model for Enhanced Text-to-SQL Parsing

- Summary
- Subjects
- Metrics

Abstract

Text-to-SQL conversion, the process of generating SQL queries from natural language input, has gained significant attention due to its potential to simplify database interaction. Although benchmarks in this task have driven advancements in the field, the challenges posed by complex join logic and the rich diversity of natural language expressions remain significant obstacles. These complexities underscore the ongoing difficulty of accurately bridging the gap between natural language and structured query representations, particularly in cross-domain and real-world scenarios. Recent research, including intermediate representations, relation-aware transformers, and large language models such as T5 and LLaMA, has improved performance by addressing the semantic gap between natural language and SQL. In this work, we propose SLENet, a novel approach that uses state-of-the-art large language models (LLMs) to enhance semantic understanding and SQL generation. Our method integrates three core innovations: (1) the use of advanced LLMs for context-aware representations, (2) syntax-constrained SQL decoder to ensure grammatical correctness, and (3) search-based prompt optimization utilizing external knowledge sources like WikiSQL. These innovations collectively address schema comprehension and SQL generation complexities. Evaluations on the Spider benchmark demonstrate that SLENet significantly outperforms existing methods, achieving higher exact matching accuracy and effectively handling complex SQL components. Our contributions highlight the importance of combining LLMs with syntax constraints and external data for advancing cross-domain semantic parsing.

Related Organizations

Beijing University of Posts and Telecommunications
China (People's Republic of)

Keywords

LLM, SQL generation, deep learning, Electrical engineering. Electronics. Nuclear engineering, Semantic parsing, TK1-9971

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

gold