Translating Questions to SQL Queries with Generative Parsers Discriminatively Reranked.

Name: Translating Questions to SQL Queries with Generative Parsers Discriminatively Reranked.
Keywords: Natural Language Interface to Databases, Semantic Parsing, Reranking.

Giordani, Alessandra; Moschitti, Alessandro

Found an issue? Give us feedback

IRIS - Institutional...arrow_drop_down

IRIS - Institutional Research Information System of the University of Trento

Conference object . 2012

Data sources: IRIS - Institutional Research Information System of the University of Trento

DBLP

Conference object

Data sources: DBLP

Translating Questions to SQL Queries with Generative Parsers Discriminatively Reranked.

descriptionPublicationkeyboard_double_arrow_right Conference object 01 Jan 2012 Italy Publisher:Coling 2012

Authors: Giordani, Alessandra; Moschitti, Alessandro;

handle: 11572/95255

Translating Questions to SQL Queries with Generative Parsers Discriminatively Reranked.

- Summary
- Subjects
- Metrics

Abstract

In this paper, we define models for automatically translating a factoid question in natural language to an SQL query that retrieves the correct answer from a target relational database (DB). We exploit the DB structure to generate a set of candidate SQL queries, which we rerank with an SVM-ranker based on tree kernels. In particular, in the generation phase, we use (i) lexical dependencies in the question and (ii) the DB metadata, to build a set of plausible SELECT, WHERE and FROM clauses enriched with meaningful joins. We combine the clauses by means of rules and a heuristic weighting scheme, which allows for generating a ranked list of candidate SQL queries. This approach can be recursively applied to deal with complex questions, requiring nested SELECT instructions. Finally, we apply the reranker to reorder the list of question and SQL candidate pairs, whose members are represented as syntactic trees. The F1 of our model derived on standard benchmarks, 87% on the first question, is in line with the best models using external and expensive hand-crafted resources such as the question meaning interpretation. Moreover, our system shows a Recall of the correct answer of about 94% and 98% on the first 2 and 5 candidates, respectively. This is an interesting outcome considering that we only need pairs of questions and answers concerning a target DB (no SQL query is needed) to train our model.

Country

Italy

Related Organizations

University of Trento
Italy

Keywords

Natural Language Interface to Databases, Semantic Parsing, Reranking.

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now