2022
DOI: 10.48550/arxiv.2204.00498
Preprint
Evaluating the Text-to-SQL Capabilities of Large Language Models

Abstract: We perform an empirical evaluation of Text-to-SQL capabilities of the Codex language model. We find that, without any finetuning, Codex is a strong baseline on the Spider benchmark; we also analyze the failure modes of Codex in this setting. Furthermore, we demonstrate on the GeoQuery and Scholar benchmarks that a small number of in-domain examples provided in the prompt enables Codex to perform better than state-of-the-art models finetuned on such few-shot examples.
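
To make the abstract's setup concrete, here is a minimal sketch (not from the paper) of a few-shot Text-to-SQL prompt: the database schema is serialized as CREATE TABLE statements, a handful of in-domain question/SQL pairs are prepended, and the model is left to complete the SQL for the new question. The schema, table names, and example pairs below are hypothetical, loosely in the style of GeoQuery.

```python
# Hypothetical few-shot Text-to-SQL prompt construction. The schema and
# question/SQL pairs are illustrative only; they are not from the paper.

SCHEMA = """CREATE TABLE city (city_name TEXT, state_name TEXT, population INT);
CREATE TABLE state (state_name TEXT, capital TEXT, area REAL);"""

FEW_SHOT_EXAMPLES = [
    ("What is the capital of Texas?",
     "SELECT capital FROM state WHERE state_name = 'texas';"),
    ("How many people live in Houston?",
     "SELECT population FROM city WHERE city_name = 'houston';"),
]

def build_prompt(question: str) -> str:
    """Assemble a few-shot Text-to-SQL prompt for a code LLM such as Codex."""
    parts = [SCHEMA, ""]
    for q, sql in FEW_SHOT_EXAMPLES:
        parts += [f"-- Question: {q}", sql, ""]
    parts += [f"-- Question: {question}", "SELECT"]  # model completes from here
    return "\n".join(parts)

if __name__ == "__main__":
    print(build_prompt("Which states have an area above 100000?"))
```

In the zero-shot variant the paper evaluates on Spider, the example pairs would simply be omitted, leaving only the serialized schema and the question.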

Cited by 17 publications (30 citation statements)
References 7 publications (9 reference statements)
“…More recently, large language models (LLMs) like GPT-3 [2] and Codex [3] have been shown to perform incredibly well in many NLP tasks without any training. [23] demonstrates Codex's near state-of-the-art performance on Spider in a zero-shot setting when prompted with in-context examples.…”
Section: Natural Language to SQL Translation
confidence: 92%
“…Translation from natural language to SQL (Text-to-SQL) has been widely studied by the NLP community [21,23,24,27,31,34]. Difficulties in text-to-SQL are mainly two-fold: encoding a variety of complex relationships between the user's query and multiple tables, and decoding the SQL with valid representations.…”
Section: Natural Language to SQL Translation
confidence: 99%
“…Recent large pretrained models can perform the task without task-specific architectures (Scholak et al., 2021b) or even in a zero/few-shot manner (Shin et al., 2021; Brown et al., 2020; Chen et al., 2021a). Rajkumar et al. (2022) evaluate Codex's text-to-SQL capability.…”
Section: Related Work
confidence: 99%
“…A seed semantic parser that is likely to generate a short list of candidates that contain the correct program. This requirement is not hard to satisfy in many applications, given that large language models often achieve high top-k accuracy on generating simple Python snippets (Chen et al., 2021a), JSON data (Poesia et al., 2022), Lispress (Shin et al., 2021) and SQL programs (Scholak et al., 2021b; Rajkumar et al., 2022) with only a few training examples and are likely to continue improving (Kaplan et al., 2020). For example, we achieved 95% top-32 accuracy on SPIDER without any task-specific engineering beyond few-shot prompting (e.g., specialized architectures (Wang et al., 2020), decoding constraints (Scholak et al., 2021b), etc.).…”
Section: Related Work
confidence: 99%
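
The "short list of candidates" idea in the last excerpt can be made concrete with an execution-based filter: sample k SQL candidates from the few-shot-prompted model and discard any the database rejects. The sketch below is hypothetical; the generate_candidates stub stands in for a real model call, and this is not the cited authors' implementation.

```python
# Hypothetical candidate-filtering sketch: keep only the sampled SQL
# candidates that the database accepts. Runs as-is with built-in sqlite3.
import sqlite3

def generate_candidates(question: str, k: int = 32) -> list[str]:
    """Stand-in for a top-k sample from a few-shot-prompted code LLM."""
    return [
        "SELECT capital FROM state WHERE state_name = 'texas';",  # valid
        "SELECT capitol FROM state WHERE state_name = 'texas';",  # bad column
    ]

def executable_candidates(candidates: list[str],
                          db: sqlite3.Connection) -> list[str]:
    """Filter the candidate list down to queries the database accepts."""
    kept = []
    for sql in candidates:
        try:
            db.execute(sql)
            kept.append(sql)
        except sqlite3.Error:
            pass  # discard candidates that fail to parse or execute
    return kept

if __name__ == "__main__":
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE state (state_name TEXT, capital TEXT)")
    print(executable_candidates(generate_candidates("capital of Texas?"), db))
```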