2023
DOI: 10.1609/aaai.v37i11.26499
|View full text |Cite
|
Sign up to set email alerts
|

MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing

Abstract: Text-to-SQL semantic parsing is an important NLP task, which facilitates the interaction between users and the database. Much recent progress in text-to-SQL has been driven by large-scale datasets, but most of them are centered on English. In this work, we present MultiSpider, the largest multilingual text-to-SQL semantic parsing dataset which covers seven languages (English, German, French, Spanish, Japanese, Chinese, and Vietnamese). Upon MultiSpider we further identify the lexical and structural challenges … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 21 publications
0
2
0
Order By: Relevance
“…The efforts of the scientific community to build systems for cross-lingual semantic parsing have led to the development of a series of relevant benchmark datasets (Min et al, 2019;Dou et al, 2023;Zhang et al, 2023). Most recent approaches for cross-lingual semantic parsing have sought to localise parsers to new target languages using backtranslations (Sherborne et al, 2020) or machine translation (Xia and Monti, 2021;Shi et al, 2022).…”
Section: Cross-lingual Semantic Parsingmentioning
confidence: 99%
See 1 more Smart Citation
“…The efforts of the scientific community to build systems for cross-lingual semantic parsing have led to the development of a series of relevant benchmark datasets (Min et al, 2019;Dou et al, 2023;Zhang et al, 2023). Most recent approaches for cross-lingual semantic parsing have sought to localise parsers to new target languages using backtranslations (Sherborne et al, 2020) or machine translation (Xia and Monti, 2021;Shi et al, 2022).…”
Section: Cross-lingual Semantic Parsingmentioning
confidence: 99%
“…In an effort to alleviate the shortcomings of Textto-SQL solutions, increasingly difficult datasets and benchmarks have been developed (Zhong et al, 2017;Yu et al, 2018;Min et al, 2019;Dou et al, 2023;Zhang et al, 2023). The efforts to explore the generalisability of such Text-to-SQL systems have recently culminated with the introduction of multiple database datasets, which distinguish between training and evaluation databases.…”
Section: Introductionmentioning
confidence: 99%