Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2021
DOI: 10.18653/v1/2021.naacl-main.43
|View full text |Cite
|
Sign up to set email alerts
|

Open Domain Question Answering over Tables via Dense Retrieval

Abstract: Recent advances in open-domain QA have led to strong models based on dense retrieval, but only focused on retrieving textual passages. In this work, we tackle open-domain QA over tables for the first time, and show that retrieval can be improved by a retriever designed to handle tabular context. We present an effective pre-training procedure for our retriever and improve retrieval quality with mined hard negatives. As relevant datasets are missing, we extract a subset of NATURAL QUESTIONS (Kwiatkowski et al., … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
41
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 40 publications
(46 citation statements)
references
References 14 publications
0
41
0
Order By: Relevance
“…As in Karpukhin et al's (2020) approach, this goal is achieved using hard-negatives retrieved from all the tables from the English Wikipedia dump as well as in-batch negatives. DTR outperforms BM25 by more than 40 percentage points on the NQ-TABLES dataset (Herzig et al, 2021). However, the experiments also show that TAPAS requires additional pre-training on the task of table retrieval on millions of tables scraped from Wikipedia.…”
Section: Table Retrievalmentioning
confidence: 91%
See 4 more Smart Citations
“…As in Karpukhin et al's (2020) approach, this goal is achieved using hard-negatives retrieved from all the tables from the English Wikipedia dump as well as in-batch negatives. DTR outperforms BM25 by more than 40 percentage points on the NQ-TABLES dataset (Herzig et al, 2021). However, the experiments also show that TAPAS requires additional pre-training on the task of table retrieval on millions of tables scraped from Wikipedia.…”
Section: Table Retrievalmentioning
confidence: 91%
“…There are four deep learning approaches for table retrieval (Shraga et al, 2020b;Pan et al, 2021;Chen et al, 2020b;Herzig et al, 2021). Shraga et al (2020b) treat tables and queries as multi-modal objects that consist of query, table caption, schema, rows and columns.…”
Section: Table Retrievalmentioning
confidence: 99%
See 3 more Smart Citations