2019 14th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP) 2019
DOI: 10.1109/isai-nlp48611.2019.9045143
|View full text |Cite
|
Sign up to set email alerts
|

The First Wikipedia Questions and Factoid Answers Corpus in the Thai Language

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 4 publications
0
9
0
Order By: Relevance
“…As shown in Figure 1, the proposed framework works as follows. Firstly, we aggregate, clean and normalize our datasets: TyDiQA , XQuAD (Artetxe et al, 2019), Iapp Wiki QA (Viriyayudhakorn and Polpanumas, 2021), and Thai QA (Trakultaweekoon et al, 2019). Then, we translate all questions into English and backtranslate to Thai using Google Translate.…”
Section: Words In Different Frequency Groupmentioning
confidence: 99%
“…As shown in Figure 1, the proposed framework works as follows. Firstly, we aggregate, clean and normalize our datasets: TyDiQA , XQuAD (Artetxe et al, 2019), Iapp Wiki QA (Viriyayudhakorn and Polpanumas, 2021), and Thai QA (Trakultaweekoon et al, 2019). Then, we translate all questions into English and backtranslate to Thai using Google Translate.…”
Section: Words In Different Frequency Groupmentioning
confidence: 99%
“…There are two Thai QA corpora used in our e Wiki QA. The dataset statistics of both datasets are Thai Wiki QA [8] is a SQuAD-like dataset in th competition dataset in Thailand National Software C dataset consists of 15,000 question-answer pairs wi annotated by 15 native Thai speakers with many ki The publisher of Thai Wiki QA also published 125,3 this dataset as an open domain QA task. In this stud for generating more question answering samples.…”
Section: Datasetsmentioning
confidence: 99%
“…The dataset statistics of both datasets are shown in Table 3. Thai Wiki QA [8] is a SQuAD-like dataset in the Thai language. It was used as a QA competition dataset in Thailand National Software Contest (NSC), during 2018-2019.…”
Section: Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…This model is implemented using a convolutional neural network, a bidirectional long short-term memory network, and question pair matching to perform QA processing. In still another study, Kanokorn et al [13] proposed an information extraction process for both questions and answers that uses the Thai language and a related corpus. This research resulted in a web-based QA system whose answers are factoids extracted from Thai Wikipedia articles.…”
Section: Related Workmentioning
confidence: 99%