Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering 2022
DOI: 10.1145/3551349.3556953
|View full text |Cite
|
Sign up to set email alerts
|

Natural Test Generation for Precise Testing of Question Answering Software

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 20 publications
(5 citation statements)
references
References 26 publications
0
5
0
Order By: Relevance
“…This approach benefits from existing techniques for evaluating question-answer (QA) language models [12,30,59,78]. Multiple studies proposed reliability testing techniques for the QA software [11,20,34,47,59,60]. Among them, Shen et al [60] outperformed previous state-of-the-art approaches by discovering 23% more bugs or inconsistencies in the target answer-generating models and the Challenger component adopted their question mutation technique for generating challenging questions.…”
Section: Mutationmentioning
confidence: 99%
See 2 more Smart Citations
“…This approach benefits from existing techniques for evaluating question-answer (QA) language models [12,30,59,78]. Multiple studies proposed reliability testing techniques for the QA software [11,20,34,47,59,60]. Among them, Shen et al [60] outperformed previous state-of-the-art approaches by discovering 23% more bugs or inconsistencies in the target answer-generating models and the Challenger component adopted their question mutation technique for generating challenging questions.…”
Section: Mutationmentioning
confidence: 99%
“…Multiple studies proposed reliability testing techniques for the QA software [11,20,34,47,59,60]. Among them, Shen et al [60] outperformed previous state-of-the-art approaches by discovering 23% more bugs or inconsistencies in the target answer-generating models and the Challenger component adopted their question mutation technique for generating challenging questions. For this, our mutation challenger employs the sentence-level metamorphic testing technique, QAQA, proposed by [60].…”
Section: Mutationmentioning
confidence: 99%
See 1 more Smart Citation
“…Shen et al 67 leveraged sentenceā€level mutation to generate natural test inputs to achieve precise testing of question answering software. You et al 68 proposed DRFuzz, which aims at generating test inputs triggering regression faults for DNN models.…”
Section: Related Workmentioning
confidence: 99%
“…NLP software, such as machine translation software and chatbot, has also been widely used in human life. Similar to AI software, researchers have proposed variance methods to validate the reliability of NLP software on the correctness [18,19,40,44], toxicity [53,54], fairness [47,52,57].…”
Section: Related Work 61 Testing Of Ai Softwarementioning
confidence: 99%