2022
DOI: 10.48550/arxiv.2210.14868
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Multi-lingual Evaluation of Code Generation Models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…These primarily focus on natural language to Python code generation: HumanEval (Chen et al, 2021), HumanEval+ (Liu et al, 2023b), APPS (Hendrycks et al, 2021), Code-Contests , MBPP , L2CEval (Ni et al, 2023). Their variants have been proposed to cover more languages, (Wang et al, 2022a;Cassano et al, 2022;Athiwaratkun et al, 2022). Many benchmarks have focused on code generation in APIs.…”
Section: Code Generationmentioning
confidence: 99%
“…These primarily focus on natural language to Python code generation: HumanEval (Chen et al, 2021), HumanEval+ (Liu et al, 2023b), APPS (Hendrycks et al, 2021), Code-Contests , MBPP , L2CEval (Ni et al, 2023). Their variants have been proposed to cover more languages, (Wang et al, 2022a;Cassano et al, 2022;Athiwaratkun et al, 2022). Many benchmarks have focused on code generation in APIs.…”
Section: Code Generationmentioning
confidence: 99%
“…BabelCode shares many design similarities to the concurrent work from Athiwaratkun et al (2022). Specifically, we follow the same approach to inferring argument and return types.…”
Section: Framework Designmentioning
confidence: 99%
“…We summarize the high-level differences between Babel-Code and prior works in Table 1. The MBXP framework from Athiwaratkun et al (2022) is the most similar to our work as discussed in subsection 2.1. Similar to BabelCode, MBXP does have individual test-case results; however, it uses assert statements and thus can only determine the first test-case that fails.…”
Section: Differences To Prior Workmentioning
confidence: 99%
See 2 more Smart Citations