2023
DOI: 10.3390/data8060109

Dataset of Program Source Codes Solving Unique Programming Exercises Generated by Digital Teaching Assistant

Abstract: This paper presents a dataset containing automatically collected source codes solving unique programming exercises of different types. The programming exercises were automatically generated by the Digital Teaching Assistant (DTA) system that automates a massive Python programming course at MIREA—Russian Technological University (RTU MIREA). Source codes of the small programs grouped by the type of the solved task can be used for benchmarking source code classification and clustering algorithms. Moreover, the d…

Cited by 6 publications (29 citation statements)
References 39 publications
“…In addition, we calculated the occurrence frequencies of different code complexity metrics for each task type, and the results are shown in Figure 9. As is shown in Figure 10, the EduC metric is the strictest metric, rejecting 1603 more programs when compared to the CycC metric, which was previously used in the DTA system [25,45]. We suggest the use of the same threshold for EduC as for CycC (EduC ≤ 10) in autograding systems automating educational programming courses, including the DTA system.…”
Section: Question 2 (Rq2) Is the Developed Educ Metric At Least As St...
Citation type: mentioning
confidence: 86%
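
The threshold rule quoted above (reject a program whose complexity exceeds 10) can be illustrated with a minimal sketch. It approximates cyclomatic complexity (CycC) only; the quoted text does not define EduC, so it is not implemented here. The function names `cyclomatic_complexity` and `passes_threshold` are hypothetical, the set of counted node types is a simplifying assumption, and nothing here is taken from the DTA system itself.

```python
import ast

# Assumption: these node types are treated as decision points.
# Real tools may count additional constructs (e.g. match/case, assert).
_BRANCH_NODES = (
    ast.If, ast.For, ast.While, ast.IfExp,
    ast.ExceptHandler, ast.comprehension,
)


def cyclomatic_complexity(source: str) -> int:
    """Approximate McCabe complexity as 1 + number of decision points."""
    tree = ast.parse(source)
    complexity = 1
    for node in ast.walk(tree):
        if isinstance(node, _BRANCH_NODES):
            complexity += 1
        elif isinstance(node, ast.BoolOp):
            # `a and b and c` has three operands and adds two extra paths.
            complexity += len(node.values) - 1
    return complexity


def passes_threshold(source: str, limit: int = 10) -> bool:
    """Hypothetical autograder check: accept only if complexity <= limit."""
    return cyclomatic_complexity(source) <= limit


sample = """
def main(x):
    if x > 0 and x < 10:
        return 1
    for i in range(x):
        if i % 2:
            return i
    return 0
"""

print(cyclomatic_complexity(sample), passes_threshold(sample))
```

The sample contains two `if` statements, one `for` loop, and one two-operand `and`, so the sketch counts four decision points on top of the base value of 1.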
“…The DTA system supports automatic generation and automatic checking of tasks of 11 types, including formal notation into source code translation tasks and conversion between data format tasks. The task types supported in DTA are [45]:…”
Section: Programs Solving Unique Programming Exercises
Citation type: mentioning
confidence: 99%