A deep learning approach to identifying source code in images and video

Ott, Jordan; Atchison, Abigail; Harnack, Paul; Bergh, A.M.H.L. Serlier-van den; Linstead, Erik

doi:10.1145/3196398.3196402

Cited by 49 publications

(43 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The VGG network shown in Fig. 1b is a very popular model used across many domains, and was chosen because it has recently been applied to software mining [2]. The VGG model has a convenient architecture in which multiple convolutional operations occur in succession, followed by a max pooling layer for down-sampling.…”

Section: Methodsmentioning

confidence: 99%

“…1 School of Information and Computer Science, University of California, Irvine, Irvine, CA, USA. 2 Fowler School of Engineering, Chapman University, One University Dr., Orange, CA 92866, USA.…”

Section: Authors' Contributionsmentioning

confidence: 99%

“…

…”

mentioning

confidence: 99%

See 2 more Smart Citations

Exploring the applicability of low-shot learning in mining software repositories

2019

Self Cite

View full text Add to dashboard Cite

IntroductionIn the past couple of years, applications of deep learning to mining software repositories have grown in number and diversity of methods [1][2][3][4][5]. Fueled in part by easy-touse libraries and graphics processing unit (GPU) computing, deep architectures have facilitated new avenues for research, often producing results that far surpass previous techniques. However, despite their advantages, the huge amount of labeled truth data traditionally required to train deep architectures for classification tasks, as well as the computational time required to iteratively improve such models, remains a substantial bottleneck [6]. As a result, some researchers are forced to turn away from deep architectures, despite the fact that for certain tasks (like image analysis and computer vision), deep learning consistently outperforms alternative algorithms and methodologies.Low-shot learning refers to the practice of training machine learning models, including deep neural networks, using far fewer samples of each classification category than what is typically standard practice. In the extreme case, training data consists of only one instance for each target class, which is known as one-shot learning [7]. These approaches Abstract Background: Despite the well-documented and numerous recent successes of deep learning, the application of standard deep architectures to many classification problems within empirical software engineering remains problematic due to the large volumes of labeled data required for training. Here we make the argument that, for some problems, this hurdle can be overcome by taking advantage of low-shot learning in combination with simpler deep architectures that reduce the total number of parameters that need to be learned. Findings:We apply low-shot learning to the task of classifying UML class and sequence diagrams from Github, and demonstrate that surprisingly good performance can be achieved by using only tens or hundreds of examples for each category when paired with an appropriate architecture. Using a large, off-the-shelf architecture, on the other hand, doesn't perform beyond random guessing even when trained on thousands of samples. Conclusion:Our findings suggest that identifying problems within empirical software engineering that lend themselves to low-shot learning could accelerate the adoption of deep learning algorithms within the empirical software engineering community. which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Authors' Contributionsmentioning

confidence: 99%

“…

…”

mentioning

confidence: 99%

See 1 more Smart Citation

Exploring the applicability of low-shot learning in mining software repositories

2019

Self Cite

View full text Add to dashboard Cite

show abstract

“…Yadid et al [3] present ACE, a tool that combines language models and image processing techniques to extract source code from software development videos. Ott et al [2] further improved the identification of code fragments using deep learning algorithm. Ponzanelli et al [4,14] employed the extracted code as features to segment development videos, which can efficiently assist developers to focus on the key point in video.…”

Section: Related Workmentioning

confidence: 99%

“…Many efforts have been taken to enhance software development tutorials [2,3,4,5], which could be classified into two categories: text tutorials and video tutorials. However, they may not be sufficient to meet the targeted learning needs if used individually.…”

Section: Introductionmentioning

confidence: 99%

FVT: A Fragmented Video Tutor for "Dubbing" Software Development Tutorials

Nong

Zhang

Huang

et al. 2019

EasyChair Preprints

View full text Add to dashboard Cite

Rapid growth of online resources provides massive supports for developers to fulfill their learning tasks. Text tutorial and video tutorial, as the two most common forms of online resources, may not be sufficient to meet developers' specific learning needs if used individually. Text tutorials are well-structured and easy to be navigated, however, digesting the text description may not be always pleasant. Video tutorials are intuitive and easy to follow, however, the pre-determined teaching flow can be very distracting if a specific piece of knowledge is targeted. In this study, we proposed a novel method and its supporting tool --- Fragmented Video Tutor (FVT), to facilitate the learning tasks with specific objectives, aiming at augmenting the strengths of both forms of tutorials while offsetting the weaknesses inherent to use each form of tutorials by itself. Specifically, FVT leverages the code snippets extracted from video tutorials as the bridge to link the video fragments in a video tutorial to the relevant sections in text tutorials. The preliminary evaluation results demonstrate that the FVT is a feasible approach to link two forms of tutorials and improve the learning effectiveness and efficiency for developers.

show abstract

Synergies Between Artificial Intelligence and Software Engineering: Evolution and Trends

Ramírez

Romero

2022

Handbook on Artificial Intelligence-Empowered Applied Software Engineering

View full text Add to dashboard Cite

A deep learning approach to identifying source code in images and video

Cited by 49 publications

References 32 publications

Exploring the applicability of low-shot learning in mining software repositories

Exploring the applicability of low-shot learning in mining software repositories

FVT: A Fragmented Video Tutor for "Dubbing" Software Development Tutorials

Synergies Between Artificial Intelligence and Software Engineering: Evolution and Trends

Contact Info

Product

Resources

About