Proceedings of the 15th International Conference on Mining Software Repositories 2018
DOI: 10.1145/3196398.3196402
|View full text |Cite
|
Sign up to set email alerts
|

A deep learning approach to identifying source code in images and video

Abstract: While substantial progress has been made in mining code on an Internet scale, efforts to date have been overwhelmingly focused on data sets where source code is represented natively as text. Large volumes of source code available online and embedded in technical videos have remained largely unexplored, due in part to the complexity of extraction when code is represented with images. Existing approaches to code extraction and indexing in this environment rely heavily on computationally intense optical character… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
40
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 49 publications
(43 citation statements)
references
References 32 publications
0
40
0
Order By: Relevance
“…The VGG network shown in Fig. 1b is a very popular model used across many domains, and was chosen because it has recently been applied to software mining [2]. The VGG model has a convenient architecture in which multiple convolutional operations occur in succession, followed by a max pooling layer for down-sampling.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The VGG network shown in Fig. 1b is a very popular model used across many domains, and was chosen because it has recently been applied to software mining [2]. The VGG model has a convenient architecture in which multiple convolutional operations occur in succession, followed by a max pooling layer for down-sampling.…”
Section: Methodsmentioning
confidence: 99%
“…1 School of Information and Computer Science, University of California, Irvine, Irvine, CA, USA. 2 Fowler School of Engineering, Chapman University, One University Dr., Orange, CA 92866, USA.…”
Section: Authors' Contributionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Yadid et al [3] present ACE, a tool that combines language models and image processing techniques to extract source code from software development videos. Ott et al [2] further improved the identification of code fragments using deep learning algorithm. Ponzanelli et al [4,14] employed the extracted code as features to segment development videos, which can efficiently assist developers to focus on the key point in video.…”
Section: Related Workmentioning
confidence: 99%
“…Many efforts have been taken to enhance software development tutorials [2,3,4,5], which could be classified into two categories: text tutorials and video tutorials. However, they may not be sufficient to meet the targeted learning needs if used individually.…”
Section: Introductionmentioning
confidence: 99%