2017 IEEE/ACM 39th International Conference on Software Engineering Companion (ICSE-C) 2017
DOI: 10.1109/icse-c.2017.90
|View full text |Cite
|
Sign up to set email alerts
|

Combining Word2Vec with Revised Vector Space Model for Better Code Retrieval

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
9
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(10 citation statements)
references
References 12 publications
1
9
0
Order By: Relevance
“…In [76], word embeddings was combined with information retrieval to recommend similar bug reports. Methods similar to or based on word embeddings have also been used recently for better code retrieval [67], to find common software weaknesses [82], API recommendation [80] and sentiment analysis for software engineering [11]. Our work comple-ments the existing work as we build a cross-platform approach that leverages word embeddings to aid software-development-specific knowledge extraction tasks.…”
Section: Leveraging Word Embeddingsmentioning
confidence: 99%
“…In [76], word embeddings was combined with information retrieval to recommend similar bug reports. Methods similar to or based on word embeddings have also been used recently for better code retrieval [67], to find common software weaknesses [82], API recommendation [80] and sentiment analysis for software engineering [11]. Our work comple-ments the existing work as we build a cross-platform approach that leverages word embeddings to aid software-development-specific knowledge extraction tasks.…”
Section: Leveraging Word Embeddingsmentioning
confidence: 99%
“…In the literature, we can find several BoW based source code retrieval methods [6][7][8][9][10][11][12][13]. Authors have also reported using word2vec for software search [14][15][16][17], The researchers considered Correct at r C@r with abbreviation occurring in top r ranked positions and Pearson Correlation score to evaluate their work using popular data sets of Eclipse and AspectJ.…”
Section: Related Workmentioning
confidence: 99%
“…Emphasizing on the issue of handling mismatch in code search/retrieval using deep learning with Word2Vec was done by Van Nguyen et al [16], in which the researchers combined Word2Vec with Revised Vector Space model for better code retrieval (rVSM). rVSM computes the weight for a word based on a new term frequency-inverse document frequency (tf-idf) formula and a new scoring scheme among the vectors that takes documents' lengths into account.…”
Section: Related Workmentioning
confidence: 99%
“…More detailed information can be obtained through the analysis of local context (i.e. words that occur near each other), for example the skip-gram approach [45], which was previously applied to documentation from the Java Development Kit for code retrieval [46]. However, there is a danger predicting the context of one word at a time will miss information available through global statistics.…”
Section: Background and Related Workmentioning
confidence: 99%