2022
DOI: 10.1007/s10664-022-10242-2
|View full text |Cite
|
Sign up to set email alerts
|

Automatic prediction of rejected edits in Stack Overflow

Abstract: The content quality of shared knowledge in Stack Overflow (SO) is crucial in supporting software developers with their programming problems. Thus, SO allows its users to suggest edits to improve the quality of a post (i.e., question and answer). However, existing research shows that many suggested edits in SO are rejected due to undesired contents/formats or violating edit guidelines. Such a scenario frustrates or demotivates users who would like to conduct good-quality edits. Therefore, our research focuses o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 82 publications
0
3
0
Order By: Relevance
“…Therefore, we chose five popular supervised ML classifiers with different learning strategies to classify the issue's reproducible/irreproducible status. These algorithms are selected because (1) they can build reliable models to predict reproducibility status using our features, and (2) they are widely used in relevant studies [51,52,53,54,50]. Random Forest (RF) is a popular supervised ML technique due to its simplicity and versatility [55].…”
Section: Machine Learning Model Selectionmentioning
confidence: 99%
See 2 more Smart Citations
“…Therefore, we chose five popular supervised ML classifiers with different learning strategies to classify the issue's reproducible/irreproducible status. These algorithms are selected because (1) they can build reliable models to predict reproducibility status using our features, and (2) they are widely used in relevant studies [51,52,53,54,50]. Random Forest (RF) is a popular supervised ML technique due to its simplicity and versatility [55].…”
Section: Machine Learning Model Selectionmentioning
confidence: 99%
“…RF scales well to any number of dimensions and offers acceptable performance. Instead of searching for the most important feature when splitting nodes, it looks for the best feature within a random subset [51], which helps prevent model overfitting. eXtreme Gradient Boosting (XGBoost) is a scalable tree-boosting technique [57] that predicts a target variable by combining estimates from simpler weak models.…”
Section: Machine Learning Model Selectionmentioning
confidence: 99%
See 1 more Smart Citation