2019 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM) 2019
DOI: 10.1109/esem.2019.8870187

Why is Developing Machine Learning Applications Challenging? A Study on Stack Overflow Posts

Citations: cited by 52 publications (66 citation statements)
References: 26 publications
“…We have obtained thresholds of = 0.051 and = 4 for and , respectively, from unclosed Docker posts . Since a closed post may pose a possibility of being duplicated and irrelevant to discussion 5 , unclosed posts have been considered for ensuring Docker relevancy. Content-based filtering approach selects SoF posts whose ≥ and ≥ to develop of Docker posts which discuss about ' ' but does not have Docker in tag-set.…”
Section: 11 (mentioning)
confidence: 99%
“…We also calculate the median time (in minutes) to receive an accepted answer for a question. The longer the time to receive an accepted answer can be explained as a more difficult question [4,5,8,11]. In addition, we use Kendall correlation [2] to identify and verify correlation between popular and difficult topics.…”
Section: RQ2 Topic Characteristics - What Are the Characteristics In (mentioning)
confidence: 99%
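
The measurement described in the statement above (median time to an accepted answer as a difficulty proxy, then a Kendall correlation between popularity and difficulty) can be sketched roughly as follows. This is only an illustration, not the citing authors' code: the topic names, the view counts used as a popularity measure, the sample data, and the helper names `median_minutes_to_accepted` and `kendall_tau_a` are all hypothetical, and the tau-a variant is an assumption, since the quoted text does not say which Kendall coefficient was used.

```python
from itertools import combinations
from statistics import median


def median_minutes_to_accepted(posts):
    """Map each topic to the median minutes until an accepted answer."""
    by_topic = {}
    for topic, minutes in posts:
        by_topic.setdefault(topic, []).append(minutes)
    return {topic: median(values) for topic, values in by_topic.items()}


def kendall_tau_a(xs, ys):
    """Kendall tau-a: (concordant - discordant) / total pairs; ties add zero."""
    pairs = list(combinations(range(len(xs)), 2))
    score = 0
    for i, j in pairs:
        prod = (xs[i] - xs[j]) * (ys[i] - ys[j])
        score += (prod > 0) - (prod < 0)
    return score / len(pairs)


# Hypothetical sample: (topic, minutes until the accepted answer arrived).
posts = [
    ("deployment", 30), ("deployment", 90), ("deployment", 60),
    ("networking", 10), ("networking", 20),
    ("storage", 200), ("storage", 100),
]
# Hypothetical popularity measure: average views per topic.
views = {"deployment": 500, "networking": 900, "storage": 100}

difficulty = median_minutes_to_accepted(posts)
topics = sorted(difficulty)
tau = kendall_tau_a(
    [views[t] for t in topics],
    [difficulty[t] for t in topics],
)
print(difficulty, tau)
```

In this made-up sample the more popular topics receive accepted answers faster, so the coefficient is negative; a real analysis would also report a significance test, which this sketch omits.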
“…It has been a common practice for SE researchers to get insight into developers' concerns on different SE issues by mining related posts from SO [5,7,8,23,30,33]. In our study, we use SO as the data source because: (i) as one of the most popular community-driven Q&A websites, the users in SO range from novices to experts, increasing the diversity of the analyzed issues; (ii) developers often seek for help in SO after they cannot find solutions in documents or internet search, leading to more unsolvable and non-trivial build problems in our analyzed data; (iii) SO inherently contains build issues with implicit symptoms which are often hard to be captured in reproduced or historical build data, increasing comprehensiveness of the dataset.…”
Section: Data Collection (mentioning)
confidence: 99%