Reliability in evaluator-based tests: using simulation-constructed models to determine contextually relevant agreement thresholds

Beckler, Dylan T.; Thumser, Zachary C.; Schofield, Jonathon S.; Marasco, Paul D.

doi:10.1186/s12874-018-0606-7

Cited by 24 publications

(9 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We computed the inter-coder agreement using Krippendorff's α (Artstein and Poesio, 2008;Hayes and Krippendorff, 2007). The inter-coder agreement for our study is 0.87 which concludes that our qualitative analysis is reliable as prior studies use α 0.8 as an indicator of reliable agreement (Li et al, 2020;Beckler et al, 2018;Webb et al, 2020;Vassallo et al, 2020;Scoccia and Autili, 2020).…”

Section: Approachsupporting

confidence: 75%

Revisiting reopened bugs in open source software systems

Tagra¹,

Zhang²,

Rajbahadur³

et al. 2022

Preprint

View full text Add to dashboard Cite

Reopened bugs can degrade the overall quality of a software system since they require unnecessary rework by developers. Moreover, reopened bugs also lead to a loss of trust in the end-users regarding the quality of the software. Thus, predicting bugs that might be reopened could be extremely helpful for software developers to avoid rework. Prior studies on reopened bug prediction focus only on three open source projects (i.e., Apache, Eclipse, and OpenOffice) to generate insights. We observe that one out of the three projects (i.e., Apache) has a data leak issue -the bug status of reopened was included as training data to predict reopened bugs. In addition, prior studies used an outdated prediction model pipeline (i.e., with old techniques for constructing a prediction model) to predict reopened bugs. Therefore, we revisit the reopened bugs study on a large scale dataset consisting of 47 projects tracked by JIRA using the modern techniques such as SMOTE, permutation importance together with 7 different machine learning models. We study the reopened bugs using a mixed methods approach (i.e., both quantitative and qualitative study). We find that: 1) After using an updated reopened bug prediction model pipeline, only 34% projects give an acceptable performance with AUC 0.7. 2) There are four major reasons for a bug getting reopened, that is, technical (i.e., patch/integration issues), documentation, human (i.e., due to incorrect bug assessment), and reasons not shown in the bug reports. 3) In

show abstract

Section: Approachsupporting

confidence: 75%

Revisiting reopened bugs in open source software systems

Tagra¹,

Zhang²,

Rajbahadur³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Organisms instinctively make strategic decisions about how to spend their resources in relationship to the payoff from their actions. Humans are subject to these same instincts (72)(73)(74)(75), and PEP provides insight into resource allocation decisions made during sensory discrimination tasks (49,50). With TMR-motor only, pSD made discrimination decisions slightly above chance.…”

Section: Discussionmentioning

confidence: 99%

“…The metrics focus on essential sensory-motor features of limb function such as visual attention, cognitive demand, fine motor dexterity, and ownership while also being clinically and real-world relevant. Each metric was validated in their foundational fields of psychophysics, mathematical theory, cognition/perception, visuomotor behavior, and psychometrics (23,31,(37)(38)(39)(40)(41)(42)(43)(44)(45)(46)(47)(48)(49)(50)(51)(52)(53)(54).…”

Section: Introductionmentioning

confidence: 99%

Neurorobotic fusion of prosthetic touch, kinesthesia, and movement in bionic upper limbs promotes intrinsic brain behaviors

et al. 2021

Self Cite

View full text Add to dashboard Cite

show abstract

“…Krippendorff's alpha of the polarity of the PPPRs and the similarities between pre-publication peer reviews and PPPRs were calculated using a software developed by Freelon (2013), and the alpha values were 0.887 and 0.941. Both alpha values were greater than 0.8, which demonstrated acceptable interrater reliability between the judgments of the two coders (Beckler et al , 2018).…”

Section: Resultsmentioning

confidence: 87%

The relationship of polarity of post-publication peer review to citation count

Zong

Fan

Xie

et al. 2020

OIR

View full text Add to dashboard Cite

PurposeThe purpose of this study is to investigate the relationship of the post-publication peer review (PPPR) polarity of a paper to that paper's citation count.Design/methodology/approachPapers with PPPRs from Publons.com as the experimental groups were manually matched 1:2 with the related papers without PPPR as the control group, by the same journal, the same issue (volume), the same access status (gold open access or not) and the same document type. None of the papers in the experimental group or control group received any comments or recommendations from ResearchGate, PubPeer or F1000. The polarity of the PPPRs was coded by using content analysis. A negative binomial regression analysis was conducted to examine the data by controlling the characteristics of papers.FindingsThe four experimental groups and their corresponding control groups were generated as follows: papers with neutral PPPRs, papers with both negative and positive PPPRs, papers with negative PPPRs and papers with positive PPPRs as well as four corresponding control groups (papers without PPPRs). The results are as follows: while holding the other variables (such as page count, number of authors, etc.) constant in the model, papers that received neutral PPPRs, those that received negative PPPRs and those that received both negative and positive PPPRs had no significant differences in citation count when compared to their corresponding control pairs (papers without PPPRs). Papers that received positive PPPRs had significantly greater citation count than their corresponding control pairs (papers without PPPRs) while holding the other variables (such as page count, number of authors, etc.) constant in the model.Originality/valueBased on a broader range of PPPR sentiments, by controlling many of the confounding factors (including the characteristics of the papers and the effects of the other PPPR platforms), this study analyzed the relationship of various polarities of PPPRs to citation count.

show abstract

Reliability in evaluator-based tests: using simulation-constructed models to determine contextually relevant agreement thresholds

Cited by 24 publications

References 25 publications

Revisiting reopened bugs in open source software systems

Revisiting reopened bugs in open source software systems

Neurorobotic fusion of prosthetic touch, kinesthesia, and movement in bionic upper limbs promotes intrinsic brain behaviors

The relationship of polarity of post-publication peer review to citation count

Contact Info

Product

Resources

About