VoMBaT: A Tool for Visualising Evaluation Measure Behaviour in High-Recall Search Tasks

Kusa, Wojciech; Lipani, Aldo; Knoth, Petr; Hanbury, Allan

doi:10.1145/3539618.3591802

Cited by 4 publications

(4 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…When the screening problem is treated as a ranking task, such as screening prioritisation or stopping prediction; the performance is measured in terms of rank-based metrics and metrics at a fixed cut-off, such as nDCG@n, P recision@n, and last relevant found [69,28]. On the other hand, when the screening problem is treated as a classification task, the performance in this case is measured based on the confusion matrix and the notions of Precision and Recall are commonly used [41,59]. One challenge arising from these two distinct approaches is the difficulty in going beyond simple effectiveness measures and comparing the real-world savings for users.…”

Section: Citation Screening Automationmentioning

confidence: 99%

“…We evaluate models using nDCG@10, M AP , Recall at rank k with k in {10, 50, 100} (R@k). Additionally, we compute three measures specifically designed for the task of CS: True Negative Rate at 95% Recall (T N R@95%) [40,41], normalised Precision at 95% Recall (nP @95%) [41], and average position at which the last relevant item is found [30,31,32], calculated as a percentage of the dataset size, where a lower value indicates better performance (Last Rel).…”

Section: Baseline Experimentsmentioning

confidence: 99%

“…The True Negative Rate (T N R) was proposed as an alternative as it addresses some of the limitations of WSS regarding averaging scores from multiple datasets [40]. The measures of normalised Precision at r% recall (nPrecision@r%) and normalised rectified TNR at r% recall (nReTNR@r%) have also been introduced to focus on other important aspects of screening task: screening full texts and estimating users' time savings when compared to the random ranking, respectively [41].…”

Section: A1 Citation Screening Datasetsmentioning

confidence: 99%

See 2 more Smart Citations

An analysis of work saved over sampling in the evaluation of automated citation screening in systematic literature reviews

Kusa

Lipani

Knoth

et al. 2023

Intelligent Systems with Applications

View full text Add to dashboard Cite

Section: Citation Screening Automationmentioning

confidence: 99%

Section: Baseline Experimentsmentioning

confidence: 99%

Section: A1 Citation Screening Datasetsmentioning

confidence: 99%

See 1 more Smart Citation

An analysis of work saved over sampling in the evaluation of automated citation screening in systematic literature reviews

Kusa

Lipani

Knoth

et al. 2023

Intelligent Systems with Applications

View full text Add to dashboard Cite

“…Automated citation screening is an umbrella term for using NLP, machine learning and information retrieval (IR) techniques with the goal of decreasing the time spent on manual screening. Classification approaches train a supervised model on an annotated dataset to determine whether a paper should be included or excluded from the review [23,24].…”

Section: Automated Citation Screeningmentioning

confidence: 99%

CRUISE-Screening: Living Literature Reviews Toolbox

Kusa,

Knoth,

Hanbury

2023

Proceedings of the 32nd ACM International Conference on Information and Knowledge Management

Self Cite

View full text Add to dashboard Cite

Keeping up with research and finding related work is still a timeconsuming task for academics. Researchers sift through thousands of studies to identify a few relevant ones. Automation techniques can help by increasing the efficiency and effectiveness of this task. To this end, we developed CRUISE-Screening, a web-based application for conducting living literature reviews -a type of literature review that is continuously updated to reflect the latest research in a particular field. CRUISE-Screening is connected to several search engines via an API, which allows for updating the search results periodically. Moreover, it can facilitate the process of screening for relevant publications by using text classification and question answering models. CRUISE-Screening can be used both by researchers conducting literature reviews and by those working on automating the citation screening process to validate their algorithms. The application is open-source, 1 and a demo is available under this URL: https://citation-screening.ec.tuwien.ac.at. We discuss the limitations of our tool in Appendix A.

show abstract