2022
DOI: 10.21203/rs.3.rs-1964690/v2
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Diagnostic Accuracy of Artificial Intelligence Algorithm incorporated into MobileODT Enhanced Visual Assessment for triaging Screen Positive Women after Cervical Cancer Screening

Abstract: Introduction : The goal of cervical cancer screening is to detect precancerous precursor lesions that can be treated in the preinvasive stage. Colposcopy is important for triaging of any abnormal cervical screening test. Scarcity of trained Colposcopists and colposcopy centres is a big hurdle to screening programs in lower and middle income countries. Objectives of the study: The objective was to assess the performance of the Artificial Intelligence based incorporated into the MobileODT Enhanced Visual Assessm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 16 publications
0
5
0
Order By: Relevance
“…12 The National Cancer Institute in the United States spearheads the development of models for use in low-middle income countries with promising initial results, 23 efforts to converge toward a usable model, 24 and a program for developing a novel screening/ treatment solution for resource-limited settings. 25,26 The same group worked with images from a mobile phone-based colposcope, demonstrating some potential in learning to reproduce expert reviewer predictions 27 ; however, a recent report from its implementation suggests that its performance was subpar to the colposcopists, 28 highlighting the gap between model development and deployment.…”
Section: Discussionmentioning
confidence: 99%
“…12 The National Cancer Institute in the United States spearheads the development of models for use in low-middle income countries with promising initial results, 23 efforts to converge toward a usable model, 24 and a program for developing a novel screening/ treatment solution for resource-limited settings. 25,26 The same group worked with images from a mobile phone-based colposcope, demonstrating some potential in learning to reproduce expert reviewer predictions 27 ; however, a recent report from its implementation suggests that its performance was subpar to the colposcopists, 28 highlighting the gap between model development and deployment.…”
Section: Discussionmentioning
confidence: 99%
“…Our work builds on prior work highlighting improvements in repeatability of model predictions made by certain design choices (36,37). Our work also stands out among the paucity of current approaches that have utilized AI and DL for cervical screening (21)(22)(23)(24); as aforementioned, these are largely plagued by overfitting and no consideration of repeatability. The dearth of work investigating repeatability of AI models designed for clinical translation in the current DL and medical image classification literature has meant that no rigorous study, to the best of our knowledge, has employed repeatability as a model selection criterion.…”
Section: Discussionmentioning
confidence: 99%
“…Subsequent design choices of our work are optimized to improve clinical translatability. Prior work (21)(22)(23)(24) has shown us that while binary classifiers for cervical image-based cervical precancer+ detection can achieve competitive performance in a given internal seed dataset, they translate poorly when tested in different settings; uncertain cases can be misclassified, and predictions tend to oscillate between the two classes. This oscillation phenomenon could prevent a precancer+ woman from accessing further evaluation (i.e., false negative) or direct a normal woman through unnecessary, potentially invasive tests (i.e., false positive).…”
Section: Discussionmentioning
confidence: 99%
“…Our clinical intuition from working with binary models as well as prior empirical work have informed us that these models frequently fail to capture the inherent uncertainty with ambiguous samples (21)(22)(23)(24). These uncertain samples are of two intersecting kinds: samples that are uncertain to the clinician ("rater uncertainty") and samples that are uncertain to the model i.e., where the model reports low confidence scores ("model uncertainty"); both instances can lead to incorrect classification and subsequent misinformed downstream actions for these patients.…”
Section: Improved Clinical Translatability: Multi-level Ground Truthmentioning
confidence: 99%
“…This leads to apparent initial promise, with either poor performance on or absence of held-aside test sets for evaluating true model performance. When deployed in different settings, these models fail to return consistent scores and accurately detect precancers (21)(22)(23)(24). This poses significant concerns when considering downstream deployment in various LMIC, where model predictions directly inform the course of treatment, and where screening opportunities are limited.…”
mentioning
confidence: 99%