2020
DOI: 10.1038/s41746-020-00336-w
|View full text |Cite
|
Sign up to set email alerts
|

Best practices for authors of healthcare-related artificial intelligence manuscripts

Abstract: Since its inception in 2017, npj Digital Medicine has attracted a disproportionate number of manuscripts reporting on uses of artificial intelligence. This field has matured rapidly in the past several years. There was initial fascination with the algorithms themselves (machine learning, deep learning, convoluted neural networks) and the use of these algorithms to make predictions that often surpassed prevailing benchmarks. As the discipline has matured, individuals have called attention to aberrancies in the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
28
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 39 publications
(28 citation statements)
references
References 15 publications
0
28
0
Order By: Relevance
“…To obtain valid predictive models that perform well beyond the training sample, it is crucial to collect datasets that represent the whole population and reflect its diversity as much as possible [ 6 , 25 , 26 ]. Yet clinical research often emphasizes the opposite: very homogeneous datasets and carefully selected participants.…”
Section: False Solutions To Tackling Dataset Shiftmentioning
confidence: 99%
See 1 more Smart Citation
“…To obtain valid predictive models that perform well beyond the training sample, it is crucial to collect datasets that represent the whole population and reflect its diversity as much as possible [ 6 , 25 , 26 ]. Yet clinical research often emphasizes the opposite: very homogeneous datasets and carefully selected participants.…”
Section: False Solutions To Tackling Dataset Shiftmentioning
confidence: 99%
“…One risk is the potential mismatch, or “dataset shift," between the distribution of the individuals used to estimate this statistical link and that of the target population that should benefit from the biomarker. In this case, the extracted associations may not apply to the target population [ 6 ]. Computer-aided diagnostics of thoracic diseases from X-ray images has indeed been shown to be unreliable for individuals of a given sex if built from a cohort over-representing the other sex [ 7 ].…”
Section: Introduction: Dataset Shift Breaks Learned Biomarkersmentioning
confidence: 99%
“…All methods used to perform analytical and clinical validation should be included in the statistical analysis section. Further details on good practices can be seen here [41]. To increase transparency and enable reproducibility, authors are encouraged to share their work on public code repositories, if applicable [23,[42][43][44].…”
Section: Softwarementioning
confidence: 99%
“…There have been several initiatives proposing "best practice" guidelines to ensure that AI methods are developed and deployed in a way that maximizes the benefit for patients. On the model development front, these include recommendations for providing sufficient methodological detail on the development of algorithms, and encouraging sharing of data sets and code to enhance the transparency in the reporting of AI algorithms in medicine 11,12,24,25 and allow other researchers to determine the rigor, quality, reproducibility, and generalizability of the findings [26][27][28] . Studies also need to adopt standardized guidelines for describing and reporting aspects related to the purpose and context of the "clinical need" that is being addressed, the quality of data used to train the models-including issues related to power calculation, labeling, model biases, etc.-measures of performance, outputs and framework for integration in clinical pathways and workflows, among others.…”
mentioning
confidence: 99%