2007
DOI: 10.1016/j.neuroimage.2007.04.031
|View full text |Cite
|
Sign up to set email alerts
|

On evaluating brain tissue classifiers without a ground truth

Abstract: In this paper, we present a set of techniques for the evaluation of brain tissue classifiers on a large data set of MR images of the head. Due to the difficulty of establishing a gold standard for this type of data, we focus our attention on methods which do not require a ground truth, but instead rely on a common agreement principle. Three different techniques are presented: the Williams' index, a measure of common agreement; STAPLE, an Expectation Maximization algorithm which simultaneously estimates perform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
61
0

Year Published

2007
2007
2023
2023

Publication Types

Select...
7
3

Relationship

1
9

Authors

Journals

citations
Cited by 86 publications
(62 citation statements)
references
References 30 publications
0
61
0
Order By: Relevance
“…Unlike other authors [52], [53], who selected their training set by random pixel-sample extraction from available manual segmentations of DRIVE and STARE images, we produced our own training set by hand. 1 As discussed in literature, gold-standard images may contain errors (see Bioux et al [61] for a comprehensive discussion on this issue) due to the considerable difficulty involved by the creation of these handmade images. To reduce the risk of introducing errors in and, therefore, of introducing noise in the NN, we opted for carefully selecting specific training samples covering all possible vessel, background, and noise patterns.…”
Section: ) Neural Network Designmentioning
confidence: 99%
“…Unlike other authors [52], [53], who selected their training set by random pixel-sample extraction from available manual segmentations of DRIVE and STARE images, we produced our own training set by hand. 1 As discussed in literature, gold-standard images may contain errors (see Bioux et al [61] for a comprehensive discussion on this issue) due to the considerable difficulty involved by the creation of these handmade images. To reduce the risk of introducing errors in and, therefore, of introducing noise in the NN, we opted for carefully selecting specific training samples covering all possible vessel, background, and noise patterns.…”
Section: ) Neural Network Designmentioning
confidence: 99%
“…The second scan occurred on average 1.5 years later. Brain structures were segmented using a validated tissue segmentation algorithm (32,33). Neocortical gray matter (NCGM), cerebral white matter (CWM), SCSF, and lateral ventricles (LV) were measured separately for each hemisphere.…”
mentioning
confidence: 99%
“…Human delineations of medical images are not a true gold standard but are the most objective solution. 18 We refer to Bouix et al 38 for a comprehensive discussion on ground truth in segmentations.…”
Section: Discussionmentioning
confidence: 99%