2021
DOI: 10.32920/14639652.v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Naive Bayes classifiers for verbal autopsies: comparison to physician-based classification for 21,000 child and adult deaths

Abstract: Background Verbal autopsies (VA) are increasingly used in low- and middle-income countries where most causes of death (COD) occur at home without medical attention, and home deaths differ substantially from hospital deaths. Hence, there is no plausible “standard” against which VAs for home deaths may be validated. Previous studies have shown contradictory performance of automated methods compared to physician-based classification of CODs. We sought to compare the performance of the classic naive Bayes classifi… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(18 citation statements)
references
References 15 publications
0
18
0
Order By: Relevance
“…The number of observations within India, Mexico, Philippines, and Tanzania are 2973, 1586, 1259, and 2023, respectively. To address countryspecific dataset shift, for each country, we used the three remaining countries as training data for four methods commonly used for cause of death predictions: InterVA (Byass et al, 2012), InSilicoVA (McCormick et al, 2016, NBC (Miasnikof et al, 2015), and Tariff (Serina et al, 2015). The first three methods are probabilistic, while Tariff produces a score for each cause that needed to be normalized to be in [0, 1].…”
Section: Phmrc Dataset Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…The number of observations within India, Mexico, Philippines, and Tanzania are 2973, 1586, 1259, and 2023, respectively. To address countryspecific dataset shift, for each country, we used the three remaining countries as training data for four methods commonly used for cause of death predictions: InterVA (Byass et al, 2012), InSilicoVA (McCormick et al, 2016, NBC (Miasnikof et al, 2015), and Tariff (Serina et al, 2015). The first three methods are probabilistic, while Tariff produces a score for each cause that needed to be normalized to be in [0, 1].…”
Section: Phmrc Dataset Analysismentioning
confidence: 99%
“…However, in some applications, the objective is not individual level predictions, but rather to learn about population-level distributions of a given outcome. Examples include sentiment analysis for Twitter users (Giachanou and Crestani, 2016), estimating the prevalence of chronic fatigue syndrome (Valdez et al, 2018), and cause of death distribution estimation from verbal autopsies (King et al, 2008;McCormick et al, 2016;Serina et al, 2015;Byass et al, 2012;Miasnikof et al, 2015).…”
Section: Introductionmentioning
confidence: 99%
“…[15] Physician-coding of VA (PCVA) is subject to physician-specific bias, occupies valuable physician time, is expensive, and often leads to long delays between the VA interview and assigning a cause of death. Over the past decade computational algorithms [16,17,18,19,20,21,22,23] have been developed and are increasingly used to interpret VA data and assign causes to VA deaths. Algorithms are consistent, essentially cost-free (compared to the traditional use of physician time to assign cause of death to VA), and can be run on large numbers of deaths quickly.…”
Section: Verbal Autopsy In Civil Registration and Vital Statisticsmentioning
confidence: 99%
“…In some cases SCI take the form of deaths with symptoms and causes assigned through another mechanism, and in other cases, the relationships are solicited directly from experts. There are six VA-coding algorithms that have been proposed and/or used widely: (i) InterVA, [26,20,27,28,29,30] (ii) Tariff, [21] (iii) a derivative of Tariff called SmartVA-Analyze, [18,19] (iv) InSilicoVA, [16,31] (v) a naive Bayes classifier called NBC, [17] and (vi) the King-Lu algorithm. [22,23] The list of causes that each algorithm assigns varies slightly.…”
Section: Verbal Autopsy In Civil Registration and Vital Statisticsmentioning
confidence: 99%
“…Existing computer-coded VA algorithms include those for which the relationship between symptoms and cause of death is encoded by experts (InterVA [Byass et al, 2012] and InSilicoVA [McCormick et al, 2016]) and those for which it is learned by relying on a labeled subset of the data having known COD (the King and Lu method [King et al, 2008], the Tariff method [James et al, 2011], the Simplified Symptom Pattern method [Murray et al, 2011a], the naive Bayes classifier [Miasnikof et al, 2015], the Bayesian factor model [Kunihama et al, 2018], and latent Gaussian graphical model [Li et al, 2018b]).…”
Section: Introductionmentioning
confidence: 99%