2018
DOI: 10.1371/journal.pone.0199815
|View full text |Cite
|
Sign up to set email alerts
|

Creating longitudinal datasets and cleaning existing data identifiers in a cystic fibrosis registry using a novel Bayesian probabilistic approach from astronomy

Abstract: Patient registry data are commonly collected as annual snapshots that need to be amalgamated to understand the longitudinal progress of each patient. However, patient identifiers can either change or may not be available for legal reasons when longitudinal data are collated from patients living in different countries. Here, we apply astronomical statistical matching techniques to link individual patient records that can be used where identifiers are absent or to validate uncertain identifiers. We adopt a Bayes… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 32 publications
0
2
0
Order By: Relevance
“…The method presented allows them to compute the probability of identity even when certain elements do not agree. Other methods of de-duplication, not necessarily with anonymity, include a Bayesian method 86 adapted from astronomy—galaxies in different astronomical databases may not have matched names, but do have matched characteristics, by and large. Funded by the German Medical Informatics Initiative 87 , the SMITH consortium is developing the infrastructure to support a network of Data Integration Centres (DIC), which will share services and functionality to provide access to the local hospitals’ Electronic Medical Records (EMRs).…”
Section: Resultsmentioning
confidence: 99%
“…The method presented allows them to compute the probability of identity even when certain elements do not agree. Other methods of de-duplication, not necessarily with anonymity, include a Bayesian method 86 adapted from astronomy—galaxies in different astronomical databases may not have matched names, but do have matched characteristics, by and large. Funded by the German Medical Informatics Initiative 87 , the SMITH consortium is developing the infrastructure to support a network of Data Integration Centres (DIC), which will share services and functionality to provide access to the local hospitals’ Electronic Medical Records (EMRs).…”
Section: Resultsmentioning
confidence: 99%
“…Researchers actively studied how to merge patient records in healthcare databases [ [31] , [32] , [33] , [34] , [35] , [36] , [37] , [38] , [39] , [40] , [41] , [42] , [43] , [44] , [45] , [46] , [47] , [48] , [49] , [50] , [51] , [52] , [53] , [54] , [55] , [56] , [57] , [58] , [59] , [60] , [61] , [62] , [63] ]. These studies can be roughly grouped into three approaches: a deterministic approach [ [31] , [32] , [33] , [34] , [35] , [36] , [37] , [38] , [39] , [40] , [41] , [42] , [43] , [44] , [45] , [46] ], a probabilistic approach [ [47] , [48] , [49] , [50] , [51] , [52] , [53] , [54] , [55] , [56] , [57] , [58] ], and a hybrid approach [ [59] , [60] , [61] , [62] , [63] ]. Our method can be categorized into the deterministic approach, but it can be extended by incorporatin...…”
Section: Discussionmentioning
confidence: 99%