2024
DOI: 10.1109/jbhi.2023.3268729
|View full text |Cite
|
Sign up to set email alerts
|

An Overview of Data Integration in Neuroscience With Focus on Alzheimer's Disease

Abstract: This work represents the first attempt to provide an overview of how to face data integration as the result of a dialogue between neuroscientists and computer scientists. Indeed, data integration is fundamental for studying complex multifactorial diseases, such as the neurodegenerative diseases. This work aims at warning the readers of common pitfalls and critical issues in both medical and data science fields. In this context, we define a road map for data scientists when they first approach the issue of data… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 88 publications
0
4
0
Order By: Relevance
“…Integration of multiple genomics and phenotype data is gradually unveiling the complex molecular biology behind genetic diseases and cancers [39][40][41]. Data fusion has been previously applied as a tool to cluster patients or how to extract relevant features for disease prognosis by integrating data of several NGS, imaging and other clinically related datasets from the same group of samples [42][43][44]. The main limitations to the application of these approaches are batch effects, the curse of dimensionality that arises with genomic data and missing information or heterogeneity (data incompleteness) [43].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Integration of multiple genomics and phenotype data is gradually unveiling the complex molecular biology behind genetic diseases and cancers [39][40][41]. Data fusion has been previously applied as a tool to cluster patients or how to extract relevant features for disease prognosis by integrating data of several NGS, imaging and other clinically related datasets from the same group of samples [42][43][44]. The main limitations to the application of these approaches are batch effects, the curse of dimensionality that arises with genomic data and missing information or heterogeneity (data incompleteness) [43].…”
Section: Discussionmentioning
confidence: 99%
“…Data fusion has been previously applied as a tool to cluster patients or how to extract relevant features for disease prognosis by integrating data of several NGS, imaging and other clinically related datasets from the same group of samples [42][43][44]. The main limitations to the application of these approaches are batch effects, the curse of dimensionality that arises with genomic data and missing information or heterogeneity (data incompleteness) [43]. Regarding the first point, in each sequencing experiment, technical differences among replicates could mask or mimic biological variation; for example, different sequencing coverage among two groups of samples sequenced with RNAseq could potentially lead to the discovery of several false positives, as differentially expressed genes, if samples are not properly normalized [45].…”
Section: Discussionmentioning
confidence: 99%
“…KDD outperformed SEMMA and CRISP-DM in terms of versatility and ability to address the specific challenges of our data mining project in a more efficient and comprehensive manner, as shown in Table 5. [38].…”
Section: Comparison Of Methodologiesmentioning
confidence: 99%
“…Turrisi et al [3] work is the first effort to give a comprehensive guide on addressing data integration via collaboration between neuroscientists and computer scientists. Data integration is essential for analyzing complex multifactorial disorders like neurodegenerative diseases.…”
Section: Guest Editorial Insights Of Machine Learning Into Medicalmentioning
confidence: 99%