2023
DOI: 10.3390/microorganisms11040949
|View full text |Cite
|
Sign up to set email alerts
|

Information Scale Correction for Varying Length Amplicons Improves Eukaryotic Microbiome Data Integration

Abstract: The integration and reanalysis of big data provide valuable insights into microbiome studies. However, the significant difference in information scale between amplicon data poses a key challenge in data analysis. Therefore, reducing batch effects is crucial to enhance data integration for large-scale molecular ecology data. To achieve this, the information scale correction (ISC) step, involving cutting different length amplicons into the same sub-region, is essential. In this study, we used the Hidden Markov m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 40 publications
0
4
0
Order By: Relevance
“…These primer biases could be attributed to differential taxa specificity and amplification efficiency. Moreover, the regional information scale, for example, the longer region and higher regional hypervariability, can affect taxonomic resolution [ 34 , 35 , 36 , 37 ], potentially explaining the improved taxa detection at the species level in V5–V7. These findings underscore the bias inherent in marker gene‐based metagenomics, resulting from the imbalances among primer sequences, amplification systems, and reaction condition optimization.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…These primer biases could be attributed to differential taxa specificity and amplification efficiency. Moreover, the regional information scale, for example, the longer region and higher regional hypervariability, can affect taxonomic resolution [ 34 , 35 , 36 , 37 ], potentially explaining the improved taxa detection at the species level in V5–V7. These findings underscore the bias inherent in marker gene‐based metagenomics, resulting from the imbalances among primer sequences, amplification systems, and reaction condition optimization.…”
Section: Resultsmentioning
confidence: 99%
“…Specifically, V5–V7 data set exhibited stronger phylogenetic signals than V4. These disparities may arise from the differential taxonomic composition yielded by differential resolution [ 34 , 36 ]. The longer length and higher entropy of the V5–V7 region provide greater resolution than V4 [ 34 ], leading to divergent estimation of species relatedness/phylogenetic structure (lower βNTI) and community assembly process partitioning.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations