2012
DOI: 10.1371/journal.pcbi.1002286
|View full text |Cite
|
Sign up to set email alerts
|

Parsimonious Higher-Order Hidden Markov Models for Improved Array-CGH Analysis with Applications to Arabidopsis thaliana

Abstract: Array-based comparative genomic hybridization (Array-CGH) is an important technology in molecular biology for the detection of DNA copy number polymorphisms between closely related genomes. Hidden Markov Models (HMMs) are popular tools for the analysis of Array-CGH data, but current methods are only based on first-order HMMs having constrained abilities to model spatial dependencies between measurements of closely adjacent chromosomal regions. Here, we develop parsimonious higher-order HMMs enabling the interp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
26
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
9

Relationship

3
6

Authors

Journals

citations
Cited by 29 publications
(28 citation statements)
references
References 106 publications
(154 reference statements)
2
26
0
Order By: Relevance
“…This extends previous findings where standard mixture models have been outperformed by first-order and higher-order HMMs [11], [13], [24]. Moreover, these performances can be further improved by autoregressive higher-order HMMs (AR()-HMM() with and ; non-black curves in Figure 3a).…”
Section: Resultssupporting
confidence: 87%
“…This extends previous findings where standard mixture models have been outperformed by first-order and higher-order HMMs [11], [13], [24]. Moreover, these performances can be further improved by autoregressive higher-order HMMs (AR()-HMM() with and ; non-black curves in Figure 3a).…”
Section: Resultssupporting
confidence: 87%
“…Data were averaged on the dye-swap to remove tile-specific dye biases and across biological replicates. Normalized data were then analysed using a three-states Parsimonious Higher-Order Hidden Markov Model (PHHMM, www.jstacs.de/index.php/PHHMM) to model the joint distribution of the þ N and À N-derived signals, taking into account spatial dependency between the DNA sequence reported by the probes on the tiling array 33 . Briefly, normalized log2 þ N/ À N signal ratio for each probe corresponds to the observed values of the Hidden Markov chain, while the hidden chain is the enrichment state of the DNA sequence reported by each probe in the ChIP-chip experiments, respectively.…”
Section: Methodsmentioning
confidence: 99%
“…First, initial model parameters are iteratively maximized by a specific Bayesian Baum-Welch training algorithm enabling the integration of previous knowledge and by a data-dependent reduction of transition parameters. Second, based on this training, optimized model parameters are obtained that allow the parsimonious higher-order HMM with univariate Gaussian emission distributions to effectively model spatial dependencies between measurements 33 . This HMM approach provides a probabilistic classification of each probe into one of the three states, namely non-enriched, enriched and depleted in the þ N/ À N comparison.…”
Section: Methodsmentioning
confidence: 99%
“…All normalized CGH profiles were analyzed by a mixture model of three Gaussian distributions described in Seifert et al (2012) to identify depletions, enrichment, and unchanged regions between a test genotype and the reference sample by specifically adapting the parameters of the mixture model to both CGH profiles (replicates from each dye swap; see Supplemental Figure 1 online) of each test genotype using a Bayesian Expectation Maximization algorithm. Identification of depleted, unchanged, and enriched regions was done by performing a decoding of the measured log2 ratios into the most likely underlying state of each probe (depleted, unchanged, or enriched).…”
Section: Microarray-based Cghmentioning
confidence: 99%