2022
DOI: 10.1101/gr.276362.121

Automated annotation of human centromeres with HORmon

Abstract: Recent advances in long-read sequencing opened a possibility to address the long-standing questions about the architecture and evolution of human centromeres. They also emphasized the need for centromere annotation (partitioning human centromeres into monomers and higher-order repeats (HORs)). Even though there was a half-century-long series of semi-manual studies of centromere architecture, a rigorous centromere annotation algorithm is still lacking. Moreover, an automated centromere annotation is a prerequis…

citations
Cited by 17 publications
(27 citation statements)
references
References 32 publications
“…The blue path in Figure Alignment, left shows the rare-alignment path between cenX 1 and cenX 2 constructed by TandemAlignment. This path illustrates a very different and complex evolutionary scenario with only 1,954,622 matched positions and 2,335,879 insertions and deletions -surprisingly, the orange and blue paths in To illustrate limitations of the standard alignment using a simpler example, we generated the HOR decomposition of cenX 1 (Kunyavskaya et al 2022) and extracted ten HOR-blocks at coordinates cenX 1 :1,222,223-1,242,795 resulting in a sequence of length 20,572 bp referred to as a Template. We remove the third (eighth) HOR block at coordinates Template:4,172 (14,461), and refer to the resulting sequence as Template -3 (Template -8 ).…”
Section: Figure Algorithm Aligning Etrs S=cccaaccaacaaaccc and T=ccca... (mentioning)
confidence: 99%
“…To generate realistic simulated centromeres, we partitioned cenX 1 into HORs (Kunyavskaya et al 2022) and introduced N HOR-indel-runs. For the length of introduced HOR-indels, we use 1-based Poisson distribution with the mean estimated from Figure IndelHistogram, left and equal to 1.66.…”
Section: Supplemental Notes (mentioning)
confidence: 99%
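
The length sampling described in the statement above can be sketched as follows. This is an illustrative reading, not the citing authors' code: it assumes that "1-based Poisson" means a Poisson variable shifted by one, so that lengths start at 1 and the overall mean equals 1.66. A zero-truncated Poisson would be another plausible reading. The function names are hypothetical.

```python
import math
import random


def sample_poisson(lam, rng):
    """Draw one Poisson(lam) value with Knuth's multiplication method (fine for small lam)."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def sample_hor_indel_length(mean, rng):
    """Sample a HOR-indel length (in HOR units) that is always >= 1.

    ASSUMPTION: "1-based Poisson with mean m" is modeled as 1 + Poisson(m - 1).
    """
    if mean < 1.0:
        raise ValueError("a 1-based length distribution needs mean >= 1")
    return 1 + sample_poisson(mean - 1.0, rng)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
lengths = [sample_hor_indel_length(1.66, rng) for _ in range(20000)]
empirical_mean = sum(lengths) / len(lengths)
print(f"shortest length: {min(lengths)}, empirical mean: {empirical_mean:.2f}")
```

Under this assumption, every sampled length is at least one HOR unit and the empirical mean approaches 1.66 as the number of samples grows.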
“…To address this question, Dvorkina et al proposed the first automatic centromere annotation tool, CentromereArchitect [ 10 ], which was based on StringDecomposer (SD) [ 4 ], an algorithm for detecting sequence blocks by taking monomer templates to decompose centromere DNA sequences. In CentromereArchitect, monomer inference and HOR detection were considered two separate problems without interconnections, which often led to biologically inadequate annotation [ 14 ]. The authors next proposed HORmon [ 14 ] based on the centromere evolution postulate (CE postulate, where each monomer appears only once in the HOR unit) to address the lack of interconnection issue in CentromereArchitect.…”
Section: Introduction (mentioning)
confidence: 99%
“…In CentromereArchitect, monomer inference and HOR detection were considered two separate problems without interconnections, which often led to biologically inadequate annotation [ 14 ]. The authors next proposed HORmon [ 14 ] based on the centromere evolution postulate (CE postulate, where each monomer appears only once in the HOR unit) to address the lack of interconnection issue in CentromereArchitect. HORmon first constructs a de Bruijn graph based on monomers inferred from CentromereArchitect and then refines the monomers by considering positional similarity to amend the graph as a single cycle (referred to as the detected HOR) to comply with the CE postulate.…”
Section: Introduction (mentioning)
confidence: 99%
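
The idea summarized in the statements above (a graph over monomers that should form a single cycle, with each monomer appearing once in the HOR) can be illustrated with a toy sketch. This is not HORmon's implementation: it assumes the centromere has already been decomposed into a sequence of monomer labels, uses a simple graph of consecutive monomer pairs in place of the de Bruijn graph, and does no monomer refinement. The names `monomer_graph` and `single_cycle` are hypothetical.

```python
from collections import Counter


def monomer_graph(monomers, min_count=1):
    """Edges between consecutive monomer labels, kept if seen at least min_count times."""
    edges = Counter(zip(monomers, monomers[1:]))
    return {edge: count for edge, count in edges.items() if count >= min_count}


def single_cycle(edges):
    """Return the monomer order if the graph is one simple cycle, else None.

    A single simple cycle means each monomer occurs exactly once in the
    candidate HOR, which is the condition the CE postulate asks for.
    """
    successor = {}
    in_degree = Counter()
    for source, target in edges:
        if source in successor:  # out-degree > 1: not a simple cycle
            return None
        successor[source] = target
        in_degree[target] += 1
    vertices = set(successor) | set(in_degree)
    if not vertices or set(successor) != vertices:
        return None
    if any(in_degree[v] != 1 for v in vertices):
        return None
    # Every vertex now has in- and out-degree 1, so the graph is a union of
    # disjoint cycles; walk one and check that it covers all vertices.
    start = min(vertices)
    cycle, current = [start], successor[start]
    while current != start:
        cycle.append(current)
        current = successor[current]
    return cycle if len(cycle) == len(vertices) else None


clean = single_cycle(monomer_graph(list("ABCABCABC")))
branched = single_cycle(monomer_graph(list("ABCABDABC")))
print(clean, branched)
```

In the first toy sequence the graph is the cycle A, B, C, so a three-monomer HOR is reported. In the second, monomer B is followed by both C and D, so the graph is not a single cycle and the function returns `None`.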