2023
DOI: 10.1101/2023.02.13.528389
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Pairtools: from sequencing data to chromosome contacts

Abstract: The field of 3D genome organization produces large amounts of sequencing data from Hi-C and a rapidly-expanding set of other chromosome conformation protocols (3C+). Massive and heterogeneous 3C+ data require high-performance and flexible processing of sequenced reads into contact pairs. To meet these challenges, we present pairtools - a flexible suite of tools for contact extraction from sequencing data. Pairtools provides modular command-line interface (CLI) tools that can be flexibly chained into data proce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
41
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 64 publications
(41 citation statements)
references
References 76 publications
0
41
0
Order By: Relevance
“…18 Briefly, paired-end reads were aligned to the UCSC hg38 genome using bowtie2 (v2.3.5.1), and aligned read mates were subsequently processed with pairtools (v0.3.0) to remove PCR duplicates and non-uniquely mapped reads, index reads, and filter to retain only reads with both mates within the captured region ( Extended Data Table 3 ). 17,51,52 Read counts were binned across the genome for 50bp bins to coarser resolutions using cooler (v0.8.11). 53 Resulting contact matrices were balanced using iterative correction and eigendecomposition (ICE) within the captured region.…”
Section: Methodsmentioning
confidence: 99%
“…18 Briefly, paired-end reads were aligned to the UCSC hg38 genome using bowtie2 (v2.3.5.1), and aligned read mates were subsequently processed with pairtools (v0.3.0) to remove PCR duplicates and non-uniquely mapped reads, index reads, and filter to retain only reads with both mates within the captured region ( Extended Data Table 3 ). 17,51,52 Read counts were binned across the genome for 50bp bins to coarser resolutions using cooler (v0.8.11). 53 Resulting contact matrices were balanced using iterative correction and eigendecomposition (ICE) within the captured region.…”
Section: Methodsmentioning
confidence: 99%
“…4 replicates of K562 MNase CTCF HiChIP data were aligned to the reference genome using the BWA-MEM algorithm 41 . Ligation events were then recorded using pairtools parse v. 0.3.0 42 , PCR duplicates were removed, and the final pairs and bam files were generated. HiChIP loop calls were then made using FitHiChIP Peak to Peak 29 with 2.5kb loop anchor bin size.…”
Section: Methodsmentioning
confidence: 99%
“…To process the HiC data, we first mapped the HiC data to the primary genome assembly using BWA-mem version 0.7.17 (Li & Durbin, 2009) with the arguments -SP . The mapped reads were then parsed and filtered using pairtools version 1.0.2 (Open2C et al , 2023). To parse the aligned Hi-C reads, we used the options –no-flip –dd-columns mapq –walks-policy mask .…”
Section: Methodsmentioning
confidence: 99%