2015
DOI: 10.1074/mcp.o114.039115
|View full text |Cite
|
Sign up to set email alerts
|

mzDB: A File Format Using Multiple Indexing Strategies for the Efficient Analysis of Large LC-MS/MS and SWATH-MS Data Sets *

Abstract: The analysis and management of MS data, especially those generated by data independent MS acquisition, exemplified by SWATH-MS, pose significant challenges for proteomics bioinformatics. The large size and vast amount of information inherent to these data sets need to be properly structured to enable an efficient and straightforward extraction of the signals used to identify specific target peptides. Standard XML based formats are not well suited to large MS data files, for example, those generated by SWATH-MS… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
21
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 29 publications
(23 citation statements)
references
References 40 publications
0
21
0
Order By: Relevance
“…Here we compare MzTree to leading MS file formats most like it–mzML, mz5 [ 5 ] and mzDB [ 6 ]–in the arenas of data querying, file conversion time and size on disk. Querying comparisons are of two kinds: random and path.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Here we compare MzTree to leading MS file formats most like it–mzML, mz5 [ 5 ] and mzDB [ 6 ]–in the arenas of data querying, file conversion time and size on disk. Querying comparisons are of two kinds: random and path.…”
Section: Resultsmentioning
confidence: 99%
“…Because of axis-agnostic design, MzTree is comparably fast in both m/z- and RT-major queries, where competing methods perform relatively worse than in m/z-major queries (see Fig 5(b) ). MzTree’s improved performance is due to axis-agnostic design, as both mz5 [ 5 ] and mzDB [ 6 ] are optimized for m/z-centric queries.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The mzDB format 7 uses an alternative database paradigm, the lightweight SQLite relational database. mzDB’s main mechanism of increasing random read performance is in organizing data in small two-dimensional blocks across multiple consecutive spectra (i.e., along both the m / z and retention time axis), enabling a quick reading of XICs.…”
Section: Introductionmentioning
confidence: 99%
“…Regardless, algorithms are limited to taking slices of the data along the retention time (or spectrum) axis only. In contrast, other attempts such as mzDB [4], mzRTree [5], and mzTree [6], focus on random I/O access through the use of an RTree [7] data structure. This allows the data to be accessed along both the m/z (or chromatogram) axis as well as the retention time axis, at the cost of file size or mass accuracy.…”
mentioning
confidence: 99%