2018
DOI: 10.1371/journal.pone.0194361
|View full text |Cite
|
Sign up to set email alerts
|

ParBiBit: Parallel tool for binary biclustering on modern distributed-memory systems

Abstract: Biclustering techniques are gaining attention in the analysis of large-scale datasets as they identify two-dimensional submatrices where both rows and columns are correlated. In this work we present ParBiBit, a parallel tool to accelerate the search of interesting biclusters on binary datasets, which are very popular on different fields such as genetics, marketing or text mining. It is based on the state-of-the-art sequential Java tool BiBit, which has been proved accurate by several studies, especially on sce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 28 publications
(20 reference statements)
0
9
0
Order By: Relevance
“…However, all the previously mentioned works are focused on quantitative datasets and they are not able to take advantage of the binary representation of the information. Up to our knowledge, ParBiBit [6] is the only MPIbased biclustering tool specifically designed for binary data and publicly available to download. It is a parallel version of BiBit [24], a tool that obtains accurate results for gene expression data, especially on cases with many large biclusters [18].…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…However, all the previously mentioned works are focused on quantitative datasets and they are not able to take advantage of the binary representation of the information. Up to our knowledge, ParBiBit [6] is the only MPIbased biclustering tool specifically designed for binary data and publicly available to download. It is a parallel version of BiBit [24], a tool that obtains accurate results for gene expression data, especially on cases with many large biclusters [18].…”
Section: Related Workmentioning
confidence: 99%
“…Furthermore, the thread-based parallelization of this third step in ParBiBit is not able to fully exploit the resources of a one node because it requires many thread synchronizations to add biclusters to the C++ set, which is a non thread-safe structure. More information about the ParBiBit approach and its bottlenecks is available in [6].…”
Section: Initialized Biclustersmentioning
confidence: 99%
See 3 more Smart Citations