2021
DOI: 10.32604/cmc.2021.016081
|View full text |Cite
|
Sign up to set email alerts
|

BitmapAligner: Bit-Parallelism String Matching with MapReduce and Hadoop

Abstract: Advancements in next-generation sequencer (NGS) platforms have improved NGS sequence data production and reduced the cost involved, which has resulted in the production of a large amount of genome data. The downstream analysis of multiple associated sequences has become a bottleneck for the growing genomic data due to storage and space utilization issues in the domain of bioinformatics. The traditional string-matching algorithms are efficient for small sized data sequences and cannot process large amounts of d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 32 publications
0
1
0
Order By: Relevance
“…is is often considered a critical methodology in cloud-based systems, structures, and research. Hash-based homomorphic encipherment validates the confidentiality of users' sensitive information in the directive of determining the protection back issue of storage space or administering users' confidential information by unauthorized parties [14,15,[25][26][27][28][29][30]. With the affordances of WWW, cloud consumers are capable of utilizing cloud customer services just about every time and in any place.…”
Section: Related Workmentioning
confidence: 99%
“…is is often considered a critical methodology in cloud-based systems, structures, and research. Hash-based homomorphic encipherment validates the confidentiality of users' sensitive information in the directive of determining the protection back issue of storage space or administering users' confidential information by unauthorized parties [14,15,[25][26][27][28][29][30]. With the affordances of WWW, cloud consumers are capable of utilizing cloud customer services just about every time and in any place.…”
Section: Related Workmentioning
confidence: 99%
“…Spark is a Resilient Distributed Dataset (RDD) computing model based on the traditional MapReduce computing framework, which is more suitable for handling distributed parallel computing and data reuse. Therefore, it significantly improves the operational efficiency of data reading, writing, analysis, and mining [4,5]. In meteorology data mining, common analysis methods include regression analysis, cluster analysis, classification analysis, and time series analysis [6].…”
Section: Introductionmentioning
confidence: 99%