2020
DOI: 10.1109/tcbb.2019.2915811
|View full text |Cite
|
Sign up to set email alerts
|

Enabling Massive XML-Based Biological Data Management in HBase

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1

Relationship

3
6

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 38 publications
0
6
0
Order By: Relevance
“…As shown in Figure 2 , for the submission process, the hyper-converged architecture integrates the server, storage, and network resources into a virtual pool, so that it can achieve high scalability using a distributed scale-out expansion for storage and computing resources. Reanalyzed datasets, such as that for proteins, peptides and spectra, are stored in a distributed column-oriented storage database known as HBase ( 13 ) in the Hadoop cluster. The indexes of the identifications are stored in an Elastic Search cluster for a distributed, high-scalable, real-time search and data analysis.…”
Section: Big Data Architecture and Infrastructure Of Iproxmentioning
confidence: 99%
“…As shown in Figure 2 , for the submission process, the hyper-converged architecture integrates the server, storage, and network resources into a virtual pool, so that it can achieve high scalability using a distributed scale-out expansion for storage and computing resources. Reanalyzed datasets, such as that for proteins, peptides and spectra, are stored in a distributed column-oriented storage database known as HBase ( 13 ) in the Hadoop cluster. The indexes of the identifications are stored in an Elastic Search cluster for a distributed, high-scalable, real-time search and data analysis.…”
Section: Big Data Architecture and Infrastructure Of Iproxmentioning
confidence: 99%
“…[ 9,10 ] Cloud providers such as Amazon Web Services (AWS), [ 11 ] Microsoft Azure, [ 12 ] and Google Cloud Platform offer a range of services and tools that can be used to store, process, and analyze precision medicine data. [ 10,13 ] NoSQL Databases [ 14 ] : NoSQL databases such as MongoDB, [ 15–17 ] Cassandra, [ 15,18 ] and HBase [ 19 ] are used to store and manage large‐scale genomic and clinical datasets. These databases offer scalability, flexibility, and high availability, making them well‐suited to handle the complex and dynamic nature of precision medicine data.…”
Section: Datawarehouse and Data Managementmentioning
confidence: 99%
“…In order to effectively manage massive biomedical data, approaches concerned with the reengineering traditional databases in HBase are greatly needed. The adoption of the HBase naturally triggers the requirement of the mapping from the historical one to the new one [43], [44]. Although HBase is employed to model the future data explosion, little work focuses on the uncertainty modeling of objectoriented biomedical information in HBase.…”
Section: Introductionmentioning
confidence: 99%