Zekun Yin scite author profile

The last decade has witnessed an explosion in the amount of available biological sequence data, due to the rapid progress of high-throughput sequencing projects. However, the biological data amount is becoming so great that traditional data analysis platforms and methods can no longer meet the need to rapidly perform data analysis tasks in life sciences. As a result, both biologists and computer scientists are facing the challenge of gaining a profound insight into the deepest biological functions from big biological data. This in turn requires massive computational resources. Therefore, high performance computing (HPC) platforms are highly needed as well as efficient and scalable algorithms that can take advantage of these platforms. In this paper, we survey the state-of-the-art HPC platforms for big biological data analytics. We first list the characteristics of big biological data and popular computing platforms. Then we provide a taxonomy of different biological data analysis applications and a survey of the way they have been mapped onto various computing platforms. After that, we present a case study to compare the efficiency of different computing platforms for handling the classical biological sequence alignment problem. At last we discuss the open issues in big biological data analytics.

show abstract

18.9-Pflops nonlinear earthquake simulation on Sunway TaihuLight

Chen

et al. 2017

View full text Add to dashboard Cite

Efficient Parallel Sort on AVX-512-Based Multi-Core and Many-Core Architectures

Yin

Zhang

Müller

et al. 2019

View full text Add to dashboard Cite

SLPal: Accelerating Long Sequence Alignment on Many-Core and Multi-Core Architectures

Chan

et al. 2020

View full text Add to dashboard Cite

RabbitFX: Efficient Framework for FASTA/Q File Parsing on Modern Multi-Core Platforms

Zhang

Song

et al. 2023

IEEE/ACM Trans. Comput. Biol. and Bioinf.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Zekun Yin

Computing Platforms for Big Biological Data Analytics: Perspectives and Challenges

18.9-Pflops nonlinear earthquake simulation on Sunway TaihuLight

Efficient Parallel Sort on AVX-512-Based Multi-Core and Many-Core Architectures

SLPal: Accelerating Long Sequence Alignment on Many-Core and Multi-Core Architectures

RabbitFX: Efficient Framework for FASTA/Q File Parsing on Modern Multi-Core Platforms

Contact Info

Product

Resources

About