Guoming Lu scite author profile

Guoming Lu

5 Publications

43 Citation Statements Received

50 Citation Statements Given


Affiliations

University of Chicago, University of Electronic Science and Technology of China

Publications


When is multi-version checkpointing needed?

Zheng, Chien, 2013


The scaling of semiconductor technology and increasing power concerns combined with system scale make fault management a growing concern in high performance computing systems. Greater variety of errors, higher error rates, longer detection intervals, and "silent" errors are all expected. Traditional checkpointing models and systems assume that error detection is nearly immediate and thus preserving a single checkpoint is sufficient for resilience. We define a richer model for future systems that captures the reality of latent errors, i.e., errors that go undetected for some time, and use it to derive optimal checkpoint intervals for systems with latent errors. With that model, we explore the importance of multi-version checkpoint systems. Our results highlight the limits of single checkpoint systems, showing that two to more than ten checkpoints may be needed to achieve acceptable error coverage. Further, to achieve reasonable system efficiency, multiple versions (two to seventeen) may be needed. We study several specific exascale machine scenarios, and the results show that two checkpoints are always beneficial, but when checkpoint overheads are reduced, as many as three checkpoints are beneficial.


A Dynamic Cooperation Model of Multi-Agent System

Zhou, Liao et al. 2008


Digital Audio Asymmetric Watermarking Algorithm Based on Neural Networks in the Wavelet Domain

Liu, Lü, Zhang et al. 2008


Defects in Intrusion Detection System and its Optimization

Shou, Hao et al. 2008


An Improved Coding Algorithm Based On EZW Algorithm and Human Visual Characteristics

Gu, Xue, Fu et al. 2008


