DARPA's Ubiquitous High-Performance Computing (UHPC) program asked researchers to develop computing systems capable of achieving energy efficiencies of 50 GOPS/Watt, assuming 2018-era fabrication technologies. This paper describes Runnemede, the research architecture developed by the Intel-led UHPC team. Runnemede is being developed through a co-design process that considers the hardware, the runtime/OS, and applications simultaneously. Near-threshold voltage operation, fine-grained power and clock management, and separate execution units for runtime and application code are used to reduce energy consumption. Memory energy is minimized through application-managed on-chip memory and direct physical addressing. A hierarchical on-chip network reduces communication energy, and a codelet-based execution model supports extreme parallelism and fine-grained tasks. We present an initial evaluation of Runnemede that shows the design process for our on-chip network, demonstrates 2-4x improvements in memory energy from explicit control of on-chip memory, and illustrates the impact of hardware-software co-design on the energy consumption of a synthetic aperture radar algorithm on our architecture.
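The codelet execution model is only named in the abstract; as a rough, hypothetical illustration (the names and scheduler below are not the Runnemede runtime API), a codelet can be pictured as a small, non-blocking task that runs to completion once all of its declared dependences are satisfied:

    # Minimal sketch of a codelet-style scheduler (illustrative only; not the
    # Runnemede runtime interface). A codelet runs to completion once all of
    # its declared dependences have been satisfied.
    from collections import deque

    class Codelet:
        def __init__(self, name, deps, work):
            self.name = name          # label used by dependents
            self.pending = set(deps)  # names of codelets this one waits on
            self.work = work          # non-blocking body, runs to completion

    def run(codelets):
        ready = deque(c for c in codelets if not c.pending)
        waiting = [c for c in codelets if c.pending]
        while ready:
            c = ready.popleft()
            c.work()
            for w in waiting:                     # release dependents
                w.pending.discard(c.name)
            ready.extend(w for w in waiting if not w.pending)
            waiting = [w for w in waiting if w.pending]

    # Hypothetical three-stage pipeline expressed as codelets.
    run([Codelet("load",  [],       lambda: print("load tile")),
         Codelet("fft",   ["load"], lambda: print("fft tile")),
         Codelet("store", ["fft"],  lambda: print("store tile"))])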
For applications that deal with large amounts of high-dimensional, multi-aspect data, it becomes natural to represent such data as tensors or multi-way arrays. Multi-linear algebraic computations such as tensor decompositions are performed to summarize and analyze such data. Their use in real-world applications spans domains such as signal processing, data mining, computer vision, and graph analysis. The major challenges in applying tensor decompositions to real-world applications are (1) dealing with large-scale, high-dimensional data and (2) dealing with sparse data. In this paper, we address these challenges in applying tensor decompositions in real data analytics applications. We describe new sparse tensor storage formats that provide storage benefits and are flexible and efficient for performing tensor computations. Further, we propose an optimization that improves data reuse and reduces redundant or unnecessary computations in tensor decomposition algorithms. Furthermore, we couple our data reuse optimization with the benefits of our sparse tensor storage formats to provide a memory-efficient, scalable solution for handling large-scale sparse tensor computations. We demonstrate improved performance and address memory scalability using our techniques on both small synthetic data sets and large-scale sparse real data sets.
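The abstract does not spell out the proposed storage formats; as a point of reference, a minimal coordinate (COO) sparse tensor sketch (hypothetical code, not the paper's formats) keeps only the nonzeros and their multi-indices, which is the kind of baseline such formats aim to improve on:

    # Minimal coordinate (COO) sparse tensor sketch (illustrative baseline,
    # not the storage formats proposed in the paper). Only nonzero entries
    # and their index tuples are kept, so memory scales with the nonzeros.
    class SparseTensorCOO:
        def __init__(self, shape):
            self.shape = shape     # e.g. (I, J, K) for a 3-way tensor
            self.indices = []      # one index tuple per nonzero
            self.values = []       # corresponding nonzero values

        def add(self, index, value):
            self.indices.append(index)
            self.values.append(value)

        def mttkrp_mode0(self, B, C):
            """Mode-0 MTTKRP with dense factor matrices B (J x R) and
            C (K x R), the kernel at the heart of CP decomposition."""
            I = self.shape[0]
            R = len(B[0])
            out = [[0.0] * R for _ in range(I)]
            for (i, j, k), v in zip(self.indices, self.values):
                for r in range(R):
                    out[i][r] += v * B[j][r] * C[k][r]
            return out

Because every nonzero carries a full index tuple, COO is simple but index-heavy; compressed formats trade this simplicity for lower storage and better data reuse in kernels like the MTTKRP loop above.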
We provide concrete evidence that floating-point computations in C programs can be verified in a homogeneous verification setting based on Coq only, by evaluating the practicality of combining the formal semantics of CompCert Clight with the Flocq formal specification of IEEE 754 floating-point arithmetic to verify properties of floating-point computations in C programs. To this end, we develop a framework that automatically computes real-number expressions of C floating-point computations with rounding error terms, along with their correctness proofs. We apply our framework to the complete analysis of an energy-efficient C implementation of a radar image processing algorithm, for which we provide a certified bound on the total noise introduced by floating-point rounding errors and by energy-efficient approximations of square root and sine.
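The "rounding error terms" referred to here follow, in spirit, the standard model of IEEE 754 arithmetic; the sketch below states it for binary64 under round-to-nearest and barring underflow/overflow (the paper's exact Flocq formalization may differ):

    \[
      \mathrm{fl}(x \circ y) = (x \circ y)(1 + \delta), \qquad
      |\delta| \le u = 2^{-53}, \qquad \circ \in \{+, -, \times, /\},
    \]
    \[
      \text{e.g.}\quad
      \mathrm{fl}(a + b \cdot c) = \bigl(a + b\,c\,(1 + \delta_1)\bigr)(1 + \delta_2),
      \qquad |\delta_1|, |\delta_2| \le u,
    \]

so a real-number expression with one bounded error term per elementary operation can be built up alongside the C computation.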