Many disruptive changes are happening in the arena of parallel computing, including the use of multiple compute element types (CPUs and GPUs), memory and interconnect types, as well as multiple concurrency models. In the face of these changes, cybersoftware development and debugging will become increasingly hard, requiring principled (formal) debugging approaches that are focused on anticipated classes of bugs. Academic research groups must play a central role in the creation of formal debugging solutions that are to become part of the nation's cyberinfrastructure. These solutions must be scalable and prove effective for debugging the parallel and distributed systems being planned for the coming decade. However, at present, there isn't a critical mass of academic groups engaged in designing such debugging solutions for HPC. In this paper, we summarize some of our own experiences in creating verification tools based on formal principles, pointing out the few instances in which we managed to impact practice. Our key observation is that our successes went hand-in-hand with collaborative tool development with practitioners. Seeking creative ways to facilitate such collaborations should therefore be high priority in our cyberinfrastructure roadmap.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.