A judiciously chosen symmetric permutation can significantly reduce the amount of storage and computation for the Cholesky factorization of sparse matrices. On distributed memory machines, the issue of mapping data and computation on processors is also important. Previous research on ordering for parallelism has focussed on idealized measures like ezecution time on an unbounded number of processors, with zero communication costs. In this paper, we propose an ordering and mapping algorithm that attempts to minimize communication and performs load-balancing of work among the processors. Performance results on an Intel iPSC/860 hypercube are presented to demonstrate its effectiveness.
Abstrslct -The concepi of supernodes has been widely used in the design of algorithms or the solurion of sparse linear systems of equations. his paper discusses the use of supernodes in (he design of algorithms for s wse Cholesky facroriralion on distriburod-memory mu&-cessors. A new algorithm that is communicarion eflcient, hus good load bulunce, and benefrls signifrcanrly from supernodes is presented. A taronomy of dislribured s arse Cholesb faclorizalbn algorithm is proposed. &rformance results on an Inre1 ffSCiS60 rndtiproerrsor are reporled.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.