Proceedings of the 2007 ACM/IEEE Conference on Supercomputing 2007
DOI: 10.1145/1362622.1362652
Scaling performance of interior-point method on large-scale chip multiprocessor system

Abstract: In this paper we describe parallelization of the interior-point method (IPM) aimed at achieving high scalability on large-scale chip multiprocessors (CMPs). IPM is an important computational technique used to solve optimization problems in many areas of science, engineering and finance. IPM spends most of its computation time in a few sparse linear algebra kernels. While each of these kernels contains a large amount of parallelism, sparse irregular datasets seen in many optimization problems make parallelism diffic…
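The abstract's core technique can be illustrated with a toy sketch. The paper's contribution is parallelizing the sparse linear algebra inside IPM; the snippet below only shows the serial skeleton of a log-barrier interior-point iteration on a one-variable problem (minimize x subject to x >= 1), with all names and parameters being illustrative assumptions, not the paper's solver:

```python
# Toy log-barrier interior-point sketch (illustrative only; the paper's solver
# targets large sparse systems, not this one-variable problem).
# Barrier subproblem: minimize t*x - log(x - 1), whose exact minimizer is
# x = 1 + 1/t, approaching the true optimum x = 1 as t grows.

def barrier_step(t, x, iters=50):
    """Damped Newton iterations on the barrier objective t*x - log(x - 1)."""
    for _ in range(iters):
        grad = t - 1.0 / (x - 1.0)       # first derivative
        hess = 1.0 / (x - 1.0) ** 2      # second derivative (always positive)
        step = grad / hess
        while x - step <= 1.0:           # backtrack to stay strictly feasible
            step *= 0.5
        x -= step
    return x

def interior_point(x=2.0, t=1.0, mu=10.0, outer=6):
    """Path following: solve the barrier subproblem, then increase t."""
    for _ in range(outer):
        x = barrier_step(t, x)
        t *= mu
    return x

print(interior_point())  # approaches the optimum x = 1
```

In a real IPM each Newton step solves a large sparse linear system (the role played here by the scalar `grad / hess`), which is exactly the kernel whose parallelization the paper studies.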

Cited by 7 publications (3 citation statements) · References 21 publications
“…Eleyat and Natvig [16] propose an optimized parallel solver leveraging the special capabilities of a Cell processor. Smelyanskiy et al. [17] present an optimization of the interior-point methods for large-scale chip multiprocessor systems. Budiu et al. [18] describe an alternative implementation of a distributed branch-and-bound solver.…”
Section: Related Work
confidence: 99%
“…On the other hand, Algorithm 3.1 typically spends only 10% of the time in the PGS phase and 90% in the subspace minimization. Hence it effectively offloads the parallelization task to a direct linear solver that is known to parallelize well [28]. Specifically, our adaptation of PARDISO, the Cholesky solver used for the experiments in this paper, yields more than 65% utilization on a 64-core chip multiprocessor simulator (see also Schenk [25] for a scalability study for a moderate number of processors on a coarsely coupled shared memory system).…”
Section: Final Remarks
confidence: 99%
“…Smelyanskiy et al. (2007) used supernode-based blocking without use of amalgamation. Instead, they show, using a cycle-accurate simulator, that the hardware support for low-overhead task queues proposed by Kumar et al. (2007) can be used to accelerate the scheduling of small tasks.…”
Section: Introduction
confidence: 99%