A Tool Suite for Simulation Based Analysis of Memory Access Behavior

Weidendorfer, Josef; Kowarschik, Markus; Trinitis, Carsten

doi:10.1007/978-3-540-24688-6_58

Cited by 101 publications

(63 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…3. The callgraph is largely similar to the callgraphs given by other tools, such as callgrind [11], with the exception that the nodes are not only functions but also OpenMP constructs and user-defined regions, and the (runtime) nesting of those constructs is shown in the callgraph view. The callgraph that ompP records is the union of the callgraph of each thread.…”

Section: ] -----------------------------------------------mentioning

confidence: 92%

Visualizing the Program Execution Control Flow of OpenMP Applications

Fürlinger

Moore

2008

OpenMP in a New Era of Parallelism

View full text Add to dashboard Cite

Abstract. One important aspect of understanding the behavior of an application with respect to its performance, overhead, and scalability characteristics is knowledge of its control flow. In comparison to sequential applications the situation is more complicated in multithreaded parallel programs because each thread defines its own independent control flow. On the other hand, for the most common usage models of OpenMP the threads operate in a largely uniform way, synchronizing frequently at sequence points and diverging only to operate on different data items in worksharing constructs. This paper presents an approach to capture and visualize the control flow of OpenMP applications in a compact way that does not require a full trace of program execution events but is instead based on a straightforward extension to the data collected by an existing profiling tool.

show abstract

Section: ] -----------------------------------------------mentioning

confidence: 92%

Visualizing the Program Execution Control Flow of OpenMP Applications

Fürlinger

Moore

2008

OpenMP in a New Era of Parallelism

View full text Add to dashboard Cite

show abstract

“…Cache simulations have been performed using Callgrind, [63] and for a density of ρσ 3 = 0.5, approximately half of the cache misses result from accesses to the contents of the neighbor lists.The remaining cache misses are associated with accesses to particle and event data.…”

Section: Benchmarkingmentioning

confidence: 99%

DynamO: a free ${\cal O}$(N) general event‐driven molecular dynamics simulator

Bannerman

Sargant

Lue³

2011

J Comput Chem

161

137

View full text Add to dashboard Cite

Molecular dynamics algorithms for systems of particles interacting through discrete or "hard" potentials are fundamentally different to the methods for continuous or "soft" potential systems. Although many software packages have been developed for continuous potential systems, software for discrete potential systems based on event-driven algorithms are relatively scarce and specialized. We present DynamO, a general event-driven simulation package, which displays the optimal O(N) asymptotic scaling of the computational cost with the number of particles N, rather than the O(N log N) scaling found in most standard algorithms. DynamO provides reference implementations of the best available event-driven algorithms. These techniques allow the rapid simulation of both complex and large (>10 6 particles) systems for long times.The performance of the program is benchmarked for elastic hard sphere systems, homogeneous cooling and sheared inelastic hard spheres, and equilibrium LennardJones fluids. This software and its documentation are distributed under the GNU General Public license and can be freely downloaded from http://marcusbannerman.co.uk/dynamo.

show abstract

“…The analysis has been made using a variety of tools including igProf [2] , callgrind [3] [4] and AMD CodeAnalyst. For example, we improved by a factor 3 to 10, depending on the length and complexity of the class name, the performance of the TTree::SetBranchAddress and TTree::SetAddress routines.…”

Section: Performance Enhancementmentioning

confidence: 99%