Proceedings of the 2018 4th Workshop on Programming Models for SIMD/Vector Processing 2018
DOI: 10.1145/3178433.3178434
|View full text |Cite
|
Sign up to set email alerts
|

Small SIMD Matrices for CERN High Throughput Computing

Abstract: System tracking is an old problem and has been heavily optimized throughout the past. However, in High Energy Physics, many small systems are tracked in real-time using Kalman filtering and no implementation satisfying those constraints currently exists. In this paper, we present a code generator used to speed up Cholesky Factorization and Kalman Filter for small matrices. The generator is easy to use and produces portable and heavily optimized code. We focus on current SIMD architectures (SSE, AVX, AVX512, Ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…The key elements in the process included having flexible data structures that can be grown and shrunk at run time using dynamic memory allocation, but also the possibility of traversing decay trees for the analysis of multi-staged particle decays. In order to reach a high computational speed, the model needed to allow easy vectorisation [10][11][12][13]. At the same time, the new model had to be compatible with the old event model also during the development phase to not break the workflow of the full reconstruction sequence and for quality assurance.…”
Section: The Lhcb Event Modelmentioning
confidence: 99%
“…The key elements in the process included having flexible data structures that can be grown and shrunk at run time using dynamic memory allocation, but also the possibility of traversing decay trees for the analysis of multi-staged particle decays. In order to reach a high computational speed, the model needed to allow easy vectorisation [10][11][12][13]. At the same time, the new model had to be compatible with the old event model also during the development phase to not break the workflow of the full reconstruction sequence and for quality assurance.…”
Section: The Lhcb Event Modelmentioning
confidence: 99%
“…DSLs can either generate high-level code in a more general language or directly go to an IR level such as LLVM-IR. For batched Cholesky factorization and Kalman filters, Lemaitre et al [34] propose a template system. Rodrigues et al [44] specify a small DSL for static tensor multiplications-even parallelizing error correction in 5G base stations [14] warrants a DSL.…”
Section: Related Workmentioning
confidence: 99%