2021
DOI: 10.3389/fdata.2020.604083
GPU-Accelerated Machine Learning Inference as a Service for Computing in Neutrino Experiments

Abstract: Machine learning algorithms are becoming increasingly prevalent and performant in the reconstruction of events in accelerator-based neutrino experiments. These sophisticated algorithms can be computationally expensive. At the same time, the data volumes of such experiments are rapidly increasing. The demand to process billions of neutrino events with many machine learning algorithm inferences creates a computing challenge. We explore a computing model in which heterogeneous computing with GPU coprocessors is m…
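To make the abstract's computing model concrete: in the as-a-service approach, the event-processing job does not run the network locally but ships input tensors to a remote GPU-backed inference server. The sketch below is a minimal, hypothetical Python client assuming a Triton-style gRPC endpoint; the server URL, model name "cnn_classifier", and tensor names/shapes are illustrative placeholders, not taken from the paper.

# Minimal sketch of the as-a-service model from the event-processing side.
# Assumes a reachable Triton-style gRPC server; the URL, model name, and
# tensor names/shapes below are illustrative placeholders.
import numpy as np
import tritonclient.grpc as grpcclient

# Connect to the remote GPU-backed inference server (placeholder address).
client = grpcclient.InferenceServerClient(url="triton.example.org:8001")

# Pack one batch of detector images as the model's input tensor.
batch = np.random.rand(8, 1, 224, 224).astype(np.float32)  # stand-in event data
infer_input = grpcclient.InferInput("INPUT__0", list(batch.shape), "FP32")
infer_input.set_data_from_numpy(batch)

# Request the output tensor and run inference on the server's GPU.
result = client.infer(
    model_name="cnn_classifier",
    inputs=[infer_input],
    outputs=[grpcclient.InferRequestedOutput("OUTPUT__0")],
)
scores = result.as_numpy("OUTPUT__0")  # classifier scores back on the client

The worker node in this model needs only CPU and network connectivity; the GPUs are pooled behind the server and shared across many clients.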

Cited by 24 publications (16 citation statements: 0 supporting, 16 mentioning, 0 contrasting). References 29 publications.
“…With ML becoming increasingly common in neutrino experiments, the community is further steering its attention toward hardware acceleration of ML-based inference. GPU-accelerated ML inference as a service for computing in neutrino experiments is discussed in [229], while new developments are also targeting GPU- or FPGA-based acceleration for use of machine learning algorithms such as 1D or 2D CNNs in real-time or online processing of raw LArTPC data at the data acquisition and trigger level [197][198][199].…”
Section: B. Neutrino Experiments (mentioning)
confidence: 99%
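For orientation on the algorithms named in this excerpt, a 2D CNN for LArTPC data typically treats the wire-versus-time-tick readout of a plane as an image. Below is a minimal illustrative sketch in PyTorch; the layer widths, input size, and two-class output are hypothetical choices for illustration, not a reconstruction of any network cited above.

# Illustrative 2D CNN over a wire-vs-time LArTPC "image"; sizes are hypothetical.
import torch
import torch.nn as nn

class TinyLArTPCNet(nn.Module):
    def __init__(self, n_classes: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),  # one readout plane in
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # collapse to per-channel averages
        )
        self.classifier = nn.Linear(32, n_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.features(x).flatten(1)
        return self.classifier(h)

# One fake batch: 4 events, 1 plane, 256 wires x 256 time ticks.
net = TinyLArTPCNet()
logits = net(torch.randn(4, 1, 256, 256))
print(logits.shape)  # torch.Size([4, 2])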
“…Other heterogeneous computing resources specialized for inference may be even more beneficial. This speed-up may benefit the experiments' computing workflows by accessing these resources as an on-demand, scalable service [46][47][48].…”
Section: Inference Timing (mentioning)
confidence: 99%
“…Significant motivation is taken from the integration of these concepts for usage in high energy physics (HEP), where recent algorithmic advances and the availability of large datasets have greatly facilitated the adoption of ML. Previous work with experiments at the CERN Large Hadron Collider and the ProtoDUNE-SP experiment at the Fermi National Accelerator Laboratory has shown that the as-a-service computing model has the potential to offer impressive speed-ups, improved performance, and reduced complexity relative to traditional computing models (Krupa 2021; Rankin et al. 2020; Wang et al. 2020). These works have also demonstrated the ability to perform inference as a service with both GPUs and FPGAs.…”
Section: Appendix (mentioning)
confidence: 99%
“…In order to take full advantage of accelerators, modifications must be made to the standard model of computing, in which pipelines directly manage the accelerated resources they use for execution. An alternative model, which has gained popularity in other fields, is called “as-a-service” (Krupa 2021; Wang et al. 2020). When used specifically to denote accelerated ML inference, it is referred to as Inference-as-a-Service (IaaS).…”
mentioning
confidence: 99%
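To make the contrast in this last excerpt concrete, the schematic below compares the standard model, where the pipeline provisions and manages its own accelerator, with the as-a-service model, where inference is delegated to a shared remote server. All names are hypothetical and the remote call is stubbed so the sketch runs; a real IaaS client would issue a network request like the Triton-style sketch after the abstract.

# Schematic contrast of the two computing models; all names are hypothetical.
import numpy as np

def local_infer(batch: np.ndarray) -> np.ndarray:
    # Standard model: the pipeline owns the accelerator, loads the model,
    # and must provision a GPU on every worker node (stubbed as a matmul).
    weights = np.ones((batch.shape[1], 2), dtype=np.float32)
    return batch @ weights

def remote_infer(batch: np.ndarray, endpoint: str) -> np.ndarray:
    # As-a-service model: the pipeline ships tensors to a shared GPU server
    # and needs only CPU plus network locally. A real client call (e.g. gRPC
    # to `endpoint`) would go here; reuse the stub so the sketch executes.
    _ = endpoint  # placeholder for an address like "triton.example.org:8001"
    return local_infer(batch)

batch = np.random.rand(8, 4).astype(np.float32)
assert np.allclose(local_infer(batch), remote_infer(batch, "triton.example.org:8001"))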