C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA

Sitong An,Federico Sossai,Lorenzo Moneta,Ahmat Hamdan,Sanjiban Sengupta,Aaradhya Saxena

doi:10.1088/1742-6596/2438/1/012013

C++ Code Generation for Fast Inference of Deep Learning Models in ROOT/TMVA

Sitong An, Federico Sossai + Show 4 more

Open Access

https://doi.org/10.1088/1742-6596/2438/1/012013

Copy DOI

Journal: Journal of Physics: Conference Series	Publication Date: Feb 1, 2023
Citations: 1	License type: cc-by

Affiliation: European Organization for Nuclear Research, Carnegie Mellon University, University of Padua, Institut Sous-Régional de Statistique et d'Economie Appliquée, Indian Institute of Technology Bhubaneswar, Indian Institute of Technology Roorkee

#Recurrent Layers #Set Of Benchmarks + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We report the latest development in ROOT/TMVA, a new tool that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. An introduction to SOFIE (System for Optimized Fast Inference code Emit) is presented, with examples of interface and generated code. We discuss the latest expanded support of a variety of neural network operators, including convolutional and recurrent layers, as well as the integration with RDataFrame. We demonstrate the latest performance of this framework with a set of benchmarks.

Full Text