Abstract
Serving machine learning requests with trained models plays an increasingly important role as machine learning models advance and are continuously commercialized. Model serving is also the dominant cost in production-scale machine learning systems, which involve versatile prediction pipelines, complex models, diverse machine learning frameworks, and heterogeneous hardware such as CPUs, GPUs, and TPUs. Serving machine learning pipelines with low latency for a better user experience is key to the success of an e-commerce product. This becomes more challenging for interactive machine learning workloads because of the complex constituents of model serving: models, frameworks, and hardware accelerators. Accessibility, cost, and latency are especially difficult to address.

Hysia, a multi-modal machine learning model serving framework developed by our team, remedies the challenges introduced by the complex interactions between models and hardware. It addresses accessibility, cost, and latency by providing easy-to-use application interfaces and an intelligent controller that jointly optimizes performance to balance the trade-off between resource consumption and prediction accuracy.

This thesis focuses on the design, implementation, and benchmarking of the core utility of the Hysia framework, which provides profile information about models and status information about system resources in order to optimize machine learning pipelines. The core utility plays a significant role in Hysia's joint system performance optimization, and it consists of a model profiler and a resource monitor. The model profiler collects statistics about machine learning models, such as parameter counts, memory usage, and inference latency; its design unifies the differences among various machine learning platforms and ensures extensibility. The resource monitor tracks system resource status, such as memory and GPU utilization, and is capable of retrieving rich system statistics. Both the model profiler and the resource monitor are designed in a distributed way to improve efficiency and support distributed computation.
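The kind of inference-latency measurement the model profiler performs can be illustrated with a minimal, framework-agnostic sketch. The function name `profile_latency` and its parameters are hypothetical, not Hysia's actual API; the idea is only that the profiler wraps a model's inference call behind a generic callable, so the same timing logic applies across machine learning platforms:

```python
import statistics
import time

def profile_latency(predict, inputs, warmup=2, runs=10):
    """Measure per-request inference latency of an arbitrary predict callable.

    Framework-agnostic: `predict` can wrap a TensorFlow, PyTorch, or
    other model's inference call. This is an illustrative sketch, not
    Hysia's real profiler interface.
    """
    # Warm-up runs let caches, JIT compilation, and GPU kernels settle
    # before any measurement is recorded.
    for _ in range(warmup):
        predict(inputs)

    # Timed runs: record wall-clock latency of each inference call.
    latencies = []
    for _ in range(runs):
        start = time.perf_counter()
        predict(inputs)
        latencies.append(time.perf_counter() - start)

    return {
        "mean_s": statistics.mean(latencies),
        "p50_s": statistics.median(latencies),
        "max_s": max(latencies),
    }

# Example: profile a dummy "model" that simulates ~1 ms of inference work.
stats = profile_latency(lambda x: time.sleep(0.001), inputs=None)
print(sorted(stats))
```

Reporting percentiles rather than a single number matters for serving workloads, where tail latency, not the average, usually determines user-perceived responsiveness.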