MLbench

Yu Liu,Hantian Zhang,Ce Zhang,Luyuan Zeng,Wentao Wu

doi:10.14778/3231751.3231770

Abstract

Modern machine learning services and systems are complicated data systems --- the process of designing such systems is an art of compromising between functionality , performance , and quality . Providing different levels of system supports for different functionalities, such as automatic feature engineering, model selection and ensemble, and hyperparameter tuning, could improve the quality, but also introduce additional cost and system complexity. In this paper, we try to facilitate the process of asking the following type of questions: How much will the users lose if we remove the support of functionality x from a machine learning service? Answering this type of questions using existing datasets, such as the UCI datasets, is challenging. The main contribution of this work is a novel dataset, MLBench, harvested from Kaggle competitions. Unlike existing datasets, MLBench contains not only the raw features for a machine learning task, but also those used by the winning teams of Kaggle competitions. The winning features serve as a baseline of best human effort that enables multiple ways to measure the quality of machine learning services that cannot be supported by existing datasets, such as relative ranking on Kaggle and relative accuracy compared with best-effort systems. We then conduct an empirical study using MLBench to understand example machine learning services from Amazon and Microsoft Azure, and showcase how MLBench enables a comparative study revealing the strength and weakness of these existing machine learning services quantitatively and systematically. The full version of this paper can be found at arxiv.org/abs/1707.09562

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MLbench

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment

Lead the way for us

Journal: Proceedings of the VLDB Endowment	Publication Date: Jun 1, 2018
Citations: 21

Similar Papers

Construction of Virtual Metrology Cloud Platform with Machine Learning Tools for Providing Factory-Wide Manufacturing Service
Tang-Hsuan O ... Min-Hsiung Hung
-
Tang-Hsuan O, et. al.Tang-Hsuan O ... Min-Hsiung Hung
01 Jan 2020
01 Jan 2020

QoA4ML - A Framework for Supporting Contracts in Machine Learning Services
Hong-Linh Truong ... Tri-Minh Nguyen
-
Hong-Linh Truong, et. al.Hong-Linh Truong ... Tri-Minh Nguyen
01 Sep 2021
01 Sep 2021

Automated machine learning and neural architecture optimization
Pravin P Adivarekar1 ... Ravi Rastogi
The Scientific Temper | VOL. 14
Pravin P Adivarekar1, et. al.Pravin P Adivarekar1 ... Ravi Rastogi
27 Dec 2023
The Scientific Temper | VOL. 14

Tools and frameworks for machine learning and deep learning: A review
Nitin Liladhar Rane ... Jayesh Rane
-
Nitin Liladhar Rane, et. al.Nitin Liladhar Rane ... Jayesh Rane
13 Oct 2024
13 Oct 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MLbench

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment