Abstract

Researchers and practitioners increasingly apply pre-trained models directly to their specific tasks. For example, researchers in software engineering (SE) have successfully exploited pre-trained language models to automatically generate source code and comments. However, domain gaps exist between benchmark datasets: data-driven (machine learning based) models trained on one benchmark may not perform well on others. Reusing pre-trained models therefore incurs large costs and raises the additional problem of checking whether an arbitrary pre-trained model is suitable for a specific task. In SE, engineers leverage code contracts to maximize the reuse of existing software components and services. By analogy with software reuse in the SE field, contract-based reuse can be extended to pre-trained models. Following the guidance of model cards and FactSheets on what information suppliers of pre-trained models should publish, we propose model contracts that specify the pre- and post-conditions of pre-trained models to enable better model reuse. Although many pre-trained models are readily available in model repositories, several non-trivial yet challenging issues around their reuse have not been fully investigated. Based on our model contract, we conduct an exploratory study of 1,908 pre-trained models in six mainstream model repositories (i.e., TensorFlow Hub, PyTorch Hub, Model Zoo, the Wolfram Neural Net Repository, NVIDIA, and Hugging Face) to investigate the gap between the necessary pre-/post-condition information and the actual specifications. Our results clearly show that (1) the model repositories tend to provide confusing information about pre-trained models, especially about the task type, model, and training set, and (2) the repositories do not provide all of our proposed pre-/post-condition information, especially the intended use, limitations, performance, and quantitative analysis. Based on these findings, we suggest that (1) developers of model repositories provide the necessary options (e.g., training dataset, model algorithm, and performance measures) for each pre-/post-condition of pre-trained models in each task type, (2) future researchers and practitioners develop more effective metrics for recommending suitable pre-trained models, and (3) suppliers of pre-trained models report their models in strict accordance with the proposed pre-/post-conditions and with the characteristics of each condition as reported in the model repositories.
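To make the proposal more concrete, the sketch below shows one possible way such a model contract could be encoded and checked for completeness. It is a minimal illustration only: the field names, their grouping into pre- and post-conditions, and the example values are assumptions made here for illustration, not part of the study.

```python
from dataclasses import dataclass, field
from typing import List, Dict

# Illustrative sketch of a model contract: pre-conditions describe what a
# reuser must know before adopting the model; post-conditions describe what
# the supplier reports about the trained model. Field names, the grouping
# into pre/post, and all example values are assumptions for illustration.

@dataclass
class PreConditions:
    task_type: str                      # e.g., "text classification"
    model_architecture: str             # e.g., "BERT-base"
    training_dataset: str               # e.g., "IMDB reviews"
    intended_use: str                   # scenarios the supplier targets
    limitations: List[str] = field(default_factory=list)

@dataclass
class PostConditions:
    performance: Dict[str, float] = field(default_factory=dict)  # metric -> score
    quantitative_analysis: str = ""     # e.g., per-class or slice-level results

@dataclass
class ModelContract:
    model_name: str
    pre: PreConditions
    post: PostConditions

    def is_complete(self) -> bool:
        """Check that the contract fields a reuser needs are actually filled in."""
        return all([
            self.pre.task_type,
            self.pre.model_architecture,
            self.pre.training_dataset,
            self.pre.intended_use,
            self.post.performance,
        ])

# Hypothetical usage: a repository entry that omits performance numbers
# fails the completeness check, mirroring the kind of gap the study measures.
contract = ModelContract(
    model_name="example-sentiment-model",
    pre=PreConditions(
        task_type="text classification",
        model_architecture="BERT-base",
        training_dataset="IMDB reviews",
        intended_use="English movie-review sentiment analysis",
        limitations=["not evaluated on non-English text"],
    ),
    post=PostConditions(performance={}),
)
print(contract.is_complete())  # False: post-condition information is missing
```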
