Low-cost language models: Survey and performance evaluation on Python code generation

Jessica López Espejel,Mahaman Sanoussi Yahaya Alassan,Merieme Bouhandi,Walid Dahhane,El Hassane Ettifouri

doi:10.1016/j.engappai.2024.109490

Abstract

Large Language Models (LLMs) have become a popular choice for many Natural Language Processing (NLP) tasks due to their versatility and ability to produce high-quality results. Specifically, they are increasingly used for automatic code generation to help developers tackle repetitive coding tasks. However, LLMs’ substantial computational and memory requirements often make them inaccessible to users with limited resources. This paper focuses on very low-cost models which offer a more accessible alternative to resource-intensive LLMs. We notably: (1) propose a thorough semi-manual evaluation of their performance in generating Python code, (2) introduce a Chain-of-Thought (CoT) prompting strategy to improve model reasoning and code quality, and (3) propose a new dataset of 60 programming problems, with varied difficulty levels, designed to extend existing benchmarks like HumanEval and EvalPlus. Our findings show that some low-cost compatible models achieve competitive results compared to larger models like ChatGPT despite using significantly fewer resources. We will make our dataset and prompts publicly available to support further research.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Low-cost language models: Survey and performance evaluation on Python code generation

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence

Lead the way for us

Similar Papers

Developing healthcare language model embedding spaces
Niall Taylor ... Alejo Nevado-Holgado
Artificial Intelligence In Medicine | VOL. 158
Niall Taylor, et. al.Niall Taylor ... Alejo Nevado-Holgado
31 Oct 2024
Artificial Intelligence In Medicine | VOL. 158

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Jingfeng Yang ... Haoming Jiang
ACM Transactions on Knowledge Discovery from Data | VOL. 18
Jingfeng Yang, et. al.Jingfeng Yang ... Haoming Jiang
26 Apr 2024
ACM Transactions on Knowledge Discovery from Data | VOL. 18

Large language models for biomedicine: foundations, opportunities, challenges, and best practices.
Satya S Sahoo ... Trevor Cohen
Journal of the American Medical Informatics Association : JAMIA | VOL. 31
Satya S Sahoo, et. al.Satya S Sahoo ... Trevor Cohen
24 Apr 2024
Journal of the American Medical Informatics Association : JAMIA | VOL. 31

A Bibliometric Review of Large Language Models Research from 2017 to 2023
Lizhou Fan ... Lingyao Li
ACM Transactions on Intelligent Systems and Technology | VOL. 15
Lizhou Fan, et. al.Lizhou Fan ... Lingyao Li
21 Oct 2024
A Bibliometric Review of Large Language Models Research from 2017 to 2023
Lizhou Fan ... Lingyao Li

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Low-cost language models: Survey and performance evaluation on Python code generation

Abstract

Talk to us

Similar Papers

More From: Engineering Applications of Artificial Intelligence