Abstract
In the fast-paced domain of natural language processing, converting linguistic descriptions into mathematical optimization problems is a complex task that demands substantial comprehension and reasoning from Large Language Models (LLMs). This study evaluates several LLMs, including GPT-3.5, GPT-4, and smaller seven-billion-parameter variants: Llama-2, Falcon, Mistral, and Zephyr. Their performance is investigated in both zero-shot and one-shot settings, revealing that GPT-4 outperforms the others, particularly in the one-shot scenario. A core contribution of this study is LM4OPT, a progressive fine-tuning framework designed specifically for smaller LLMs, which leverages noisy embeddings and specialized datasets to enhance model performance. Despite the inherent limitations of smaller models in processing complex and lengthy input contexts, our experimental results show a significant reduction in the performance gap between smaller and larger models when the former are fine-tuned with LM4OPT. Our empirical study on the NL4Opt dataset shows that GPT-4 surpasses the baseline performance established by previous research, achieving an accuracy of 63.30%, based solely on the natural-language problem description and without relying on any additional named-entity information. GPT-3.5 follows closely; both outperform the progressively fine-tuned smaller models.
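The "noisy embeddings" mentioned above plausibly refer to NEFTune-style noise injection, in which uniform noise scaled by alpha / sqrt(seq_len * hidden_dim) is added to the input embeddings during supervised fine-tuning. Below is a minimal sketch of that technique, assuming a Hugging Face causal language model; the model name, alpha value, and the monkey-patching approach are illustrative assumptions, not the paper's exact implementation.

```python
# Sketch of NEFTune-style noisy-embedding fine-tuning (illustrative, not
# the paper's exact LM4OPT setup).
import torch
from transformers import AutoModelForCausalLM

# Assumed 7B variant from the study; any causal LM works the same way.
model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")

def add_neftune_noise(embedding_layer, alpha=5.0):
    """Wrap the input-embedding forward pass so that, in training mode,
    uniform noise scaled by alpha / sqrt(seq_len * hidden_dim) is added."""
    original_forward = embedding_layer.forward

    def noisy_forward(input_ids):
        embeds = original_forward(input_ids)
        if embedding_layer.training:
            seq_len, dim = embeds.shape[-2], embeds.shape[-1]
            scale = alpha / (seq_len * dim) ** 0.5
            noise = torch.empty_like(embeds).uniform_(-scale, scale)
            embeds = embeds + noise
        return embeds

    embedding_layer.forward = noisy_forward

add_neftune_noise(model.get_input_embeddings())
# ...then run a standard supervised fine-tuning loop on NL4Opt-style
# (natural-language problem description -> formulation) pairs.
```

In practice, recent versions of Hugging Face transformers expose this directly via the `neftune_noise_alpha` argument of `TrainingArguments`, so the manual wrapper above is only needed for custom training loops.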