Abstract
Many different methods for prompting large language models have been developed since the emergence of OpenAI's ChatGPT in November 2022. In this work, we evaluate six few-shot prompting methods. The first set of experiments evaluates three frameworks that focus on the quantity or type of shots in a prompt: a baseline method with a simple prompt and a small number of shots, random few-shot prompting with 10, 20, and 30 shots, and similarity-based few-shot prompting. The second set of experiments targets optimizing the prompt or enhancing shots through Large Language Model (LLM)-generated explanations, using three prompting frameworks: Explain then Translate, Question Decomposition Meaning Representation, and Optimization by Prompting. We evaluate these six prompting methods on the newly created Spider4SPARQL benchmark, as it is the most complex SPARQL-based Knowledge Graph Question Answering (KGQA) benchmark to date. Across the various prompting frameworks, the commercial model is unable to achieve a score above 51%, indicating that KGQA, especially for complex queries with multiple hops, set operations, and filters, remains a challenging task for LLMs. Our experiments find that the most successful prompting framework for KGQA is a simple prompt combined with an ontology and five random shots.