Abstract
In natural language generation, most decoding methods are not intrinsic: their performance depends on extrinsically configured hyperparameters. This has two consequences. First, the generation system behaves dynamically under different conditions, yet once its hyperparameters are extrinsically fixed, the decoding system remains static under all conditions. Second, it is hard to select a single constant decoding hyperparameter that works well for every condition. There are hyperparameter-free decoding methods, such as greedy decoding and plain sampling, but it is well established that they generally perform worse than methods with hyperparameters, such as beam search, top-k, and top-p. A method with hyperparameters yields infinitely many strategies from its different fixed configurations, whereas a hyperparameter-free method yields only one; the usual comparison between them is therefore unfair, a one-vs-infinite battle. How, then, should decoding hyperparameters be handled properly and intrinsically? Are hyperparameter-free methods necessarily inferior to methods with inexhaustible hyperparameter configurations? Can a generalized framework be designed in which these decoding methods are naturally connected, uniformly described, and mutually inspired? In this paper, we seek answers to these questions. To this end, we first propose a generalized decoding framework, GSD, that uniformly describes and connects existing popular decoding methods. To the best of our knowledge, this is the first work to build a theoretical framework that relates these decoding methods through formal mathematical theorems. Building on this framework, we then propose Intrinsic Decoding, a novel implementation of GSD whose design is distinct from existing decoding algorithms: it is intrinsic and dynamic, turning the aforementioned comparison from one-vs-infinite into dynamic-vs-infinite.
Like greedy decoding and sampling, Intrinsic Decoding has no hyperparameters, yet it performs better than both, and it even achieves performance comparable to methods equipped with inexhaustible hyperparameter configurations, such as beam search, top-k, and top-p.
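To make the contrast concrete, below is a minimal sketch (not the paper's implementation of GSD or Intrinsic Decoding) of the two families of methods discussed above: greedy decoding, which is hyperparameter-free, versus top-k and top-p (nucleus) sampling, whose behavior depends on the extrinsic hyperparameters `k` and `p`. All function names here are illustrative.

```python
import random

def greedy(probs):
    # Hyperparameter-free: always pick the most probable token.
    return max(range(len(probs)), key=lambda i: probs[i])

def top_k_sample(probs, k, rng):
    # Hyperparameter k: keep only the k most probable tokens,
    # renormalize, then sample from the truncated distribution.
    kept = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    r = rng.random() * sum(probs[i] for i in kept)
    for i in kept:
        r -= probs[i]
        if r <= 0:
            return i
    return kept[-1]

def top_p_sample(probs, p, rng):
    # Hyperparameter p: keep the smallest set of most probable tokens
    # whose cumulative mass reaches p, renormalize, then sample.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cum = [], 0.0
    for i in order:
        kept.append(i)
        cum += probs[i]
        if cum >= p:
            break
    r = rng.random() * sum(probs[i] for i in kept)
    for i in kept:
        r -= probs[i]
        if r <= 0:
            return i
    return kept[-1]

# A toy next-token distribution over a 5-token vocabulary.
probs = [0.5, 0.3, 0.1, 0.06, 0.04]
rng = random.Random(0)
print(greedy(probs))                  # always token 0
print(top_k_sample(probs, 2, rng))    # restricted to tokens {0, 1}
print(top_p_sample(probs, 0.8, rng))  # restricted to tokens {0, 1}
```

Changing `k` or `p` changes which tokens survive truncation, so each configuration is effectively a different decoding strategy; greedy decoding, by contrast, admits exactly one.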