The Bigger the Better?

Susanne Förster

doi:10.7146/aprja.v12i1.140444

Abstract

This article looks at a controversy over the ‘better’ architecture for conversational AI that unfolds initially along the question of the ‘right’ size of models. Current generative models such as ChatGPT and DALL-E follow the imperative of the largest possible, ever more highly scalable, training dataset. I therefore first describe the technical structure of large language models and then address the problems of these models which are known for reproducing societal biases or so-called hallucinations. As an ‘alternative’, computer scientists and AI experts call for the development of much smaller language models linked to external databases, that should minimize the issues mentioned above. As this paper will show, the presentation of this structure as ‘alternative’ adheres to a simplistic juxtaposition of different architectures that follows the imperative of a computable reality, thereby causing problems analogous to the ones it tried to circumvent.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The Bigger the Better?

Abstract

Talk to us

Similar Papers

More From: A Peer-Reviewed Journal About

Lead the way for us

Journal: A Peer-Reviewed Journal About	Publication Date: Sep 7, 2023
License type: CC BY-NC-SA 4.0

Similar Papers

Improving the effectiveness of language modeling approaches to information retrieval
Yuanhua Lv
ACM SIGIR Forum | VOL. 46
Yuanhua LvYuanhua Lv
21 Dec 2012
ACM SIGIR Forum | VOL. 46

A Novel Procedure to Represent Lightning Return Strokes—Current Dissipation Return Stroke Models
Vernon Cooray
IEEE Transactions on Electromagnetic Compatibility | VOL. 51
Vernon CoorayVernon Cooray
01 Aug 2009
IEEE Transactions on Electromagnetic Compatibility | VOL. 51

On the Relationship Between the Signature of Close Electric Field and the Equivalent Corona Current in Lightning Return Stroke Models
V. Cooray ... R. Montano
IEEE Transactions on Electromagnetic Compatibility | VOL. 50
V. Cooray, et. al.V. Cooray ... R. Montano
01 Nov 2008
IEEE Transactions on Electromagnetic Compatibility | VOL. 50

Generating synthetic mixed-type longitudinal electronic health records for artificial intelligent applications
Jin Li ... Benjamin J Cairns
npj Digital Medicine | VOL. 6
Jin Li, et. al.Jin Li ... Benjamin J Cairns
27 May 2023
npj Digital Medicine | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Bigger the Better?

Abstract

Talk to us

Similar Papers

More From: A Peer-Reviewed Journal About