The Issues of Large Language Models indicated by Addition Experiments on GPT4

MOTOHIRO Okaya

doi:10.11517/jsaisigtwo.2023.agi-024_02

Bibliographic Information

Other Title

GPT-4による足し算実験から示唆されるLarge Language Modelsの課題

Abstract

<p>In this study, I evaluate the proficiency of GPT-4, by OpenAI, particularly focusing on its handling of simple high-digit addition tasks. While GPT-4 exhibits impressive capabilities in various tasks, it showed inconsistencies when dealing with ten-digit addition problems. My examination showed that while GPT-4 correctly solved all three-digit additions, it was only 60% accurate for ten-digit additions. Adding prompts to encourage a step-by-step addition process did not improve this accuracy. I suggest that this limitation may be due to the inability of large language models (LLMs) to extract commonalities from different concepts, as seen in the process of addition. This difference between human cognition and LLMs may be crucial for the further development of these models.</p>

Journal

JSAI Technical Report, Type 2 SIG

JSAI Technical Report, Type 2 SIG 2023 (AGI-024), 02-, 2023-08-08

The Japanese Society for Artificial Intelligence

Details 詳細情報について

CRID: 1390859989726701312

DOI: 10.11517/jsaisigtwo.2023.agi-024_02

ISSN: 24365556

Text Lang: ja

Data Source

JaLC

Abstract License Flag: Allowed

Export