The Origin of LLMs: ULMFiT or GPT-1?
2025-03-30
This article examines where Large Language Models (LLMs) began. The author retraces the development from ULMFiT to GPT-1 and works through what the definition of an LLM should be. The article argues that ULMFiT may be the first LLM because it meets the key criteria: self-supervised training, next-word prediction, and easy adaptation to a variety of text-based tasks. GPT-1 is better known, largely for its Transformer architecture, but ULMFiT's contribution should not be overlooked. The article also looks at where LLMs are headed, predicting that the term 'LLM' will remain in use as models' capabilities grow, potentially extending to multimodal processing.
AI