AMD Unveils Instella: A Family of Fully Open 3B Parameter Language Models
2025-03-24
AMD has announced Instella, a family of fully open, state-of-the-art 3-billion-parameter language models (LLMs) trained from scratch on AMD Instinct™ MI300X GPUs. Instella outperforms existing fully open models of similar size and achieves competitive results against leading open-weight models like Llama-3.2-3B. AMD is open-sourcing all model artifacts, including weights, training configurations, datasets, and code, to foster collaboration and innovation within the AI community. The models leverage efficient training techniques and a multi-stage training pipeline.
AI