ZAYA1: AI model using AMD GPUs for training hits milestone
📖 Article Preview
Zyphra, AMD, and IBM have successfully trained ZAYA1, a large-scale Mixture-of-Experts foundation model built entirely on AMD's Instinct MI300X GPUs, marking a significant milestone in AI infrastructure independence from NVIDIA. This achievement demonstrates that enterprise-grade AI training can be effectively supported by AMD's hardware and networking solutions, utilizing Pensando networking and ROCm software within IBM Cloud's infrastructure, and achieving performance comparable or superior to established models in reasoning, mathematics, and coding tasks. The deployment of AMD's MI300X GPUs, each equipped with 192GB of high-band
Read the Complete Article
Get the full story with in-depth analysis, expert insights, and comprehensive coverage from the original source.
Stay Informed
Get the latest AI insights and breakthroughs delivered to your inbox weekly.
We respect your privacy. Unsubscribe at any time. Privacy Policy