AMD Ryzen™ AI Software 1.4: Features for Next Gen AI PCs

Apr 18, 2025

Introduction

We are excited to announce the AMD Ryzen AI 1.4 software release for AMD AI PCs. Ryzen AI 1.4 software highlights new capabilities including additional hardware, model support, and new ease of use developer tools. Ryzen AI 1.4 provides support for state-of-the-art Large Language Models (LLMs), Natural Language Processing (NLP) models, and Convolutional Neural Networks (CNNs). For developers and end users, it enables seamless compilation and deployment of models in INT8 or BF16 configurations, providing flexibility to build applications without altering their environment. Additionally, Ryzen AI 1.4 introduces new developer tools like Digest AI, Lemonade SDK, GAIA, and TurnkeyML, making it easier to work with SOTA models and accelerate AI application deployment. This update enhances model performance, enables new experiences, and offers a simplified approach to deploying AI on AMD PCs, ensuring developers have the tools needed to innovate in the rapidly evolving field of AI.

Unified Support for LLMs, NLP and CNN Models in a Single Release Package

Ryzen AI 1.4 offers support for LLM (Deepseek, Gemma, QWEN, etc.), NLP (BERT, Embedding Models families) and CNN models. It offers a unified installer allowing end users to seamlessly compile their CNN and NLP models in either INT8 or BF16 configurations, as well as deploying ready to run LLMs in hybrid and NPU only execution modes. This streamlined approach simplifies the development process, offering ease of use and improving developer efficiency.

Quantization and Compilation for BF16 CNN and NLP Models on Windows

Ryzen AI provides the tools and recipes for quantization and compilation with BF16 data type on Windows delivering improved performance without accuracy loss. BF16 quantization uses 16 bits with a floating-point format that preserves much of the dynamic range of full 32-bit floating-point (FP32) computations, making it more suitable for models that rely on small or very large numbers, such as deep neural networks that require high dynamic range for weights and activations.

Public Release of LLM Hybrid ONNX Runtime Gen AI (OGA) Flow

The 1.4 release enables developers to run LLMs in hybrid or NPU-only execution mode. In hybrid execution mode, developers can deploy LLMs on NPU as well as iGPU. Hybrid execution mode optimally partitions the model such that different operations are scheduled on NPU and iGPU. This minimizes time-to-first token (TTFT) in the prefill-phase and maximizes token generation (tokens per second, TPS) in the decode phase.

Ryzen AI 1.4 also provides support for an OGA-based flow where the iGPU is not solicited and where the compute-intensive operations are exclusively offloaded to the NPU. Both methods offer a comprehensive and simplified approach to building a Ryzen AI model from a trained foundational or finetuned model, giving end users a powerful and accessible way to quickly deploy their model for real-world AI use cases on AI PCs. List of models which can be deployed using NPU1 and Hybrid Mode2.

New Ease of Use Open-Source Developer Tools: Digest AI, Lemonade, GAIA and TurnkeyML

AMD is excited to introduce various open-source tools to help developers run LLMs on AMD AI PCs.

Digest AI is a powerful and user-friendly tool that can significantly enhance your machine learning development. By providing detailed model analysis, multi-model support, and integration with the Hugging Face hub, Digest allows users to analyze individual models to extract parameters, FLOPs, meta parameters, FLOPs, meta data, IO tensors and more. It also allows you to compare high level statistics of multiple model's side by side and lets you save all extracted statistics into a comprehensive report for easy sharing and collaboration

Lemonade SDK provides a set of high-level APIs and tools that make it easy to experiment with LLMs using the hybrid execution mode, which uses the integrated GPU and the NPU as well as in NPU-only mode. This allows you to try out different models, prompt them, and benchmark them to measure inference speed, time to first token, and tokens per second. You can also measure task performance with accuracy tests like MMLU. It is a multi-vendor, multi-backend SDK that supports Runtime Gen AI, Llama.cpp and Hugging Face Transformers, allowing you to compare across different backends. Additionally, it provides support for a CPU backend that will work on other AMD laptops

GAIA is an open-source solution designed for the quick setup and execution of generative AI applications on local PC hardware. It supports basic chat, RAG enhanced applications and specialized agents as well as used the AMD NPU and iGPU for hybrid acceleration. GAIA also provides both a command-line interface and a graphical user interface option for easy interaction with models and agents or allows you to easily build and integrate your own agents and use-cases.

AMD, in collaboration with the ONNX community, is excited to introduce TurnkeyML, an open-source toolchain designed to improve the way we handle AI models for inferencing. TurnkeyML is a comprehensive framework that streamlines the process of ingesting any open-source PyTorch model, optimizing them, and executing them across a diverse set of hardware targets.

Summary

We are excited to announce continuous improvements to Ryzen AI software and encourage developers to provide feedback on how these updates enhance their experience. For a comprehensive overview of the new features and improvements in the 1.4 software release, check out the official release notes.

Stay at the forefront of AI innovation by signing up for the latest Ryzen AI software updates to ensure you have access to the tools and resources that will help you push the boundaries of what’s possible on AI PCs.

Additional resources

New Ryzen AI Video Tutorials https://www.youtube.com/playlist?list=PLYw1WVX5aNHABNAfottruTY8oX2eFlzmz&utm_source=blog&utm_medium=referral&utm_campaign=rai-1.4&utm_content=release-blog
AMD Ryzen AI Developer Hub: https://www.amd.com/en/developer/resources/ryzen-ai-software.html
Ryzen AI 1.4 Release Notes: https://ryzenai.docs.amd.com/en/latest/
NPU Model collection: https://huggingface.co/collections/amd/ryzenai-14-llm-npu-models-67da3494ec327bd3aa3c83d7
Hybrid Model collection: https://huggingface.co/collections/amd/ryzenai-14-llm-hybrid-models-67da31231bba0f733750a99c

Download Ryzen AI Software

Article By

Anshul Gupta

Data Center

Business Systems

Personal & Gaming

Embedded

Resources

GPU Accelerators

Adaptive Accelerators

DPU Accelerators

Ethernet Adapters

Workstations

Desktops

Laptops

Resources

Adaptive SoCs & FPGAs

System-on-Modules (SOMs)

Technologies

Resources

Evaluation Boards & Kits

Processor Tools

Graphics Tools & Apps

Adaptive SoC & FPGA Tools

Intellectual Property & Apps

GPU Accelerator Tools & Apps

Overview

For Data Center & Cloud

For Edge & Endpoints

For Developers

Industries

Industries

Industries

Industries

Industries

Workloads

Gaming

Systems

Technologies

Resources

EPYC Processors

Radeon Graphics & AMD Chipsets

Adaptive SoCs & FPGAs

Alveo Accelerators & Kria SOMs

Ryzen Processors

Ethernet Adapters

Overview

Processors

Accelerators

Adaptive SoCs, FPGAs, & SOMs

Graphics

Overview

Resources by Market Segment

Resources by Product

Resources by Type

About Our Partners

AMD Global Support

Processors & Graphics

Accelerators

Adaptive SoCs & FPGAs

Gaming & Personal Computing

Adaptive & Embedded Computing

Get AMD Fan Gear

Shop Our Retail Partners

AMD Ryzen™ AI Software 1.4: Features for Next Gen AI PCs

Introduction

Unified Support for LLMs, NLP and CNN Models in a Single Release Package

Quantization and Compilation for BF16 CNN and NLP Models on Windows

Public Release of LLM Hybrid ONNX Runtime Gen AI (OGA) Flow

New Ease of Use Open-Source Developer Tools: Digest AI, Lemonade, GAIA and TurnkeyML

Summary

Additional resources

Article By

Company

News & Events

Community

Partners

Investors