AMD Instinct™ MI350 Series GPUs

What's New

Preview Our Newest AMD Instinct MI350 Series GPU Offering

AMD Instinct™ MI350P PCIe® card enables the AI performance you need to deploy and scale generative and agentic AI within your existing infrastructure.

Leadership AI & HPC Acceleration

AMD Instinct™ MI350 Series GPUs set a new standard for Generative AI and high performance computing (HPC) in data centers. Built on the new cutting-edge 4th Gen AMD CDNA™ architecture, these GPUs deliver exceptional efficiency and performance for training massive AI models, high-speed inference, and complex HPC workloads like scientific simulations, data processing, and computational modeling.

Under the Hood

The Ultimate AI and HPC Performance

Built on the cutting-edge 4th Gen AMD CDNA™ architecture, AMD Instinct™ MI350 Series GPUs feature powerful and energy-efficient cores, maximizing performance per watt to drive the next era of AI and HPC innovation.

Learn More

Benefits

Platforms Built for Any Enterprise Scale AI
Breakthrough AI Acceleration With Huge Memory
Advanced Security for AI & HPC
Seamless Deployment & AI Optimization
Trusted by AI Leaders

Platforms Built for Any Enterprise Scale AI

AMD Instinct MI350P PCIe cards offer simple deployment for mainstream enterprise, while AMD Instinct MI350X and MI355X platforms deliver efficient performance for large scale training and inference enterprise workloads, all supported by a unified AMD enterprise AI software stack.

Breakthrough AI Acceleration With Huge Memory

The AMD Instinct™ MI350 Series GPUs redefine AI acceleration with next-gen MXFP6 and MXFP4 datatype support, optimizing efficiency, bandwidth, and energy use for lightning-fast AI inference and training.

Designed to fuel performance of the most demanding AI models, Instinct MI350 GPUs boast a massive 288GB of HBM3E memory and 8TB/s bandwidth, delivering a huge leap in performance over previous generations.¹

Advanced Security for AI & HPC

AMD Instinct™ MI350 Series GPUs help ensure trusted firmware, verify hardware integrity, enable secure multi-tenant GPU sharing, and encrypt GPU communication—helping enhance reliability, scalability, and data security for cloud AI and mission-critical workloads.

Seamless Deployment & AI Optimization

AMD Instinct™ MI350 Series GPUs help enable frictionless adoption with drop-in compatibility, while the AMD GPU Operator simplifies deployment and workload configuration in Kubernetes. Powered by the open AMD ROCm™ software stack, developers get Day 0 support for leading AI frameworks and models from OpenAI, Meta, PyTorch, Hugging Face, and more— helping ensure efficient, high-performance execution without vendor lock-in.

Trusted by AI Leaders

Industry leaders and innovators trust AMD Instinct™ GPUs for large-scale AI, powering models like Llama 405B and GPT. Broad AMD Instinct GPU adoption by CSPs & OEMs are helping to drive next-gen AI at scale.

Meet the Series

Explore AMD Instinct MI350 Series GPUs and AMD Instinct MI350 Series Platforms.

AMD Instinct™ MI350P PCIe® Cards

Modernize your existing infrastructure with AMD Instinct MI350P PCIe® cards, offering leadership costs with an open, enterprise-ready AI stack, simplifying deployment.

Read the Listicle

View MI350P Specs

Enterprise AI, Ready Where You Are

AMD Instinct™ MI350P PCIe® card enables the AI performance you need to deploy and scale generative and agentic AI within your existing infrastructure. Join our ecosystem partners to experience exceptional performance, leadership costs, and simplified deployment. Scale enterprise AI without reinventing your datacenter when you choose AMD Instinct MI350P.

AI Performance That Fits Your Rack

Deploy and scale generative AI and agentic AI workloads within your current rack and power-and-cooling envelope, while maximizing throughput.

More Performance. Fewer Dollars.

Achieve leadership OPEX versus the NVIDIA H200 NVL card through lower precision MXFP6 and MXFP4 data types, more HBM3E memory more raw performance and an open low and no-cost software ecosystem for generative and agentic AI workloads.

Develop With Your AI Stack. Your Way, Today.

Simplify your AI deployment and migrate with ease using the AMD enterprise AI software including AMD Inference Microservices (AIMs), a no licensing fee, open, standards-based enterprise-ready AI stack.

128 CUs

128 GPU Compute Units

144 GB

144 GB HBM3E Memory

4 TB/s

Up to 4 TB/s Peak Theoretical Memory Bandwidth

Hear From Our OEM Partners

AMD Instinct™ MI350 Series GPUs

Built on 4th Gen AMD CDNA™ architecture, AMD Instinct™ MI350 Series GPUs deliver exceptional AI inference, training, and HPC workload performance with massive 288GB HBM3E memory, 8TB/s bandwidth, and expanded datatype support including MXFP6, MXFP4.

View MI355X Specs

View MI350X Specs

Up to

256 CUs

256 GPU Compute Units

Up to

288 GB

288 GB HBM3E Memory

Up to

8 TB/s

8 TB/s Peak Theoretical Memory Bandwidth

AMD Instinct MI355X GPU Specs Comparisons

AI Performance
HPC Performance
Memory

AI Performance (Peak PFLOPs)

Up to 2.2X the AI performance vs. competitive accelerators²

FP16/BF16 Tensor / FP16/BF16 Matrix
(Sparsity)

4.5

5.0

B200 SXM5

MI355X OAM

FP8 Tensor / OCP-FP8 Matrix
(Sparsity)

10.1

B200 SXM5

MI355X OAM

FP6 Tensor / MXFP6 Matrix

4.5

10.1

B200 SXM5

MI355X OAM

B200 SXM5 180GB

MI355X OAM 288GB

HPC Performance (Peak TFLOPs)

Up to 2.1X the HPC performance vs. competitive accelerators³

FP64 (Vector)

78.6

B200 SXM5

MI355X OAM

FP64 (Tensor / Matrix)

78.6

B200 SXM5

MI355X OAM

FP32 (Vector)

157.3

B200 SXM5

MI355X OAM

B200 SXM5 180GB

MI355X OAM 288GB

Memory Capacity & Bandwidth

1.6X Memory Capacity vs. competitive accelerators¹

Memory Capacity

180 GB

288 GB

B200 SXM5

MI355X OAM

Memory Bandwidth

7.7 TB/s

8.0 TB/s

B200 SXM5

MI355X OAM

B200 SXM5 180GB

MI355X OAM 288GB

AMD Instinct MI350 Series Platforms

The AMD Instinct MI350 Series Platforms integrate 8 fully connected MI355X or MI350X GPU OAM modules onto an industry standard OCP design via 4th Gen AMD Infinity Fabric™ technology, with an industry leadership 2.3TB HBM3E memory capacity for high throughput AI processing. These ready-to-deploy platforms now offer support for a variety of systems, from standard air cooled UBB-based servers to ultra dense Direct Liquid Cooled (DLC) platforms, helping to accelerate time-to-market and reduce development costs when adding AMD Instinct MI350 Series GPUs into existing AI rack and server infrastructures.

View MI355X Platform Specs

View MI350X Platform Specs

8 MI350 Series GPUs

Eight (8) MI355X or MI350X GPU OAM modules

2.3 TB

2.3 TB Total HBM3E Memory

64 TB/s

64 TB/s Peak Theoretical Aggregate Memory Bandwidth

AMD Software

AMD ROCm™ Software

AMD ROCm™ software includes a broad set of programming models, tools, compilers, libraries, and runtimes for AI models and HPC workloads targeting AMD Instinct GPUs.

About ROCm AMD ROCm™ Developer Hub

AMD Enterprise AI Software

The AMD Enterprise AI Suite enables enterprises to go from bare metal compute to production-grade AI in minutes by connecting key open-source AI frameworks and Gen AI models with an enterprise-ready Kubernetes platform, minimizing the time from AI experimentation to large scale production on AMD compute platforms.

Learn More About AMD Enterprise AI Software

Case Studies

View All

Find Solutions

Experience AMD Instinct GPUs in the Cloud

Support your AI, HPC, and software development needs with programs supported by leading cloud service providers.

Learn about Instinct Cloud Options

Abstract data center interior with glowing lights

AMD Instinct GPU Partners and Server Solutions

AMD collaborates with leading Original Equipment Manufacturers (OEMs), and platform designers to offer a robust ecosystem of AMD Instinct GPU-powered solutions.

Find Instinct Solutions

Resources

Blogs

Read the latest blogs on AMD Instinct GPUs.

ROCm Blogs Read Blogs

Technical Documentation

Read the latest AMD Instinct and ROCm documentation.

AMD Instinct Docs ROCm Docs

AMD Doc Hub

Find solution briefs, white papers, programmer references, and more documentation.

Find Docs

Stay Informed

Footnotes

Calculations conducted by AMD Performance Labs as of May 22nd, 2025, based on current specifications and /or estimation. The AMD Instinct™ MI355X OAM accelerators have 288GB HBM3E memory capacity and 8 TB/s GPU peak theoretical memory bandwidth performance. The highest published results on the NVidia Hopper H200 (141GB) SXM GPU accelerator resulted in 141GB HBM3E memory capacity and 4.8 TB/s GPU memory bandwidth performance. https://nvdam.widen.net/s/nb5zzzsjdf/hpc-datasheet-sc23-h200-datasheet-3002446 The highest published results on the NVidia Blackwell HGX B200 (180GB) GPU accelerator resulted in 180GB HBM3E memory capacity and 7.7 TB/s GPU memory bandwidth performance. https://nvdam.widen.net/s/wwnsxrhm2w/blackwell-datasheet-3384703 The highest published results on the NVidia Grace Blackwell GB200 (186GB) GPU accelerator resulted in 186GB HBM3E memory capacity and 8 TB/s GPU memory bandwidth performance. https://nvdam.widen.net/s/wwnsxrhm2w/blackwell-datasheet-3384703 MI350-001
Based on calculations by AMD Performance Labs in May 2025, to determine the peak theoretical precision performance for the AMD Instinct™ MI350X / MI355X GPUs, when comparing FP64, FP32, FP16, OCP-FP8, FP8, MXFP6, FP6, MXFP4, FP4, INT8, and bfloat16 datatypes with Vector, Matrix, or Tensor with Sparsity as applicable, vs. NVIDIA Blackwell B200 accelerator. Server manufacturers may vary configurations, yielding different results. MI350-009A
Based on calculations by AMD Performance Labs in May 2025, to determine the peak theoretical precision performance for the AMD Instinct™ MI350X / MI355X GPUs, when comparing FP64 and FP32 with Vector, Matrix or Tensor as applicable, vs. NVIDIA Blackwell B200 accelerator. Results may vary based on server configuration, datatype, and workload. Performance may vary based on use of latest drivers and optimizations. MI350-019
Based on calculations by AMD as of April 17, 2025, using the published memory specifications of the AMD Instinct MI350X / MI355X GPUs (288GB) vs MI300X (192GB) vs MI325X (256GB). Calculations performed with FP16 precision datatype at (2) bytes per parameter, to determine the minimum number of GPUs (based on memory size) required to run the following LLMs: OPT (130B parameters), GPT-3 (175B parameters), BLOOM (176B parameters), Gopher (280B parameters), PaLM 1 (340B parameters), Generic LM (420B, 500B, 520B, 1.047T parameters), Megatron-LM (530B parameters), LLaMA ( 405B parameters) and Samba (1T parameters). Results based on GPU memory size versus memory required by the model at defined parameters, plus 10% overhead. Server manufacturers may vary configurations, yielding different results. Results may vary based on GPU memory configuration, LLM size, and potential variance in GPU memory access or the server operating environment. *All data based on FP16 datatype. For FP8 = X2. For FP4 = X4. MI350-012
Based on calculations by AMD Performance Labs in May 2025, for the 8 GPU AMD Instinct™ MI350X / MI355X Platforms to determine the peak theoretical precision performance when comparing FP64, FP32, FP16, OCP-FP8, FP8, MXFP6, FP6, MXFP4, FP4, and INT8 datatypes with Matrix, Tensor, Vector and Sparsity, as applicable vs. NVIDIA HGX Blackwell B200 accelerator platform. Results may vary based on configuration, datatype, and workload. MI350-010A

Server CPUs

Business Systems

Personal & Gaming

Embedded

Resources

GPU Accelerators

Adaptive Accelerators

DPU Accelerators

Ethernet Adapters

Workstations

Desktops

Laptops

Resources

Adaptive SoCs & FPGAs

System-on-Modules (SOMs)

Technologies

Resources

Evaluation Boards & Kits

Processor Tools

Graphics Tools & Apps

Adaptive SoC & FPGA Tools

Intellectual Property & Apps

GPU Accelerator Tools & Apps

Ethernet Adapter Tools

Overview

For Data Center & Cloud

For Edge & Endpoints

For Developers

Industries

Industries

Industries

Industries

Industries

Workloads

Gaming

Systems

Technologies

Resources

EPYC Processors

Radeon Graphics & AMD Chipsets

Adaptive SoCs & FPGAs

Alveo Accelerators & Kria SOMs

Ryzen Processors

Ethernet Adapters

Overview

Processors

Accelerators

Embedded Products

Graphics

Overview

Resources by Product

Resources by Type

About Our Partners

AMD Global Support

Processors & Graphics

Accelerators

Adaptive SoCs & FPGAs

Gaming & Personal Computing

Adaptive & Embedded Computing

Get AMD Fan Gear

Shop Our Retail Partners

AMD Instinct™ MI350 Series GPUs

Preview Our Newest AMD Instinct MI350 Series GPU Offering

Leadership AI & HPC Acceleration

Under the Hood

Benefits

Platforms Built for Any Enterprise Scale AI

Breakthrough AI Acceleration With Huge Memory

Advanced Security for AI & HPC

Seamless Deployment & AI Optimization

Trusted by AI Leaders

Meet the Series

AMD Instinct™ MI350P PCIe® Cards

Enterprise AI, Ready Where You Are

AI Performance That Fits Your Rack

More Performance. Fewer Dollars.

Develop With Your AI Stack. Your Way, Today.

Hear From Our OEM Partners

AMD Instinct™ MI350 Series GPUs

AMD Instinct MI355X GPU Specs Comparisons

FP16/BF16 Tensor / FP16/BF16 Matrix
(Sparsity)

FP8 Tensor / OCP-FP8 Matrix
(Sparsity)