AMD Research - Publications

RAD – Publications

AMD Research & Development (RAD) highly values the publication of key scientific research findings in peer-reviewed conferences and journals.

This page links to RAD’s many publications from the last few years.

2024

AI-Based Approaches in Network Security – AI4Good 2024
T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute and Collectives – ASPLOS 2024
Integrating FPGA and GPU Acceleration to OpenMP Distributed Computing – FPL 2024
Turn-based Spatiotemporal Coherence for GPUs – HiPEAC 2024
Networking Technologies for Handling AI Workloads – ISC 2024
Sustainable Computing at Scale – MODSIM 2024

2023

Spectrum Usage and Occupancy Monitoring: Challenges and Software-Defined Radio Solutions – IEEE WCNC 2023
Improving DNN Throughput Via Intelligent Concurrent GEMM Executions – arXiv 2023
The Next Era for Chiplet Innovation – DATE 2023
Leveraging MLIR to Design for AI Engines – FCCM 2023
Reducing Internode Communication Using FPGA-Accelerated Neural Network Surrogate Models – FIRE 2023
Navigating the Future Landscape of System-On-Chip Technology – IEEE SOCC 2023
Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware – IISWC 2023
SPARTA: Spatial Acceleration for Efficient and Scalable Horizontal Diffusion Weather Stencil Computation – ICS 2023
Introduction to the AMD Versal ACAP Adaptable Intelligent Engine and to its Programming Model – SC 2023
Innovative Approaches to AI with Adaptive Computing – SPL 2023

2022

Demystifying BERT: System Design Implications – IISWC 2022
A Case for Fine-grain Coherence Specialization in Heterogeneous Systems – TACO
Virtual Coset Coding for Encrypted Non-Volatile Memories with Multi-Level Cells – HPCA 2022
Data Convection: A GPU-Driven Case Study for Thermal-Aware Data Placement in 3D DRAMs – SIGMETRICS 2022
Cloak: Tolerating Non-Volatile Cache Read Latency – ICS 2022
Uncertainty Quantification Methods for ML-based Surrogate Models of Scientific Applications – NeurIPS 2022
Eager Memory Cryptography in Caches – MICRO 2022
Athena: An Early-Fetch Architecture To Reduce On-Chip Page Walk Latencies – PACT 2022
Improving Energy Efficiency of Permissioned Blockchains Using FPGAs – ICPADS 2022

2021

Analyzing and Leveraging Decoupled L1 Caches in GPUs - HPCA 2021
Deadline-Aware Offloading for High-Throughput Accelerators - HPCA 2021
Understanding Chiplets Today to Anticipate Future Integration Opportunities and Limits - DATE 2021
Systems-on-Chip with Strong Ordering - TACO
Pioneering Chiplet Technology and Design for AMD EPYC™ and Ryzen™ Processor Families - ISCA 2021 (Industry Track)
Quantifying Server Memory Frequency Margin and Using it to Improve Performance in HPC Systems - ISCA 2021
Interconnect Modeling for Homogeneous and Heterogeneous Multiprocessors - Springer (Book Chapter)
Increasing GPU Translation Reach by Leveraging Under-Utilized On-Chip Resources - MICRO 2021
DUB: Dynamic Underclocking and Bypassing in Network-on-Chip for Heterogeneous GPU Workloads - NOCS 2021
A New Era of Tailored Computing (short paper) - VLSI Symposium 2021
Efficient Cache Utilization via Model-aware Data Placement for Recommendation Models - MEMSYS 2021
Using neural networks to reduce communication in numerical solution of partial differential equations - NeurIPS 2021
Using physics-informed regularization to improve extrapolation capabilities of neural networks - NeurIPS 2021

2020

Kite: A Family of Heterogeneous Interposer Topologies Enabled via Accurate Interconnect Modeling – DAC 2020
SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks – ISPASS 2020
Improving the Utilization of Micro-operation Caches in x86 Processors – MICRO 2020
Analyzing and Leveraging Shared L1 Caches in GPUs – PACT 2020
PreFAM: Understanding the Impact of Prefetching in Fabric-Attached Memory Architectures – MEMSYS 2020
CFDNet: a deep learning-based accelerator for fluid simulations – ICS 2020
Optimization of Intercache Traffic Entanglement in Tagless Caches With Tiling Opportunities – TCAD 2020
Optimization of Intercache Traffic Entanglement in Tagless Caches With Tiling Opportunities – CASES 2020
Independent Forward Progress of Work-groups – ISCA 2020
Experiences with ML-Driven Design: A NoC Case Study – HPCA 2020
GPU Initiated OpenSHMEM: Correct and Efficient Intra-Kernel Networking for dGPUs – PPoPP 2020
Centaur: A Novel Architecture for Reliable, Low-Wear, High-Density 3D NAND Storage – SIGMETRICS 2020
DSM: A Case for Hardware-Assisted Merging of DRAM Rows with Same Content – SIGMETRICS 2020
