RAD – Publications
AMD Research & Development (RAD) highly values the publication of key scientific research findings in peer-reviewed conferences and journals.
This page links to RAD’s publications from the last few years.
2024
- AI-Based Approaches in Network Security – AI4Good 2024
- T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute and Collectives – ASPLOS 2024
- Integrating FPGA and GPU Acceleration to OpenMP Distributed Computing – FPL 2024
- Turn-based Spatiotemporal Coherence for GPUs – HiPEAC 2024
- Networking Technologies for Handling AI Workloads – ISC 2024
- Sustainable Computing at Scale – MODSIM 2024
2023
- Spectrum Usage and Occupancy Monitoring: Challenges and Software-Defined Radio Solutions – IEEE WCNC 2023
- Improving DNN Throughput Via Intelligent Concurrent GEMM Executions – arXiv 2023
- The Next Era for Chiplet Innovation – DATE 2023
- Leveraging MLIR to Design for AI Engines – FCCM 2023
- Reducing Internode Communication Using FPGA-Accelerated Neural Network Surrogate Models – FIRE 2023
- Navigating the Future Landscape of System-On-Chip Technology – IEEE SOCC 2023
- Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware – IISWC 2023
- SPARTA: Spatial Acceleration for Efficient and Scalable Horizontal Diffusion Weather Stencil Computation – ICS 2023
- Introduction to the AMD Versal ACAP Adaptable Intelligent Engine and to its Programming Model – SC 2023
- Innovative Approaches to AI with Adaptive Computing – SPL 2023
2022
- Demystifying BERT: System Design Implications – IISWC 2022
- A Case for Fine-grain Coherence Specialization in Heterogeneous Systems – TACO
- Virtual Coset Coding for Encrypted Non-Volatile Memories with Multi-Level Cells – HPCA 2022
- Data Convection: A GPU-Driven Case Study for Thermal-Aware Data Placement in 3D DRAMs – SIGMETRICS 2022
- Cloak: Tolerating Non-Volatile Cache Read Latency – ICS 2022
- Uncertainty Quantification Methods for ML-based Surrogate Models of Scientific Applications – NeurIPS 2022
- Eager Memory Cryptography in Caches – MICRO 2022
- Athena: An Early-Fetch Architecture To Reduce On-Chip Page Walk Latencies – PACT 2022
- Improving Energy Efficiency of Permissioned Blockchains Using FPGAs – ICPADS 2022
2021
- Analyzing and Leveraging Decoupled L1 Caches in GPUs - HPCA 2021
- Deadline-Aware Offloading for High-Throughput Accelerators - HPCA 2021
- Understanding Chiplets Today to Anticipate Future Integration Opportunities and Limits - DATE 2021
- Systems-on-Chip with Strong Ordering - TACO
- Pioneering Chiplet Technology and Design for AMD EPYC™ and Ryzen™ Processor Families - ISCA 2021 (Industry Track)
- Quantifying Server Memory Frequency Margin and Using it to Improve Performance in HPC Systems - ISCA 2021
- Interconnect Modeling for Homogeneous and Heterogeneous Multiprocessors - Springer (Book Chapter)
- Increasing GPU Translation Reach by Leveraging Under-Utilized On-Chip Resources - MICRO 2021
- DUB: Dynamic Underclocking and Bypassing in Network-on-Chip for Heterogeneous GPU Workloads - NOCS 2021
- A New Era of Tailored Computing (short paper) - VLSI Symposium 2021
- Efficient Cache Utilization via Model-aware Data Placement for Recommendation Models - MEMSYS 2021
- Using neural networks to reduce communication in numerical solution of partial differential equations - NeurIPS 2021
- Using physics-informed regularization to improve extrapolation capabilities of neural networks - NeurIPS 2021
2020
- Kite: A Family of Heterogeneous Interposer Topologies Enabled via Accurate Interconnect Modeling – DAC 2020
- SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks – ISPASS 2020
- Improving the Utilization of Micro-operation Caches in x86 Processors – MICRO 2020
- Centaur: A Novel Architecture for Reliable, Low-Wear, High-Density 3D NAND Storage – SIGMETRICS 2020
- Analyzing and Leveraging Shared L1 Caches in GPUs – PACT 2020
- PreFAM: Understanding the Impact of Prefetching in Fabric-Attached Memory Architectures – MEMSYS 2020
- CFDNet: a deep learning-based accelerator for fluid simulations – ICS 2020
- Optimization of Intercache Traffic Entanglement in Tagless Caches With Tiling Opportunities – TCAD 2020
- Optimization of Intercache Traffic Entanglement in Tagless Caches With Tiling Opportunities – CASES 2020
- Independent Forward Progress of Work-groups – ISCA 2020
- Experiences with ML-Driven Design: A NoC Case Study – HPCA 2020
- GPU Initiated OpenSHMEM: Correct and Efficient Intra-Kernel Networking for dGPUs – PPoPP 2020
- DSM: A Case for Hardware-Assisted Merging of DRAM Rows with Same Content – SIGMETRICS 2020