NEW! AOCL 5.0 is now available, October 10, 2024

AOCL is a set of numerical libraries optimized for AMD processors based on the AMD “Zen” core architecture and generations. Supported processor families are AMD EPYC™, AMD Ryzen™, and AMD Ryzen™ Threadripper™ processors. The tuned implementations of industry-standard math libraries enable rapid development of scientific and high-performance computing applications.

Libraries

What’s new in AOCL 5.0 (October 10, 2024)

  • AOCL-Data Analytics 
    • A new library providing optimized building blocks for data analysis and classical machine learning.
  • AOCL-BLAS
    • Turin specific tuning for the following APIs:
      • D/ZGEMM, DTRSM, and DNRM2
    • AVX512 improvements for the following APIs:
      • ZGEMV, D/ZAXPYF, D/ZDOTXF, ZDOTV, C/ZSCALV, DNRM2, S/D/ZCOPY, S/D/C/ZAXPBYV, DTRSV, DGEMMT, D/ZTRSM, and D/ZGEMM
    • Improvements to AOCL_ENABLE_INSTRUCTIONS functionality.
    • Additional APIs and Post-Ops support in addition to the improved performance for the existing APIs in aocl_gemm add-on.
  • AOCL-Compression
    •  Single-threaded performance improvements for ZLIB and BZIP2.
    • Multi-threaded support for LZ4HC and performance improvements for LZ4, ZSTD, ZLIB, and Snappy.
    • AOCL_ZLIB_QUICK_MODE option for faster compression in ZLIB and AOCL_DECOMPRESS_FAST option for faster decompression in ZSTD and LZ4.
  • AOCL-Cryptography
    •  OpenSSL Provider support and fixes for Cipher, CMAC, Poly1305, and RSA-2048 algorithms.
    • Features added:
      • Chacha20-Poly1305 Cipher
      • RSA PKCS Encrypt/Decrypt, PKCS Sign and Verify, and PSS Sign and Verify
      • SHA3 Shake Squeeze
    •  Performance improvements
      • OAEP Encrypt Decrypt
      • HMAC, CMAC, and Poly1305
      • Chacha20 stream cipher
      • CFB, CBC, and GCM
      • SHA2 and SHA3
  • AOCL-FFTW
    • Support for using the Wisdom feature by default under the –enable-amd-app-opt option
  • AOCL-LAPACK
    • Improvements to SVD, Factorization, Inverse, and Least square Solver APIs.
    • Option to set specific ISA code path at runtime through a new environment variable.
    • Sphinx-based AOCL-LAPACK API documentation.
    • pkgconfig support on linux with cmake builds.
  • AOCL-LibM
    • Added new vector APIs – powx and linearfrac.
    • Added vector variants of sincos, sqrt, and fabs.
    • Added vector array variants of pow, exp, exp2, exp10, expm1, log, log2, log10, log1p, cbrt, cos, and sin.
    • Debuting a beta version of CMAKE build system for AOCL-LibM on Linux
  • AOCL-LibMem
    • Added support for new functions:
      • memchr, strcat, strstr
    • Improved AVX2 performance for smaller sizes.
  • AOCL-RNG
    • AVX2 support for double precision MRG32K3A Generator.
    • Added Sobol as new generator of type QRNG.
    • Dynamic dispatch feature enhancements.
  • AOCL-ScaLAPACK
    • Ability to build ScaLAPACK library without building a test framework.
    • Tracing and Logging feature enabled now for all APIs including single precision.
  • AOCL-Sparse
    • Incorporated new APIs:
      • Level 3: sp2md, spmmd, syrk, syrkd, sypr, syprd
      • Sparse preconditioner: sorv
    • Support for:
      • Symmetric and Hermitian matrices in csrmm API
      • Strided dense vector in TRSM and TRSV
      • Symmetric Gauss Seidel
    • Performance improvements:
      •  Level 1: dot, gthr, sctr, roti
      • Level 2: SpMV for complex general matrices
    • Multi-Threading support
      • Level 3: Sp2M, SpMM, SpAdd
    • Benchmarking framework improvements:
      • Enhanced statistics, support for new APIs , random matrix generation capability (Hermitian, diagonally dominant) APIs to support HPCG
  • AOCL-Utils
    • New APIs to get Instruction Set Architecture (ISA) details
    • New API to check CPU features
    • Thread pinning
    • Support for Linux and Windows platforms
  • AOCL-Enabled MUMPS Library
    • Building for Windows 10 systems: CMake-based build system on AOCL-enabled MUMPS sparse solver library; refer to GitHub MUMPS-build (https://github.com/amd/mumps-build) for more information.
    • Mumps 5.5.1 support with 5.5.0 and 5.4.1 backward compatibility.

 

Resources and Technical Support

Documentation

AOCL User Guide

AOCL Tuning Guide

AOCL Release Notes

For prior versions of AOCL, refer to AOCL Archive.

Support

For support options, refer to Technical Support.

AMD Community

For moderated forums, refer to the AMD community.

Download with End User License Agreement

File Name Version Size Launch Date OS Bitness Description
AOCL 5.0 binary packages compiled with AOCC 5.0
aocl-linux-aocc-5.0.0.tar.gz 5.0 105MB 10/10/2024 RHEL, Ubuntu, SLES 64-bit AOCC compiled AOCL tar file containing all the library binaries. It includes install.sh file that extracts and installs the libraries.
sha256sum: 604ae6c34ef2648504ba08d2f33e42b5a9cf4032eaefaa9253560747ebaeb1ca
aocl-linux-aocc-5.0.0_1_amd64.deb 5.0 72MB 10/10/2024 Ubuntu 64-bit AOCC compiled Debian package
sha256sum: f397a03c92525513b298aba949c9cffcf458ff640e6e7eb0a0c6782827d55168
aocl-linux-aocc-5.0.0-1.x86_64.rpm 5.0 84MB 10/10/2024 RHEL, SLES 64-bit AOCC compiled RPM package
sha256sum: 44127f2df32b9a946832cf73846cdd35c63771c11f0527d44d1e88babac8b054
AOCL 5.0 binary packages compiled with GCC 13.2.1
aocl-linux-gcc-5.0.0.tar.gz 5.0 112MB 10/10/2024 RHEL, Ubuntu, SLES 64-bit GCC compiled AOCL tar file containing all the library binaries. Includes install.sh file that extracts and installs the libraries.
sha256sum: e7e1e1fec01ad2e184d06d27dbe977883c3d93a3ffda72997fb23335975dd595
aocl-linux-gcc-5.0.0_1_amd64.deb 5.0 75MB 10/10/2024 Ubuntu 64-bit GCC compiled Debian package
sha256sum: b20a9560d3905d14a7ae1e8a558f40e9620f8736b08253dcc72c32c514a02468
aocl-linux-gcc-5.0.0-1.x86_64.rpm 5.0 89MB 10/10/2024 RHEL, SLES 64-bit GCC compiled RPM package
sha256sum: 16a33339ceffe5a7dd466ba0e89aad5259082407e3d2af44f4a9bb55c28d2704
Windows
AOCL_Windows-setup-5.0.0.384-AMD.exe 5.0 104MB 10/10/2024 Windows 11, Windows 10 64-bit Windows installer file containing all the AOCL library binaries compiled with Clang 17.
sha256sum: 026405b98e2cf3c529bacdf76eb6e43935b639ed2ab8e90cba22bb992ecf13de