1.3. Requirements for the GPU Accelerator in the Mechanical APDL Application

Your system must meet the following requirements to use the GPU accelerator capability in Mechanical APDL. For information on the most recently tested GPU cards, see the GPU Accelerator Capabilities PDF on the Platform Support section of the Ansys Website.

  • The machine(s) being used for the simulation must contain at least one GPU card.

  • A minimum 16GB of on-card memory is recommended in order to achieve meaningful acceleration in simulations that can use the GPU card.

  • To achieve optimal performance, only GPU cards with significant double precision performance (FP64) are recommended for use with the sparse direct solver and eigensolvers based on the sparse solver (for example, Block Lanczos or subspace). The following cards are recommended:

     GPU Information[a]Recommended Equation Solvers
    Card[b]Release YearFP64 and FP32 Performance (TFLOPS)Memory Capacity (GB)Memory Bandwidth (GB/sec)Sparse (Direct)Iterative (PCG, etc.)Mixed
    Data Center Cards
    AMD Instinct MI355X[c]202578.6/157.3288 GB8,000 GB/sYYY
    AMD Instinct MI325X202481.7/163.4256 GB6,000 GB/sYYY
    AMD Instinct MI300X202381.7/163.4192 GB5,300 GB/sYYY
    AMD Instinct MI210202222.6/45.364 GB1,600 GB/sYYY
    AMD Instinct MI250, MI250X202145.3/90.5128 GB3,200 GB/sYYY
    AMD Instinct MI250X202147.9/95.7128 GB3,200 GB/sYYY
    NVIDIA B300202575/1.2 270 GB7,700 GB/sYYY
    NVIDIA B200202580/40186 GB7,700 GB/sYYY
    NVIDIA B1002025[d][d][d]YYY
    NVIDIA H200202434/67141 GB4,800 GB/sYYY
    NVIDIA H100202230/6094 GB3,900 GB/sYYY
    NVIDIA L4020221.4/9048 GB864 GB/sNYY
    NVIDIA A3020215.2/10.324 GB933 GB/sYYY
    NVIDIA A10020209.7/19.580 GB1,940 GB/sYYY
    NVIDIA A4020200.58/3748 GB696 GB/sNYY
    Professional Workstation Cards
    AMD Radeon AI PRO R970020251.5/47.832 GB640 GB/sNYY
    AMD Radeon PRO W790020231.9/6148 GB864 GB/sNYY
    AMD Radeon PRO W780020231.9/6148 GB864 GB/sNYY
    NVIDIA RTX PRO 600020251.9/12696 GB1,340 GB/sNYY
    NVIDIA RTX PRO 500020251/66.548 GB1,790 GB/sNYY
    NVIDIA A80020249.7/19.540 GB1,500 GB/sYYY
    NVIDIA RTX 5880 Ada20241.08/6948 GB864 GB/sNYY
    NVIDIA RTX 5000 Ada20231.0/6532 GB576 GB/sNYY
    NVIDIA RTX 4500 Ada20230.6/3920 GB640 GB/sNYY
    NVIDIA RTX 6000 Ada20221.5/9148 GB960 GB/sNYY
    NVIDIA RTX A550020220.5/3424 GB768 GB/sNYY
    NVIDIA A1620210.1/4.516 GB200 GB/sNYY
    NVIDIA A1020211/3124 GB600 GB/sNYY
    NVIDIA RTX A500020210.5/2724 GB768 GB/sNYY
    NVIDIA RTX A450020210.3/2324 GB640 GB/sNYY
    NVIDIA RTX A400020210.3/1916 GB448 GB/sNYY
    NVIDIA RTX A600020200.6/38 48 GB768 GB/sNYY
    Consumer Cards[e]
    NVIDIA GeForce RTX 509020251.6/10432 GB1790 GB/sNYY
    NVIDIA GeForce RTX 508020250.8/5616 GB960 GB/sNYY
    NVIDIA GeForce RTX 5070 Ti20250.7/4416 GB896 GB/sNYY

    [a] The hardware specifications in this table are based on publicly available product datasheets and vendor documentation. Specifications may change or contain inaccuracies. Verify all details with the official manufacturer resources before making decisions.

    [b] GPUs that are unreleased at the time of this software release are not listed in this table. These GPUs may still be compatible with this release, but their performance and functionality have not been validated.

    [c] Hardware specifications for AMD Instinct MI355X are sourced from the official AMD product datasheet. For details, see https://www.amd.com/content/dam/amd/en/documents/instinct-tech-docs/product-briefs/amd-instinct-mi355x-gpu-brochure.pdf.

    [d] The vendor has not declared the specification for this entry.

    [e] Consumer GPUs with 16 GB of memory offer limited performance for large-scale workloads. Their fixed memory capacity restricts acceleration compared to workstation or data center GPUs with higher memory configurations.

  • For NVIDIA GPU cards, note the following:

    1. The Mechanical APDL installation provides the necessary CUDA 13.0.2 libraries to accelerate the FEA solution. No separate installation of CUDA is required.

    2. The GPU driver is not installed by Mechanical APDL. The driver version must be 580.95.05 or newer.

  • For AMD GPU cards, note the following:

    1. The Mechanical APDL installation provides the necessary HIP/ROCm 7.0.2 libraries to accelerate the FEA solution. No separate installation of HIP/ROCm is required. Note that your machine must have at least SUSE Linux Enterprise Server 15 SP5 to use the ROCm libraries.

    2. The GPU driver is not installed by Mechanical APDL. The driver version must be 7.0.2 or newer. Some instructions are given below.

      • the following AMD link lists their supported operating systems (OS).

      • To install the AMD driver, you must perform the three-step procedure summarized below and described in detail with specific Linux commands for different operating systems on the AMD page: How to Install ROCm (amd.com).

      • Download the "amdgpu-install" script to your system using the appropriate commands for your operating system. Note that the version of amdgpu-install must come from the ROCm 7.0.2 page, and you only need to download the installer script (the full ROCm package is not needed).

      • Install the "kernel-mode driver" with the following command:

        sudo amdgpu-install --usecase=dkms
      • Install the “libdrm-amdgpu” library to your operating system using the appropriate commands.

      • Add your user name (LOGNAME) to the list of users in the video group through the following command.

        sudo usermod -a -G video $LOGNAME
  • To utilize a GPU device that is not on the recommended list of cards, set the following environment variable:

    ANSGPU_OVERRIDE=1

    This is most beneficial when you wish to run on newer GPUs that were not available at the time of release of this version of the Ansys program. If you choose to use this environment variable, you should ensure that the GPU device that you wish to use is sufficiently powerful, in terms of both double-precision compute power and on-card memory, to achieve meaningful acceleration for your simulation. Using this environment variable with an underpowered CPU may actually decelerate your simulation.

    Support for AMD GPUs released after a specific version of the Mechanical APDL application is limited due to incompatibility between required software libraries and the GPU hardware. To check which GPU architectures are supported, refer to the AMD compatibility matrix version. You can find the GPU architecture in the device’s specification sheet.

  • If you have issues setting up the GPU drivers, see Troubleshooting in the Parallel Processing Guide.

  • For details on HPC licensing, see HPC Licensing in the Parallel Processing Guide.

  • For details on GPU benchmarks, see Ansys Mechanical Benchmarks.

  • For more information about using GPUs with the Mechanical APDL application, see the following resources on the Ansys website and innovation space: