Improve cuda gencode flags #321

agirault · 2025-03-31T17:21:42Z

Context

The current cuda SASS/PTX list is hardcoded manually based on a versioning heuristic that is error-prone. Case in point:

blackwell sm 10.1 is missing

blackwell sm 10.0 is supported after 12.8, not 12.6.

make[1]: Entering directory '***/gdrcopy/tests'
/usr/local/cuda/bin/nvcc -o pplat.o -c pplat.cu -lcuda -lpthread -ldl -lgdrapi -I /usr/local/cuda/include -I ../include -I ../src -I /usr/local/cuda/include  -gencode arch=compute_60,code=compute_60 -gencode arch=compute_61,code=compute_61 -gencode arch=compute_62,code=compute_62 -gencode arch=compute_70,code=compute_70 -gencode arch=compute_72,code=compute_72 -gencode arch=compute_75,code=compute_75 -gencode arch=compute_80,code=compute_80 -gencode arch=compute_86,code=compute_86 -gencode arch=compute_87,code=compute_87 -gencode arch=compute_90,code=compute_90 -gencode arch=compute_100,code=compute_100 -gencode arch=compute_60,code=sm_60 -gencode arch=compute_61,code=sm_61 -gencode arch=compute_62,code=sm_62 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_72,code=sm_72 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_87,code=sm_87 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_100,code=sm_100
nvcc fatal   : Unsupported gpu architecture 'compute_100'
make[1]: *** [Makefile:54: pplat.o] Error 1

full arch list are used for code (SASS) and compute (PTX). For PTX, only latest is needed.

Changes

Use sm list from nvcc --list-gpu-code directly when available
Fix blackwell sm list and version compatibility
Consolidate compute & sm list in a single variable
Only build PTX for last supported arch

Signed-off-by: Alexis Girault <agirault@nvidia.com>

- 10.0 is from CTK 12.8+ - 10.1 was missing Signed-off-by: Alexis Girault <agirault@nvidia.com>

The list was the same Signed-off-by: Alexis Girault <agirault@nvidia.com>

Signed-off-by: Alexis Girault <agirault@nvidia.com>

drossetti · 2025-04-03T20:29:35Z

scripts/get_cuda_gencode.sh

-    COMPUTE_LIST="$COMPUTE_LIST 120"
-    SM_LIST="$SM_LIST 120"
+    # Add Blackwell (10.0, 10.1, 12.0) if CUDA >= 12.8
+    if [ "$CUDA_VERSION_MAJOR" -ge 12 ] && [ "$CUDA_VERSION_MINOR" -ge 8 ]; then


does this check work for CUDA 13.0?

same as above

drossetti · 2025-04-03T20:30:37Z

scripts/get_cuda_gencode.sh

+    fi
+
+    # Add Ada Lovelace (8.9) if CUDA >= 11.8
+    if [ "$CUDA_VERSION_MAJOR" -ge 11 ] && [ "$CUDA_VERSION_MINOR" -ge 8 ]; then


does this work on CUDA 12.0 ?

I doubt it would. wasn't in the scope of this MR but I can edit these checks to check major first

pakmarkthub · 2025-05-30T20:12:48Z

@agirault The fix has been merged to R2.5 branch (https://github.com/NVIDIA/gdrcopy/tree/R2.5)

agirault added 4 commits March 31, 2025 12:58

build: use sm list from nvcc

e6ea6db

Signed-off-by: Alexis Girault <agirault@nvidia.com>

build: correct blackwell sm use

72fa69c

- 10.0 is from CTK 12.8+ - 10.1 was missing Signed-off-by: Alexis Girault <agirault@nvidia.com>

build: consolidate arch lists for ptx and sass

c521788

The list was the same Signed-off-by: Alexis Girault <agirault@nvidia.com>

build: only build ptx for last arch

d1fde74

Signed-off-by: Alexis Girault <agirault@nvidia.com>

drossetti reviewed Apr 3, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve cuda gencode flags #321

Improve cuda gencode flags #321

agirault commented Mar 31, 2025 •

edited

Loading

Uh oh!

drossetti Apr 3, 2025

Uh oh!

agirault May 2, 2025

Uh oh!

drossetti Apr 3, 2025

Uh oh!

agirault May 2, 2025

Uh oh!

pakmarkthub commented May 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Improve cuda gencode flags #321

Are you sure you want to change the base?

Improve cuda gencode flags #321

Conversation

agirault commented Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Changes

Uh oh!

drossetti Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

agirault May 2, 2025

Choose a reason for hiding this comment

Uh oh!

drossetti Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

agirault May 2, 2025

Choose a reason for hiding this comment

Uh oh!

pakmarkthub commented May 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

agirault commented Mar 31, 2025 •

edited

Loading