-
Notifications
You must be signed in to change notification settings - Fork 180
Improve cuda gencode flags #321
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
Signed-off-by: Alexis Girault <agirault@nvidia.com>
- 10.0 is from CTK 12.8+ - 10.1 was missing Signed-off-by: Alexis Girault <agirault@nvidia.com>
The list was the same Signed-off-by: Alexis Girault <agirault@nvidia.com>
Signed-off-by: Alexis Girault <agirault@nvidia.com>
| COMPUTE_LIST="$COMPUTE_LIST 120" | ||
| SM_LIST="$SM_LIST 120" | ||
| # Add Blackwell (10.0, 10.1, 12.0) if CUDA >= 12.8 | ||
| if [ "$CUDA_VERSION_MAJOR" -ge 12 ] && [ "$CUDA_VERSION_MINOR" -ge 8 ]; then |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
does this check work for CUDA 13.0?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
same as above
| fi | ||
|
|
||
| # Add Ada Lovelace (8.9) if CUDA >= 11.8 | ||
| if [ "$CUDA_VERSION_MAJOR" -ge 11 ] && [ "$CUDA_VERSION_MINOR" -ge 8 ]; then |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
does this work on CUDA 12.0 ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I doubt it would. wasn't in the scope of this MR but I can edit these checks to check major first
|
@agirault The fix has been merged to R2.5 branch (https://github.com/NVIDIA/gdrcopy/tree/R2.5) |
Context
Changes
nvcc --list-gpu-codedirectly when available