Skip to content

Support for newer CUDA capability (e.g. sm_86) #359

Closed
@MMelQin

Description

@MMelQin

Discussed in #358

Originally posted by Leengit September 28, 2022
I successfully build a docker image with monai-deploy package that runs on the computer on which I built it. However when I try to run the same docker image on a computer with a significantly newer / more powerful GPU, it fails. It appears that the underlying docker image nvcr.io/nvidia/pytorch:21.07-py3 uses a version of CUDA 11.3 and torch that do not support sm_86. Upgrading to torch==1.12.1 within the docker image that I create (and committing the change to a new image that I then use) does not help. Despite my attempts with apt-get, I have been unable to install a newer version of CUDA within the created docker image.

Your help with getting support for an NVIDIA RTX A5000 would be much appreciated! The error from running the docker image that I created with monai deploy includes

NVIDIA RTX A5000 with CUDA capability sm_86 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_37 sm_50 sm_60 sm_70.
If you want to use the NVIDIA RTX A5000 GPU with PyTorch, please check the instructions at https://pytorch.org/get-started/locally/
...
  File "~/venv/lung/lib/python3.8/site-packages/torch/nn/modules/conv.py", line 453, in _conv_forward
                            weight, bias, self.stride,
                            _pair(0), self.dilation, self.groups)
        return F.conv2d(input, weight, bias, self.stride,
               ~~~~~~~~ <--- HERE   
                        self.padding, self.dilation, self.groups)
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
```</div>

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions