-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bump CUDA from 11.2 to 11.8 #505
Conversation
Update Pytorch, Torchvision and Tensorflow to use CUDA 11.8 builds. Also bumped tensorflow from 2.9.1 to 2.14.0 because lower versions <2.13.1 only has CUDA 11.2 on conda-forge.
/condalock |
/condalock |
@@ -7,6 +7,6 @@ channels: | |||
dependencies: | |||
- jupyterlab-nvdashboard | |||
- gpytorch | |||
- pytorch>=2.0.0=*cuda112* | |||
- torchvision>=0.15.1=*cuda112* | |||
- pytorch>=2.0.0=*cuda118* |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Apparently we've been on 11.8 for some time, so perhaps this * syntax isn't really pinning anything?
pangeo-docker-images/pytorch-notebook/packages.txt
Lines 114 to 115 in d2651ff
cuda-version==11.8 | |
cudatoolkit==11.8.0 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's cudatoolkit. CUDA 11.2 and 11.8 does have this forward/backward compatibility thing, but if you look at the conda-lock.yml file, the Pytorch build is actually the CUDA 11.2 one:
url: https://conda.anaconda.org/conda-forge/linux-64/pytorch-2.1.0-cuda112py311ha0492fd_300.conda |
Compared to Pytorch compiled with CUDA 11.2, the one compiled with CUDA 11.8 enables compute capability 8.9, as seen at https://github.com/conda-forge/pytorch-cpu-feedstock/blob/7c7a57b7515eaeda67d3879b56b68466f38f0b0d/recipe/build_pytorch.sh#L144-L153.
looks like tensorflow image size dropped a bit, but pytorch keeps inflating :) would be nice to get these below 10GB if possible one of these days
|
Yes, things should get smaller! This is because conda-forge has removed the need for a large |
The CUDA 11.8 migration across conda-forge is practically complete (see https://conda-forge.org/status/#cuda118), so we can start updating to a newer version of CUDA!
This should add support for the new NVIDIA Ada Lovelace and Hopper generation GPUs that requires compute capability 8.9 or 9.0 (see https://docs.nvidia.com/cuda/archive/11.8.0/hopper-compatibility-guide/index.html#verifying-hopper-compatibility-using-cuda-11-8).
Note:
Changes in this PR:
References: