You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am having what looks to me like essentially the same problem. All the diagnostics I have tried (such as those you show above) suggest to me that my cuda installation is correct (although I am far from an expert). But I cannot get cnn examples from the mlverse webpage to run on the gpu. R crashes every time I try to train. On the cpu they work fine.
Hi,
I followed the installation instructions provided here and installed CUDA 11.3 and cuDNN 8.4 as per the support matrix.
Running the following command:
C:\Users\Utente>nvidia-smi
gives the following output:
Mon Sep 23 10:20:22 2024
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 516.01 Driver Version: 516.01 CUDA Version: 11.7 |
|-------------------------------+----------------------+----------------------+
| GPU Name TCC/WDDM | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA T400 WDDM | 00000000:01:00.0 On | N/A |
| 41% 53C P0 N/A / 31W | 277MiB / 2048MiB | 3% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 9544 C+G ...5n1h2txyewy\SearchApp.exe N/A |
| 0 N/A N/A 9616 C+G C:\Windows\explorer.exe N/A |
| 0 N/A N/A 10080 C+G ...artMenuExperienceHost.exe N/A |
| 0 N/A N/A 12056 C+G ...me\Application\chrome.exe N/A |
| 0 N/A N/A 12332 C+G ...Spark\CiscoCollabHost.exe N/A |
| 0 N/A N/A 12596 C+G ...ge\Application\msedge.exe N/A |
| 0 N/A N/A 12732 C+G ...oft\OneDrive\OneDrive.exe N/A |
| 0 N/A N/A 12924 C+G ...2txyewy\TextInputHost.exe N/A |
| 0 N/A N/A 14536 C+G ...qxf38zg5c\Skype\Skype.exe N/A |
+-----------------------------------------------------------------------------+
Running the nvcc --version command shows:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Tue_May__3_19:00:59_Pacific_Daylight_Time_2022
Cuda compilation tools, release 11.7, V11.7.64
Build cuda_11.7.r11.7/compiler.31294372_0
I tried running the example provided here, but RStudio crashes after running the fit part. The same issue occurs with other examples as well.
I am using CUDA, and the cuda_is_available() command returns TRUE.
Also, this simple code runs without issues:
torch_tensor(1, device = "cuda")
which outputs:
torch_tensor
1
[ CUDAFloatType{1} ]
Session Info:
Matrix products: default
locale:
[1] LC_COLLATE=Italian_Italy.utf8 LC_CTYPE=Italian_Italy.utf8 LC_MONETARY=Italian_Italy.utf8 LC_NUMERIC=C LC_TIME=Italian_Italy.utf8
time zone: Europe/Rome
tzcode source: internal
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] torch_0.13.0
loaded via a namespace (and not attached):
[1] processx_3.8.4 bit_4.5.0 compiler_4.4.1 magrittr_2.0.3 cli_3.6.3 tools_4.4.1 rstudioapi_0.16.0 Rcpp_1.0.13 bit64_4.0.5
[10] coro_1.0.4 callr_3.7.6 ps_1.8.0 rlang_1.1.4
Envoroment variables:
My nvcc path:
returns:
[1] "/c/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.7/bin/nvcc"
Any help with diagnosing and resolving these crashes would be greatly appreciated!
The text was updated successfully, but these errors were encountered: