Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[ROCm] fix test_softmax_forward_64bit_indexing_cuda OOM (pytorch#113093)
TestNNDeviceTypeCUDA.test_softmax_forward_64bit_indexing_cuda started failing for ROCm after pytorch#112096 with the message torch.cuda.OutOfMemoryError: HIP out of memory. Tried to allocate 13.35 GiB. GPU 0 has a total capacity of 31.98 GiB of which 3.89 GiB is free. Of the allocated memory 26.69 GiB is allocated by PyTorch, and 18.91 MiB is reserved by PyTorch but unallocated. This amounts to approximately 41GB. The test is currently decorated with `largeTensorTest("30GB", "cuda")` but this is not sufficient for ROCm. Pull Request resolved: pytorch#113093 Approved by: https://github.com/malfet
- Loading branch information