Parallel scan performance gap between Vulkan and CUDA #17

Open
qiao-bo opened this issue Jul 21, 2022 · 0 comments

We currently support warp-based parallel scan on both the Vulkan and CUDA backends. Let's use this issue to track performance data:

Environment: RTX 3080, driver 510, CUDA 11.6.

| Number of elements | Vulkan (ms) | CUDA (ms) |
| --- | --- | --- |
| 131072 | 0.348 | 0.160 |
| 65536 | 0.308 | 0.111 |
| 32768 | 0.311 | 0.114 |
| 16384 | 0.232 | 0.082 |
| 8192 | 0.222 | 0.075 |
| 4096 | 0.183 | 0.075 |
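
To make the gap easier to compare across sizes, the timings above can be turned into a Vulkan/CUDA ratio. The snippet below is only a small sketch for reading the table: the numbers are copied from it, and the names `timings_ms` and `slowdown` are illustrative and not part of the benchmark code.

```python
# Timings in milliseconds, copied from the table above:
# number of elements -> (Vulkan, CUDA)
timings_ms = {
    131072: (0.348, 0.160),
    65536: (0.308, 0.111),
    32768: (0.311, 0.114),
    16384: (0.232, 0.082),
    8192: (0.222, 0.075),
    4096: (0.183, 0.075),
}


def slowdown(vulkan_ms, cuda_ms):
    """Return how many times slower the Vulkan run is than the CUDA run."""
    return vulkan_ms / cuda_ms


# Hypothetical helper dictionary: element count -> Vulkan/CUDA time ratio.
ratios = {n: slowdown(v, c) for n, (v, c) in timings_ms.items()}

for n, r in ratios.items():
    print(f"{n:>7} elements: Vulkan / CUDA = {r:.2f}x")
```

On these numbers Vulkan takes between roughly 2.2x and 3.0x as long as CUDA, with the smallest ratio at the largest input size (131072 elements).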