Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] NNDescent:unused local_join function and reports an unaligned error #549

Open
zben777 opened this issue Dec 23, 2024 · 2 comments
Open
Labels
bug Something isn't working

Comments

@zben777
Copy link

zben777 commented Dec 23, 2024

Describe the bug
When I use wmma with a uint8_t parameter, I find that the program does not make the call to the local_join function and reports an unaligned error:code=716(cudaErrorMisalignedAddress) "cudaDeviceSynchronize()"
I would like to ask if there is support for uint8_t or int8_t matrix calculations in local_join?

I get an error when using the following uint8_t. And equipment :
NVIDIA GeForce RTX 4090 Driver Version: 560.28.03 CUDA Version: 12.6

And I'd like to ask if the current NNDescent supports local_join calculations other than __half

@zben777 zben777 added the bug Something isn't working label Dec 23, 2024
@cjnolet
Copy link
Member

cjnolet commented Jan 7, 2025

Hi @zben777 thanks for opening an issue. Can you please provide a minimal reproducible code snippet / example that we can use to reproduce this behavior? nn-descent is only instantiated for specific supported types and it's not header-only. Are you calling public APIs or internal APIs?

@zben777
Copy link
Author

zben777 commented Jan 10, 2025

Hi @zben777 thanks for opening an issue. Can you please provide a minimal reproducible code snippet / example that we can use to reproduce this behavior? nn-descent is only instantiated for specific supported types and it's not header-only. Are you calling public APIs or internal APIs?

constexpr int WMMA_M = 16;
constexpr int WMMA_N = 16;
constexpr int WMMA_K = 16;
// wmma::fragment<wmma::matrix_a, WMMA_M, WMMA_N, WMMA_K, half, wmma::row_major> a_frag;
// wmma::fragment<wmma::matrix_b, WMMA_M, WMMA_N, WMMA_K, half, wmma::col_major> b_frag;
wmma::fragment<wmma::matrix_a, WMMA_M, WMMA_N, WMMA_K, int8_t, wmma::row_major> a_frag;
wmma::fragment<wmma::matrix_b, WMMA_M, WMMA_N, WMMA_K, int8_t, wmma::col_major> b_frag;
wmma::fragment<wmma::accumulator, WMMA_M, WMMA_N, WMMA_K, int> c_frag;
wmma::fill_fragment(c_frag, 0);

I want to use int8 or uint8 type data for general matrix calculations to adapt to datasets like SIFT.
But in NNDescent, the code at this position is fixed to use half type.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants