Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

implemented L1 distance & fixed L2 in binary quantization #21

Merged
merged 14 commits into from
Nov 27, 2023
Merged

implemented L1 distance & fixed L2 in binary quantization #21

merged 14 commits into from
Nov 27, 2023

Conversation

kaancfidan
Copy link
Contributor

@kaancfidan kaancfidan commented Nov 22, 2023

This pull request is related to qdrant/qdrant#3052.

It implements L1 distance type to be used for the matching Manhattan distance implementation in qdrant/qdrant.

I have tried to make the changes as non-disruptive as possible, but since L1 distance cannot be related to dot product as easily as L2, there had to be some branching between scoring implementations.

I have added tests and benchmarks and verified the implementation works for simple, AVX, SSE and Neon instructions.

@kaancfidan
Copy link
Contributor Author

I just realized the changes in C implementations broke qdrant windows build, I'm trying to understand why cl.exe can't build these.

@kaancfidan
Copy link
Contributor Author

I had seen the l1_avx is not used warning, but it is being used in the benchmarks. I could remove the benchmarks and the implementation of course but do you have a better idea?

@kaancfidan kaancfidan mentioned this pull request Nov 22, 2023
9 tasks
@kaancfidan
Copy link
Contributor Author

kaancfidan commented Nov 22, 2023

I have realized that this L1 distance is currently not compatible with binary quantization. I had followed mostly what had been done for L2, but I suspect L2 is also incompatible with it.

Edit: On second thought, I think it may already be compatible as the current code uses XOR in case of invert = true and the truth tables match perfectly (as it's the actual Hamming distance in the case of L1)

A B Dot product L1
-0.5 -0.5 0.25 0
-0.5 0.5 -0.25 1
0.5 -0.5 -0.25 1
0.5 0.5 0.25 0
A B NXOR XOR
0 0 1 0
0 1 0 1
1 0 0 1
1 1 1 0

@kaancfidan kaancfidan marked this pull request as draft November 22, 2023 14:36
@kaancfidan kaancfidan marked this pull request as ready for review November 22, 2023 15:18
@kaancfidan
Copy link
Contributor Author

I have fixed both L1 and L2 distances and added tests for both in case of binary quantization.

@kaancfidan kaancfidan marked this pull request as draft November 22, 2023 21:45
@kaancfidan
Copy link
Contributor Author

I have fixed both L1 and L2 distances and added tests for both in case of binary quantization.

I have run a benchmark with real data using Euclid + binary quantization (using qdrant itself). It did not work well. I need to investigate and understand the difference between the toy test data and the real one.

I'll also check L1 tomorrow. Meanwhile, I have set the status to draft again.

Copy link
Member

@timvisee timvisee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did a first review pass with some minor comments. Thank you very much for your time and effort to implement this! We'll definitely take a proper look at this.

quantization/tests/metrics.rs Outdated Show resolved Hide resolved
quantization/tests/test_avx2.rs Outdated Show resolved Hide resolved
quantization/src/encoded_vectors_binary.rs Outdated Show resolved Hide resolved
quantization/src/encoded_vectors.rs Outdated Show resolved Hide resolved
quantization/src/encoded_vectors_u8.rs Outdated Show resolved Hide resolved
@kaancfidan
Copy link
Contributor Author

kaancfidan commented Nov 23, 2023

I have fixed both L1 and L2 distances and added tests for both in case of binary quantization.

I have run a benchmark with real data using Euclid + binary quantization (using qdrant itself). It did not work well. I need to investigate and understand the difference between the toy test data and the real one.

I'll also check L1 tomorrow. Meanwhile, I have set the status to draft again.

I have been banging my head on this all day.

I am pretty sure L1 and L2 distances and inverses (similarities) are calculated properly. I have gone through unit tests, added integration tests that use binary and scalar quantizations on qdrant...etc. they all work as I expect.

Nevertheless, when I run our benchmark image vector dataset with this implementation, I still get 0% accuracy with L1 and L2 using binary quantization while dot product gets about 84%.

I will let it go if I can justify it as "our problem is just not compatible with binary quantization" but the fact that dot product works annoys me to no end. I am open to suggestions.

@kaancfidan
Copy link
Contributor Author

kaancfidan commented Nov 23, 2023

OK, I had an epiphany where I understood that the binary calculate_metric method does not need to actually calculate the correct metric (dot, L1 or L2) but just has to approximate their behavior and put the vectors in the same order. The actual score is being calculated when rescoring anyway.

I have relaxed the L1 and L2 binary tests to just check the similarity order but not care about the actual score values. This way I could see that the previous implementation with dot product is still on point for both L1 and L2, but it was just inverse of what it was supposed to be. The many layers of inversing confused me too.

I apologize to the reviewers that they had to go through my learning experience with me. :)

I will run the large benchmark with this latest version as soon as I can build a docker image.

edit: aaaand we have a winner! 🍾

@kaancfidan kaancfidan marked this pull request as ready for review November 23, 2023 21:35
@kaancfidan kaancfidan requested a review from timvisee November 23, 2023 21:39
@kaancfidan kaancfidan changed the title implemented L1 distance implemented L1 distance & fixed L2 in binary quantization Nov 23, 2023
Copy link
Member

@timvisee timvisee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work!

I must admit that I didn't look into the C implementations. Maybe @IvanPleshkov can take a look.

quantization/cpp/sse.c Outdated Show resolved Hide resolved
quantization/cpp/neon.c Outdated Show resolved Hide resolved
quantization/cpp/avx2.c Outdated Show resolved Hide resolved
quantization/tests/test_pq.rs Show resolved Hide resolved
quantization/src/encoded_vectors_binary.rs Show resolved Hide resolved
quantization/src/encoded_vectors_binary.rs Show resolved Hide resolved
@IvanPleshkov
Copy link
Collaborator

It's a great work! I really wondered! Impressed with correct scalar quantization alpha and multiplier values and fixed for non-L1 related bugs. I just added some requests for C code. Everything else looks perfect!

@timvisee timvisee merged commit 939fdb6 into qdrant:master Nov 27, 2023
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants