Bq with u8 alignment support #25

IvanPleshkov · 2024-05-24T09:11:11Z

For multivectors, 128-dim alignment is too large. This PR introduces generic BQ storage with u128 and u8 alignment.

In the u128 case nothing changes, for backward compatibility. In the u8 case the alignment is flexible: if dim >= 128, the alignment is 128 and SIMD is optimized for 128-bit alignment (and likewise for 64 and 32).
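
As a rough illustration of the flexible alignment rule, the selection could look like the sketch below. The names `alignment_bits` and `storage_size_u8` are hypothetical, and the 8-bit fallback for dim < 32 is an assumption, not something taken from the PR code.

```rust
/// Hypothetical helper: choose the bit alignment for u8-backed storage.
/// The thresholds 128/64/32 follow the PR description; the fallback of
/// 8 bits for smaller dimensions is an assumption.
fn alignment_bits(dim: usize) -> usize {
    if dim >= 128 {
        128
    } else if dim >= 64 {
        64
    } else if dim >= 32 {
        32
    } else {
        8
    }
}

/// Hypothetical helper: number of u8 elements needed to store `dim` bits
/// (one bit per dimension), rounded up to the chosen alignment.
fn storage_size_u8(dim: usize) -> usize {
    let align = alignment_bits(dim);
    let padded_bits = (dim + align - 1) / align * align;
    padded_bits / 8
}
```

Under these assumptions a 200-dim vector is padded to 256 bits (32 bytes), while a 40-dim vector needs only 64 bits (8 bytes).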

IvanPleshkov · 2024-05-30T09:24:08Z

quantization/cpp/neon.c

-EXPORT uint64_t impl_xor_popcnt_neon(
-    const uint64_t* query_ptr,
-    const uint64_t* vector_ptr,
+EXPORT uint32_t impl_xor_popcnt_neon_uint128(


Just a simplification. Not much difference in the benchmarks.

IvanPleshkov · 2024-05-30T09:25:31Z

quantization/cpp/neon.c


-    return (uint64_t)vaddvq_u32(result);
+EXPORT uint32_t impl_xor_popcnt_neon_uint64(


There is no xor_popcnt for u32 in NEON, because NEON doesn't have an intrinsic for 32-bit popcnt.

IvanPleshkov · 2024-05-30T09:26:56Z

quantization/src/encoded_vectors_binary.rs

-        stop_condition: impl Fn() -> bool,
-    ) -> Result<Self, EncodingError> {
-        debug_assert!(validate_vector_parameters(orig_data.clone(), vector_parameters).is_ok());
+pub trait BitsStoreType:


To separate dense and multivector behaviour, I defined a generic type here, which is u128 for dense and u8 for multivector.
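
A minimal sketch of the idea, assuming one bit per dimension. The trait `BitsStore` and the helper `storage_bytes` are illustrative stand-ins and not the actual `BitsStoreType` definition from this PR; the sketch also ignores the flexible padding used for u8.

```rust
/// Illustrative stand-in for a generic bit-storage element type.
pub trait BitsStore: Default + Copy {
    /// Number of bits held by one storage element.
    const BITS: usize;

    /// Number of elements needed to hold `dim` bits, rounded up.
    fn get_storage_size(dim: usize) -> usize {
        (dim + Self::BITS - 1) / Self::BITS
    }
}

impl BitsStore for u8 {
    const BITS: usize = 8;
}

impl BitsStore for u128 {
    const BITS: usize = 128;
}

/// Bytes occupied by one encoded vector of `dim` dimensions.
fn storage_bytes<T: BitsStore>(dim: usize) -> usize {
    T::get_storage_size(dim) * std::mem::size_of::<T>()
}
```

For a 16-dim vector, `storage_bytes::<u128>(16)` gives 16 bytes while `storage_bytes::<u8>(16)` gives 2, which is the kind of saving that matters when a multivector holds many small vectors.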

IvanPleshkov · 2024-05-30T09:27:35Z

quantization/src/encoded_vectors_binary.rs

+        #[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
+        if is_x86_feature_detected!("sse4.2") {
+            unsafe {
+                if v1.len() > 16 {


For large-dim multivectors, use a more efficient SIMD implementation.
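
The split between a wide path and a per-byte path can be sketched portably, without SIMD intrinsics. The function `xor_popcnt` below is a hypothetical stand-in for the SSE/NEON routines: it handles full 16-byte chunks as u128 and falls back to single bytes for the tail.

```rust
/// Portable XOR + popcount over two equal-length byte slices.
/// Full 16-byte chunks are processed as u128; leftover bytes one at a time.
fn xor_popcnt(v1: &[u8], v2: &[u8]) -> u32 {
    assert_eq!(v1.len(), v2.len());
    let mut total = 0u32;

    let mut chunks1 = v1.chunks_exact(16);
    let mut chunks2 = v2.chunks_exact(16);
    for (c1, c2) in chunks1.by_ref().zip(chunks2.by_ref()) {
        // Copy each chunk into a fixed-size buffer to build a u128.
        let mut b1 = [0u8; 16];
        let mut b2 = [0u8; 16];
        b1.copy_from_slice(c1);
        b2.copy_from_slice(c2);
        total += (u128::from_le_bytes(b1) ^ u128::from_le_bytes(b2)).count_ones();
    }

    // Tail shorter than 16 bytes: plain per-byte XOR and popcount.
    for (x, y) in chunks1.remainder().iter().zip(chunks2.remainder()) {
        total += (x ^ y).count_ones();
    }
    total
}
```

The result is the Hamming distance between the two bit strings, so two identical slices give 0 regardless of which path handles them.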

IvanPleshkov · 2024-05-30T09:29:26Z

quantization/src/encoded_vectors_binary.rs

+
+    fn encode_vector(vector: &[f32]) -> EncodedBinVector<TBitsStoreType> {
+        let mut encoded_vector =
+            vec![Default::default(); TBitsStoreType::get_storage_size(vector.len())];


Just realised that for u128 we may have the wrong alignment here. It needs additional investigation, but that is not part of this PR.

IvanPleshkov added 4 commits May 23, 2024 15:40

BQ with u8 alignment support

7602478

neon

8bf61e7

fix u8 alignment

afe2678

are you happy fmt

9cb08cc

IvanPleshkov requested a review from generall May 24, 2024 10:07

IvanPleshkov marked this pull request as draft May 24, 2024 10:12

IvanPleshkov added 5 commits May 29, 2024 22:09

remove obsolete methods

84dd4f6

different dimensions test

6577eed

u32 and u64 simd

3c2d13b

use one from num traits

ae42e09

different alignments for u8

660fd49

IvanPleshkov commented May 30, 2024

IvanPleshkov marked this pull request as ready for review May 30, 2024 09:46

nitpicks

45bd0b3

generall approved these changes May 30, 2024

IvanPleshkov merged commit 0caf67d into master May 30, 2024
2 checks passed
