
Parallel hashing for 6-9x speedup #24

Closed · wants to merge 1 commit

Conversation

@glycerine

Fixes #22 and #23.

@glycerine
Author

Nice to see: on my non-AVX512 AMD Threadripper, this is almost as fast as the b3sum reference implementation written in Rust, generally 1-2% slower.

On my AVX512-enabled Mac, this Go version is faster than b3sum.

  Tests are in parallel_test.go.

  The defaults give about a 6x speedup
  when hashing large files on my box.
@glycerine
Author

Ah. It turns out I was choking off the available parallelism. With a buffered channel, we are strictly faster than the Rust version.
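For illustration, here is a minimal sketch of the dispatch pattern being described -- workers pulling chunks from a buffered channel. All names are hypothetical; this is not the PR's actual code. With an unbuffered channel the producer blocks until a worker is free; buffering lets the reader stay ahead of the workers and keep every core busy.

```go
package main

import (
	"fmt"
	"sync"
)

// hashChunk stands in for the real per-chunk compression work.
func hashChunk(chunk []byte) int { return len(chunk) }

func main() {
	const workers = 8

	// Buffered: the producer can queue chunks without waiting for a
	// free worker. An unbuffered channel here would serialize the
	// producer and the consumers, choking off parallelism.
	jobs := make(chan []byte, workers)

	var wg sync.WaitGroup
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for chunk := range jobs {
				_ = hashChunk(chunk)
			}
		}()
	}

	// Stand-in for reading sequential chunks of a large file.
	for i := 0; i < 64; i++ {
		jobs <- make([]byte, 1024)
	}
	close(jobs)
	wg.Wait()
	fmt.Println("all chunks hashed")
}
```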

@glycerine
Author

I appreciate that authors have limited time to review such changes, and may wish to keep their libraries tightly focused on single-core performance.

For users like myself who need the fastest possible file hashing -- using all available cores -- my fork's master branch, https://github.com/glycerine/blake3, has these changes applied. My b3 tool makes them available in a b3sum-like command-line utility: https://github.com/glycerine/b3

@lukechampine
Owner

Thanks for the contribution! The API and overall style are markedly different from the rest of the repo, so I don't think I can merge this as-is -- but I will push a commit that simplifies parallel hashing, with you as a co-author, if that's acceptable.

@glycerine
Author

glycerine commented Feb 4, 2025

@lukechampine Feel free to re-mold as you like.

I'll point out that I find getting a Hasher back after a parallel file scan incredibly useful; it greatly expands the usefulness of the functionality. For example, if I want to track a file's modification time (or other metadata) as well as its content, I can just Write the few bytes of the timestamp onto the existing hasher and call Sum again. No need to repeat the expensive part.
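A minimal sketch of that pattern, using the upstream lukechampine.com/blake3 API as a stand-in (the fork's parallel-scan function that returns a live *blake3.Hasher is assumed here, not shown). Sum does not consume the hasher's state, so you can keep writing and re-finalizing:

```go
package main

import (
	"encoding/binary"
	"fmt"
	"os"

	"lukechampine.com/blake3"
)

func main() {
	// Stand-in for the fork's parallel file scan: hash the file
	// contents, keeping the Hasher alive afterwards.
	h := blake3.New(32, nil)
	data, err := os.ReadFile("big.file")
	if err != nil {
		panic(err)
	}
	h.Write(data)
	fmt.Printf("content only:    %x\n", h.Sum(nil))

	// Append a few metadata bytes (here, the modification time) and
	// finalize again -- the expensive content pass is not repeated.
	info, err := os.Stat("big.file")
	if err != nil {
		panic(err)
	}
	var ts [8]byte
	binary.LittleEndian.PutUint64(ts[:], uint64(info.ModTime().UnixNano()))
	h.Write(ts[:])
	fmt.Printf("content+modtime: %x\n", h.Sum(nil))
}
```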

@lukechampine
Owner

Superseded by #25 -- thank you!
