Bugfix: wrong attention mask calculation resulted in wrong embeddings #14496

maziyarpanahi · 2025-01-06T22:30:42Z

This pull request includes several updates to the attention mask logic across multiple classes in the com.johnsnowlabs.ml.ai package. The changes primarily involve modifying the condition used to set the attention mask values.

Updates to attention mask logic:

BGE.scala: Changed the condition for setting attention mask values from x < 0L to x == 0L in both getSentenceEmbeddingFromOv and getSentenceEmbeddingFromOnnx methods. [1] [2]
E5.scala: Updated the attention mask condition from x < 0L to x == 0L in the getSentenceEmbeddingFromOnnx method. [1] [2]
MPNet.scala: Modified the attention mask condition from x < this.paddingTokenId to x == this.paddingTokenId in the getSentenceEmbeddingFromOv and getSentenceEmbeddingFromOnnx methods. [1] [2]
Mxbai.scala: Changed the attention mask condition from x < 0L to x == 0L in the getSentenceEmbeddingFromOnnx method.
Nomic.scala: Updated the attention mask condition from x < 0L to x == 0L in the getSentenceEmbeddingFromOnnx method. [1] [2]
SnowFlake.scala: Modified the attention mask condition from x < 0L to x == 0L in both getSentenceEmbeddingFromOv and getSentenceEmbeddingFromOnnx methods. [1] [2]
UAE.scala: Changed the attention mask condition from x < 0L to x == 0L in both getSentenceEmbeddingFromOpenvino and getSentenceEmbeddingFromOnnx methods. [1] [2] [3]

ahmedlone127

LGTM!

maziyarpanahi added 2 commits January 6, 2025 23:23

Fix the bug generating wrong embeddings

0898d8c

fixing attention mask in bge, e5, mxbai, nomic, snowflake, and uae

9ac93ba

maziyarpanahi requested a review from ahmedlone127 January 6, 2025 22:33

maziyarpanahi self-assigned this Jan 6, 2025

maziyarpanahi added bug-fix DON'T MERGE Do not merge this PR labels Jan 6, 2025

ahmedlone127 reviewed Jan 16, 2025

View reviewed changes

fix issue with attention mask calculation

a6a6fca

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bugfix: wrong attention mask calculation resulted in wrong embeddings #14496

Bugfix: wrong attention mask calculation resulted in wrong embeddings #14496

maziyarpanahi commented Jan 6, 2025 •

edited

Loading

ahmedlone127 left a comment

Bugfix: wrong attention mask calculation resulted in wrong embeddings #14496

Are you sure you want to change the base?

Bugfix: wrong attention mask calculation resulted in wrong embeddings #14496

Conversation

maziyarpanahi commented Jan 6, 2025 • edited Loading

Updates to attention mask logic:

ahmedlone127 left a comment

Choose a reason for hiding this comment

maziyarpanahi commented Jan 6, 2025 •

edited

Loading