Difference between cli and corpus_score #259

VarunGumma · 2024-03-20T18:52:58Z

I am trying to compute chrF++ for a set of predictions and references. The score from the sacrebleu CLI (sacrebleu ref.eng_Latn.tok < pred.eng_Latn.tok -m bleu chrf --chrf-word-order 2) differs significantly from the one I get with CHRF(word_order=2).corpus_score(preds, refs). I have double-checked that the data is correct and identical in both cases, so that is not the issue. Is there a reason for this? The BLEU scores (with BLEU().corpus_score(preds, refs)) also differ significantly. Are there default parameters that I am missing?

nkrasner · 2024-05-31T18:10:45Z

I think this is related to #220. I was having the same issue; transposing the references as mentioned there fixed it.
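
For anyone landing here, a minimal sketch of what that transposition can look like. It assumes the references were built as one list per sentence, whereas `corpus_score` takes a list of reference streams, each as long as the list of predictions. The helper `to_reference_streams` and the sample sentences are made up for illustration and are not part of sacrebleu; the sacrebleu import is guarded so the snippet still runs without the package installed.

```python
def to_reference_streams(per_sentence_refs):
    """Transpose [[ref_a, ref_b], ...] (one list per sentence) into
    [[ref_a for every sentence], [ref_b for every sentence]]."""
    counts = {len(refs) for refs in per_sentence_refs}
    if len(counts) != 1:
        raise ValueError("every sentence needs the same number of references")
    return [list(stream) for stream in zip(*per_sentence_refs)]


# Illustrative data only: two predictions, one reference per sentence.
preds = ["the cat sat on the mat", "hello world"]
per_sentence_refs = [["the cat sat on the mat"], ["hello world"]]

# One stream holding the reference for every sentence, in order.
ref_streams = to_reference_streams(per_sentence_refs)

try:
    from sacrebleu.metrics import BLEU, CHRF
except ImportError:  # sacrebleu not installed; skip the scoring part
    BLEU = CHRF = None

if CHRF is not None:
    # chrF++ is chrF with word_order=2, matching --chrf-word-order 2 on the CLI.
    print(CHRF(word_order=2).corpus_score(preds, ref_streams))
    print(BLEU().corpus_score(preds, ref_streams))
```

With a single flat list of references `refs` (one string per prediction), the equivalent call would be `corpus_score(preds, [refs])`.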
