Actions: huggingface/lighteval

Quality

1,793 workflow runs

Extractive Match metric
Quality #2015: Pull request #495 synchronize by hynky1999
January 13, 2025 14:09 2m 10s math_extraction
Extractive Match metric
Quality #2014: Pull request #495 synchronize by hynky1999
January 13, 2025 14:08 2m 19s math_extraction
Extractive Match metric
Quality #2013: Pull request #495 synchronize by hynky1999
January 13, 2025 13:14 2m 13s math_extraction
llm_as_a_judge_for_oallv2_arabic
Quality #2012: Pull request #498 opened by Manel-Hik
January 13, 2025 11:30 2m 1s Manel-Hik:main
Add swiss legal evals as new community tasks
Quality #2011: Pull request #389 synchronize by JoelNiklaus
January 13, 2025 05:35 Action required JoelNiklaus:add_swiss_legal_evals
Initial proposal for model lazy loading
Quality #2010: Pull request #497 opened by JoelNiklaus
January 11, 2025 21:15 Action required JoelNiklaus:lazy-load-model-init
Extractive Match metric
Quality #2009: Pull request #495 opened by hynky1999
January 11, 2025 19:03 2m 14s math_extraction
Added custom model inference.
Quality #2008: Pull request #437 synchronize by JoelNiklaus
January 11, 2025 18:31 Action required JoelNiklaus:add-custom-model
Add Doc Strings to Config Files
Quality #2006: Pull request #465 synchronize by ParagEkbote
January 11, 2025 14:41 Action required ParagEkbote:Document-Custom-Model-Files
Add swiss legal evals as new community tasks
Quality #2001: Pull request #389 synchronize by JoelNiklaus
January 10, 2025 18:13 Action required JoelNiklaus:add_swiss_legal_evals
Add swiss legal evals as new community tasks
Quality #2000: Pull request #389 synchronize by JoelNiklaus
January 10, 2025 16:55 Action required JoelNiklaus:add_swiss_legal_evals
Fixed issue with o1 in litellm.
Quality #1999: Pull request #493 opened by JoelNiklaus
January 10, 2025 02:10 2m 13s JoelNiklaus:fix-o1-litellm
Add swiss legal evals as new community tasks
Quality #1996: Pull request #389 synchronize by JoelNiklaus
January 7, 2025 18:14 Action required JoelNiklaus:add_swiss_legal_evals
Quality
Quality #1994: by clefourrier
January 7, 2025 15:20 2m 20s main
Hotfix for litellm judge
Quality #1993: Pull request #490 synchronize by JoelNiklaus
January 7, 2025 15:17 2m 12s JoelNiklaus:fix-litellm-judge
Hotfix for litellm judge
Quality #1991: Pull request #490 synchronize by JoelNiklaus
January 7, 2025 14:57 2m 13s JoelNiklaus:fix-litellm-judge