Skip to content

Actions: open-thought/reasoning-gym

Actions

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
613 workflow runs
613 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fix PoolMatrixConfigs::score_answer(), add unit tests (#215)
Tests #633: Commit 48f0826 pushed by andreaskoepf
February 25, 2025 23:43 2m 7s main
February 25, 2025 23:43 2m 7s
Merge pull request #212 from open-thought/eval_consolidation_2
Tests #630: Commit 99fec34 pushed by andreaskoepf
February 25, 2025 22:46 2m 4s main
February 25, 2025 22:46 2m 4s
Add llama-3.3-70b-instruct algebra, algorithmic eval configs
Tests #629: Pull request #212 synchronize by andreaskoepf
February 25, 2025 22:43 2m 8s eval_consolidation_2
February 25, 2025 22:43 2m 8s
fix formatting of NOTICE.txt
Tests #628: Commit 92c8be1 pushed by andreaskoepf
February 25, 2025 22:43 2m 6s main
February 25, 2025 22:43 2m 6s
Add llama-3.3-70b-instruct algebra, algorithmic eval configs
Tests #623: Pull request #212 synchronize by andreaskoepf
February 25, 2025 22:36 2m 15s eval_consolidation_2
February 25, 2025 22:36 2m 15s
Add llama-3.3-70b-instruct algebra, algorithmic eval configs
Tests #622: Pull request #212 synchronize by andreaskoepf
February 25, 2025 22:32 2m 6s eval_consolidation_2
February 25, 2025 22:32 2m 6s
Add llama-3.3-70b-instruct algebra, algorithmic eval configs
Tests #621: Pull request #212 synchronize by andreaskoepf
February 25, 2025 22:27 2m 5s eval_consolidation_2
February 25, 2025 22:27 2m 5s
feat(env): CodeIO
Tests #619: Pull request #186 synchronize by zafstojano
February 25, 2025 21:21 2m 13s zafstojano:feat/codeio
February 25, 2025 21:21 2m 13s
docs: Add BibTeX citation for Re-ARC dataset in NOTICE.txt
Tests #618: Commit 8ccf077 pushed by andreaskoepf
February 25, 2025 19:19 2m 9s main
February 25, 2025 19:19 2m 9s
Add KnightsKnavesDataset (knights_knaves)
Tests #617: Commit 5f01049 pushed by andreaskoepf
February 25, 2025 19:15 2m 5s main
February 25, 2025 19:15 2m 5s
knights_knaves
Tests #616: Pull request #196 synchronize by andreaskoepf
February 25, 2025 19:15 2m 5s vncntt:knights-knaves
February 25, 2025 19:15 2m 5s
knights_knaves
Tests #615: Pull request #196 synchronize by andreaskoepf
February 25, 2025 19:09 2m 13s vncntt:knights-knaves
February 25, 2025 19:09 2m 13s
knights_knaves
Tests #614: Pull request #196 synchronize by andreaskoepf
February 25, 2025 19:04 2m 9s vncntt:knights-knaves
February 25, 2025 19:04 2m 9s
knights_knaves
Tests #613: Pull request #196 synchronize by andreaskoepf
February 25, 2025 18:54 2m 2s vncntt:knights-knaves
February 25, 2025 18:54 2m 2s
Merge pull request #205 from open-thought/consolidate_eval_script
Tests #612: Commit ed9292a pushed by andreaskoepf
February 25, 2025 18:45 2m 14s main
February 25, 2025 18:45 2m 14s
Consolidate eval scripts to have single eval.py
Tests #611: Pull request #205 synchronize by andreaskoepf
February 25, 2025 18:41 2m 5s consolidate_eval_script
February 25, 2025 18:41 2m 5s
Consolidate eval scripts to have single eval.py
Tests #610: Pull request #205 synchronize by joesharratt1229
February 25, 2025 18:14 2m 10s consolidate_eval_script
February 25, 2025 18:14 2m 10s
Fix/eval
Tests #609: Pull request #206 synchronize by joesharratt1229
February 25, 2025 16:32 2m 5s joesharratt1229:fix/eval
February 25, 2025 16:32 2m 5s