Multiple-E Go test file name suffix does not contain _test.go #224

hitesh-1997 · 2024-04-20T14:39:58Z

Hi Team,
I was using the bigcode-evaluation-harness to evaluate generation for go on Multiple-E dataset and found that, all the evaluation had output ? command-line-arguments [no test files] although status_code = 0.
On debugging further, it looks like we set self.language here instead of prompt_name['langugage'] in the problem dict to process execution downstream, and when language is checked in evaluators here, it is appended without _test.go suffix leading to non detecting any test files.

To make it easy to repro this, I have added a video below which evaluate one go generation test case (used deepseek coder to generate this)

generations_go_example.json

[
	[
        "package strlen_test\n\nimport (\n    \"testing\"\n    \"fmt\"\n)\n\n// Return length of given string\n// >>> strlen(\"\")\n// 0\n// >>> strlen(\"abc\")\n// 3\nfunc strlen(myString string) int {\n    return len(myString)\n}\n"
    ]
]

bigcode_go_test_file_name_issue.mp4

The text was updated successfully, but these errors were encountered:

hitesh-1997 linked a pull request Apr 20, 2024 that will close this issue

fix: Multiple-E dataset fix go_test.go path for test execution #225

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiple-E Go test file name suffix does not contain _test.go #224

Multiple-E Go test file name suffix does not contain _test.go #224

hitesh-1997 commented Apr 20, 2024 •

edited

Loading

Multiple-E Go test file name suffix does not contain _test.go #224

Multiple-E Go test file name suffix does not contain _test.go #224

Comments

hitesh-1997 commented Apr 20, 2024 • edited Loading

hitesh-1997 commented Apr 20, 2024 •

edited

Loading