
Result of Localization #21

Open
GCVulnerability opened this issue Aug 13, 2024 · 8 comments

Comments

@GCVulnerability

Hi, Agentless is amazing work. I noticed that "% Correct Location" is mentioned in the paper.
I'm really interested in fault localization on SWE-bench.
Could you please provide the ground truth for SWE-bench Lite and the evaluation code?

@brutalsavage
Contributor

Thanks for the question, we will release that soon!

@yorhaha

yorhaha commented Aug 20, 2024

> Thanks for the question, we will release that soon!

Could you please give an approximate release time? Otherwise, we will consider implementing our own evaluation code, but that may lead to differences in our results.

Thanks for your work.

@yorhaha

yorhaha commented Aug 20, 2024

Also, may I ask whether you calculate the recall value based on the final generated patch and the ground-truth patch (without considering the code retrieved in the intermediate steps before generating the patch)?

@brutalsavage
Contributor

> Could you please give an approximate release time?

Our hope is sometime this week or early next week.

> calculate recall value

Not totally sure what you mean; can you please elaborate a bit more?

@yorhaha

yorhaha commented Aug 21, 2024

By recall value (as used in the SWE-bench paper), I meant "% Correct Location" in your paper. But after reading your paper more carefully, I now think the two concepts are different.

The recall value measures retrieval performance of RAG in the SWE-bench paper. I am confused by the meaning of "% Correct Location": doesn't it encourage more code changes (to cover the ground-truth patch)?

@brutalsavage
Contributor

Right, so in our paper "% Correct Location" measures the percentage of time the patch edits the same locations as the ground-truth developer patch. We count it as the correct location if the patch edits a superset of all the ground-truth locations. For example, at the function granularity, if a patch edits func1 and func2 but the ground-truth patch edits only func1, we still count it as correct. You can see Section 3 of the paper for more details.
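A minimal sketch of that superset check, assuming edit locations are represented as sets of function names; the names below are illustrative and not the repository's actual evaluation code:

```python
# Hypothetical illustration of the "% Correct Location" check at function
# granularity; function and variable names here are not Agentless's API.

def is_correct_location(patch_funcs: set[str], ground_truth_funcs: set[str]) -> bool:
    """A patch counts as a correct location if the functions it edits
    are a superset of every function edited by the ground-truth patch."""
    return ground_truth_funcs.issubset(patch_funcs)

def correct_location_rate(results: list[tuple[set[str], set[str]]]) -> float:
    """Percentage of instances whose patch covers all ground-truth locations."""
    hits = sum(is_correct_location(patch, gt) for patch, gt in results)
    return 100.0 * hits / len(results)

# Example from the comment above: the patch edits func1 and func2 while the
# ground-truth patch edits only func1, so it still counts as correct.
assert is_correct_location({"func1", "func2"}, {"func1"})
```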

@yorhaha

yorhaha commented Aug 21, 2024

Thanks for your explanation! I get it now.

@UniverseFly

Any updates on the evaluation of fault localization accuracy?
