Xingyao Wang
|
50c13aad98
[Eval] Improve SWE-Bench Eval harness: multi-run support & entry script simplification (#4396)
|
1 year ago |
Xingyao Wang
|
0c2a35b256
[eval] update aider bench scripts (#4203)
|
1 year ago |
Raj Maheshwari
|
0cdeb83b17
Enabling of unittests in aider benchmark should be optional. (#3620)
|
1 year ago |
tobitege
|
8fcf0817d4
(eval) Aider_bench: add eval_ids arg to run specific instance id's (#3592)
|
1 year ago |
Raj Maheshwari
|
80f88e14cd
[Feat] Aider Benchmark (#3507)
|
1 year ago |