Commit History

Author SHA1 Message Date
  Xingyao Wang 50c13aad98 [Eval] Improve SWE-Bench Eval harness: multi-run support & entry script simplification (#4396) 1 year ago
  Xingyao Wang 0c2a35b256 [eval] update aider bench scripts (#4203) 1 year ago
  Raj Maheshwari 0cdeb83b17 Enabling of unittests in aider benchmark should be optional. (#3620) 1 year ago
  tobitege 8fcf0817d4 (eval) Aider_bench: add eval_ids arg to run specific instance id's (#3592) 1 year ago
  Raj Maheshwari 80f88e14cd [Feat] Aider Benchmark (#3507) 1 year ago