Histórico de Commits

Autor SHA1 Mensagem Data
  Engel Nyst eeb2342509 Refactor history/event stream (#3808) há 1 ano atrás
  Xingyao Wang 1f23dc89b6 fix(eval): add runtime.connect to all eval harness (#4565) há 1 ano atrás
  Xingyao Wang 2d5b360505 refactor: re-organize different runtime implementations into an impl folder (#4346) há 1 ano atrás
  Xingyao Wang da548d308c [agent] LLM-based editing (#3985) há 1 ano atrás
  Xingyao Wang b23c7aab5a [eval] stop set sid in eval (#4311) há 1 ano atrás
  Aditya Bharat Soni 0809d26f4d fix: Allow evaluation benchmarks to pass image urls in run_controller() instead of simply passing strings (#4100) há 1 ano atrás
  Xingyao Wang 0c2a35b256 [eval] update aider bench scripts (#4203) há 1 ano atrás
  tofarr 152f99c64f Chore Bump python version (#3545) há 1 ano atrás
  tobitege dbb671a8a5 logname fix; improve test calling instruction (#3666) há 1 ano atrás
  Xingyao Wang 090c911a50 (refactor) Make `Runtime` class synchronous (#3661) há 1 ano atrás
  tobitege 9c39f07430 (enh) Aider-Bench: make resumable with skip_num arg (#3626) há 1 ano atrás
  Raj Maheshwari 0cdeb83b17 Enabling of unittests in aider benchmark should be optional. (#3620) há 1 ano atrás
  Raj Maheshwari 789f15a5db Allow the Agent to run uniittests for verification. (#3609) há 1 ano atrás
  tobitege 8fcf0817d4 (eval) Aider_bench: add eval_ids arg to run specific instance id's (#3592) há 1 ano atrás
  Graham Neubig f9088766e8 Allow setting of runtime container image (#3573) há 1 ano atrás
  Raj Maheshwari 11d8d05b1a [Fix] Metrics should be updated when agent reaches max iterations. (#3549) há 1 ano atrás
  Raj Maheshwari 80f88e14cd [Feat] Aider Benchmark (#3507) há 1 ano atrás