Histórico de Commits

Autor SHA1 Mensagem Data
  Engel Nyst eeb2342509 Refactor history/event stream (#3808) há 1 ano atrás
  Xingyao Wang 966da7b7c8 feat(agent, CodeAct 2.2): native CodeAct support for Browsing (#4667) há 1 ano atrás
  Xingyao Wang 9c2b48ff5d fix(eval): SWE-Bench instance with upper-case instance id (#4649) há 1 ano atrás
  Xingyao Wang ae13171194 feat(agent): CodeAct with function calling (#4537) há 1 ano atrás
  Xingyao Wang 1f23dc89b6 fix(eval): add runtime.connect to all eval harness (#4565) há 1 ano atrás
  Xingyao Wang 7340b78962 feat(eval): rewrite log_completions to save completions to directory (#4566) há 1 ano atrás
  Xingyao Wang 2d5b360505 refactor: re-organize different runtime implementations into an impl folder (#4346) há 1 ano atrás
  Xingyao Wang da548d308c [agent] LLM-based editing (#3985) há 1 ano atrás
  Alejandro Cuadron Lafuente a9a593bb21 [Fix] Added support to specify the platform on which the runtime image should be built. (#4402) há 1 ano atrás
  Xingyao Wang 91308ba4dc feat: clean-up retries RemoteRuntime & add FatalErrorObservation (#4485) há 1 ano atrás
  Jiayi Pan c1b323a076 Show actual dataset name in swebench log directory (#4417) há 1 ano atrás
  Xingyao Wang b23c7aab5a [eval] stop set sid in eval (#4311) há 1 ano atrás
  Robert Brennan 45fb4fb9bc allow reconnecting to a runtime (#4223) há 1 ano atrás
  Engel Nyst e6847e9e61 Move agenthub within openhands (#4130) há 1 ano atrás
  Aditya Bharat Soni 0809d26f4d fix: Allow evaluation benchmarks to pass image urls in run_controller() instead of simply passing strings (#4100) há 1 ano atrás
  Xingyao Wang 9cc9b19958 eval: improve swebench infer error handling and retry (#4205) há 1 ano atrás
  Xingyao Wang 1109637efb Update instruction for new version of eval runtime-api (#4128) há 1 ano atrás
  Xingyao Wang 714e46f29a [eval] save eventstream & llm completions for SWE-Bench run_infer (#3923) há 1 ano atrás
  tofarr ad0b549d8b Feat Tightening up Timeouts and interrupt conditions. (#3926) há 1 ano atrás
  Xingyao Wang f996b31d64 [eval] Fix multi-processing bug (again^3) & allow set EXP_NAME for each `run_infer` (#3907) há 1 ano atrás
  Xingyao Wang 3a1b8c093b [eval] yet another eval fixes on multi-processing (#3854) há 1 ano atrás
  Xingyao Wang 78c5f58adc refactor & improve retry for the reliability of `RemoteRuntime` & evaluation (#3846) há 1 ano atrás
  Xingyao Wang 47d9621742 [eval] SWE-Bench eval usability fixes (#3836) há 1 ano atrás
  Xingyao Wang 2fe2f4c530 [eval] increase timeout for SWEBench eval init/complete (#3829) há 1 ano atrás
  Jiayi Pan 43c4a7fff4 Allow Generalized SWE-Bench format for evaluation (#3752) há 1 ano atrás
  Xingyao Wang 688068a44e Fix issues for running `RemoteRuntime` in parallel on SWE-Bench (#3716) há 1 ano atrás
  Xingyao Wang d283420ac2 feat: add SWE-bench fullset support (#3477) há 1 ano atrás
  Xingyao Wang 090c911a50 (refactor) Make `Runtime` class synchronous (#3661) há 1 ano atrás
  Xingyao Wang 8b1f207d39 feat: support remote runtime (#3406) há 1 ano atrás
  Xingyao Wang 98081b9b1b (eval) EOF fixes for SWE-Bench evaluation (#3623) há 1 ano atrás