Commit History

Autor SHA1 Mensaxe Data
  Graham Neubig ce6f99d80e Add GITHUB_USERNAME env var to resolver step (#4999) hai 1 ano
  Ketan Ramaneti 852c90f64a [fix eval] Fix issues with miniwob remote runtime evaluation (#5001) hai 1 ano
  Ketan Ramaneti 42b49e6c43 [fix eval] Fix issues with aider_bench remote runtime evaluation (#5000) hai 1 ano
  Robert Brennan 17f4c6e1a9 Refactor sessions a bit, and fix issue where runtimes get killed (#4900) hai 1 ano
  Engel Nyst eeb2342509 Refactor history/event stream (#3808) hai 1 ano
  Xingyao Wang 966da7b7c8 feat(agent, CodeAct 2.2): native CodeAct support for Browsing (#4667) hai 1 ano
  Xingyao Wang 1f23dc89b6 fix(eval): add runtime.connect to all eval harness (#4565) hai 1 ano
  Xingyao Wang 2d5b360505 refactor: re-organize different runtime implementations into an impl folder (#4346) hai 1 ano
  Xingyao Wang b23c7aab5a [eval] stop set sid in eval (#4311) hai 1 ano
  Aditya Bharat Soni 0809d26f4d fix: Allow evaluation benchmarks to pass image urls in run_controller() instead of simply passing strings (#4100) hai 1 ano
  Xingyao Wang 090c911a50 (refactor) Make `Runtime` class synchronous (#3661) hai 1 ano
  Graham Neubig f9088766e8 Allow setting of runtime container image (#3573) hai 1 ano
  Robert Brennan 01ae22ef57 Rename OpenDevin to OpenHands (#3472) hai 1 ano
  Xingyao Wang b30a2dd87a completely remove update_source_code (#3280) hai 1 ano
  Xingyao Wang 31b244f95e [Refactor, Evaluation] Refactor and clean up evaluation harness to remove global config and use EventStreamRuntime (#3230) hai 1 ano
  Xingyao Wang 001195a3ea reduce the duplication in run_controller (#3217) hai 1 ano
  Xingyao Wang 4f0a454ed6 [Arch] Support integration tests using EventStream Runtime (#3184) hai 1 ano
  tobitege 70dd705418 Fix: apply config arguments for miniwob get_sandbox() from loaded config (#3198) hai 1 ano
  Graham Neubig 275ea706cf Remove remaining global config (#3099) hai 1 ano
  Xingyao Wang da17665cab fix: make max_budget_per_task optional in `run_agent_controller` (#3071) hai 1 ano
  Xingyao Wang cf910dfa9d fix eval api_key leak in metadata; fix llm config in run infer (#2998) hai 1 ano
  Engel Nyst d37b2973b2 Refactoring: event stream based agent history (#2709) hai 1 ano
  Graham Neubig d0384cafdd Two fixes to swe bench eval (#2831) hai 1 ano
  Xingyao Wang f6dc89b41a [Evaluation] Simplify eval & and multi-processing related fixes (#2810) hai 1 ano
  Graham Neubig a081935fd8 Simplify eval code (#2775) hai 1 ano
  Graham Neubig ffd3c7144c Remove global args (#2760) hai 1 ano
  Engel Nyst 2d9bb56763 Add ability to restore the cli session (optional) (#2699) hai 1 ano
  Engel Nyst 874b4c9075 CLI concurrency (#2695) hai 1 ano
  RainRat 745ae42a72 fix typos (#2352) hai 1 ano
  Frank Xu 48151bdbb0 [feat] WebArena benchmark, MiniWoB++ benchmark and related arch changes (#2170) hai 1 ano