Commit History

Author SHA1 Message Date
  Xingyao Wang 2d5b360505 refactor: re-organize different runtime implementations into an impl folder (#4346) 1 year ago
  Xingyao Wang b23c7aab5a [eval] stop set sid in eval (#4311) 1 year ago
  Aditya Bharat Soni 0809d26f4d fix: Allow evaluation benchmarks to pass image urls in run_controller() instead of simply passing strings (#4100) 1 year ago
  Xingyao Wang 090c911a50 (refactor) Make `Runtime` class synchronous (#3661) 1 year ago
  Graham Neubig f9088766e8 Allow setting of runtime container image (#3573) 1 year ago
  Robert Brennan 01ae22ef57 Rename OpenDevin to OpenHands (#3472) 1 year ago
  Xingyao Wang b30a2dd87a completely remove update_source_code (#3280) 1 year ago
  Xingyao Wang 31b244f95e [Refactor, Evaluation] Refactor and clean up evaluation harness to remove global config and use EventStreamRuntime (#3230) 1 year ago
  Xingyao Wang 001195a3ea reduce the duplication in run_controller (#3217) 1 year ago
  Xingyao Wang 4f0a454ed6 [Arch] Support integration tests using EventStream Runtime (#3184) 1 year ago
  மனோஜ்குமார் பழனிச்சாமி 563ebd406d Fix: Add missing arguments for SSHBox in evaluation (#3075) 1 year ago
  Graham Neubig 275ea706cf Remove remaining global config (#3099) 1 year ago
  Xingyao Wang da17665cab fix: make max_budget_per_task optional in `run_agent_controller` (#3071) 1 year ago
  Graham Neubig 3a21198424 Remove monologue agent (#3036) 1 year ago
  Xingyao Wang cf910dfa9d fix eval api_key leak in metadata; fix llm config in run infer (#2998) 1 year ago
  Anush Kumar V 8f76587e5c docs: updated docstrings using ruff's autofix feature (#2923) 1 year ago
  Engel Nyst d37b2973b2 Refactoring: event stream based agent history (#2709) 1 year ago
  Graham Neubig d0384cafdd Two fixes to swe bench eval (#2831) 1 year ago
  Xingyao Wang f6dc89b41a [Evaluation] Simplify eval & and multi-processing related fixes (#2810) 1 year ago
  Graham Neubig a081935fd8 Simplify eval code (#2775) 1 year ago
  Graham Neubig ffd3c7144c Remove global args (#2760) 1 year ago
  Engel Nyst 874b4c9075 CLI concurrency (#2695) 1 year ago
  RainRat 745ae42a72 fix typos (#2352) 1 year ago
  super-dainiu beabcce16d [Hotfix] Fix ML-Bench continue ``run_inference.py`` (#2284) 1 year ago
  super-dainiu ebafb702e5 Add ML-Bench Evaluation with OpenDevin (#2015) 1 year ago