Commit History

Author SHA1 Message Date
  Xingyao Wang 81b3cd71b3 [eval] log evaluating warnings directly to console (#4026) 1 year ago
  Xingyao Wang 1b1d8f0b02 [eval] Use `imap_unorderd` for parallizing evaluation (#4040) 1 year ago
  Xingyao Wang a66e738957 [eval] use mp Pool instead ProcessPoolExecutor (#4025) 1 year ago
  Xingyao Wang 714e46f29a [eval] save eventstream & llm completions for SWE-Bench run_infer (#3923) 1 year ago
  Xingyao Wang 5d7f2fd4ae [eval] Allow evaluation of SWE-Bench patches on `RemoteRuntime` (#3927) 1 year ago
  Xingyao Wang f996b31d64 [eval] Fix multi-processing bug (again^3) & allow set EXP_NAME for each `run_infer` (#3907) 1 year ago
  Xingyao Wang 2b3925278d [eval] refactor process instance logic into `update_progress` (#3875) 1 year ago
  Engel Nyst 379f2b6f23 Fix queue length on Macs (#3867) 1 year ago
  Xingyao Wang 3a1b8c093b [eval] yet another eval fixes on multi-processing (#3854) 1 year ago
  Xingyao Wang 78c5f58adc refactor & improve retry for the reliability of `RemoteRuntime` & evaluation (#3846) 1 year ago
  tobitege dbb671a8a5 logname fix; improve test calling instruction (#3666) 1 year ago
  Xingyao Wang 090c911a50 (refactor) Make `Runtime` class synchronous (#3661) 1 year ago
  tobitege 9c39f07430 (enh) Aider-Bench: make resumable with skip_num arg (#3626) 1 year ago
  Raj Maheshwari e72dc96d13 [Fix] Stop API key from leaking in evaluation outputs. (#3603) 1 year ago
  tobitege 8fcf0817d4 (eval) Aider_bench: add eval_ids arg to run specific instance id's (#3592) 1 year ago
  Robert Brennan 01ae22ef57 Rename OpenDevin to OpenHands (#3472) 1 year ago
  Xingyao Wang 31b244f95e [Refactor, Evaluation] Refactor and clean up evaluation harness to remove global config and use EventStreamRuntime (#3230) 1 year ago
  Graham Neubig 3a21198424 Remove monologue agent (#3036) 1 year ago
  Xingyao Wang cf910dfa9d fix eval api_key leak in metadata; fix llm config in run infer (#2998) 1 year ago
  Engel Nyst d37b2973b2 Refactoring: event stream based agent history (#2709) 1 year ago
  Xingyao Wang f6dc89b41a [Evaluation] Simplify eval & and multi-processing related fixes (#2810) 1 year ago
  Graham Neubig a081935fd8 Simplify eval code (#2775) 1 year ago