tobitege
|
dbb671a8a5
logname fix; improve test calling instruction (#3666)
|
1 سال پیش |
Xingyao Wang
|
090c911a50
(refactor) Make `Runtime` class synchronous (#3661)
|
1 سال پیش |
tobitege
|
9c39f07430
(enh) Aider-Bench: make resumable with skip_num arg (#3626)
|
1 سال پیش |
Raj Maheshwari
|
e72dc96d13
[Fix] Stop API key from leaking in evaluation outputs. (#3603)
|
1 سال پیش |
tobitege
|
8fcf0817d4
(eval) Aider_bench: add eval_ids arg to run specific instance id's (#3592)
|
1 سال پیش |
Robert Brennan
|
01ae22ef57
Rename OpenDevin to OpenHands (#3472)
|
1 سال پیش |
Xingyao Wang
|
31b244f95e
[Refactor, Evaluation] Refactor and clean up evaluation harness to remove global config and use EventStreamRuntime (#3230)
|
1 سال پیش |
Graham Neubig
|
3a21198424
Remove monologue agent (#3036)
|
1 سال پیش |
Xingyao Wang
|
cf910dfa9d
fix eval api_key leak in metadata; fix llm config in run infer (#2998)
|
1 سال پیش |
Engel Nyst
|
d37b2973b2
Refactoring: event stream based agent history (#2709)
|
1 سال پیش |
Xingyao Wang
|
f6dc89b41a
[Evaluation] Simplify eval & and multi-processing related fixes (#2810)
|
1 سال پیش |
Graham Neubig
|
a081935fd8
Simplify eval code (#2775)
|
1 سال پیش |