Robert Brennan
|
01ae22ef57
Rename OpenDevin to OpenHands (#3472)
|
1 年之前 |
Xingyao Wang
|
31b244f95e
[Refactor, Evaluation] Refactor and clean up evaluation harness to remove global config and use EventStreamRuntime (#3230)
|
1 年之前 |
Xingyao Wang
|
ff6ddc831f
fix: runtime test for mac (#3005)
|
1 年之前 |
Boxuan Li
|
c68478f470
Customize LLM config per agent (#2756)
|
1 年之前 |
Boxuan Li
|
6f235937cf
Evaluation time travel: allow evaluation on a specific version (#2356)
|
1 年之前 |
super-dainiu
|
563bc41fd3
Use LLM to analyze ML-Bench failure cases (#2399)
|
1 年之前 |
super-dainiu
|
ebafb702e5
Add ML-Bench Evaluation with OpenDevin (#2015)
|
1 年之前 |