Xingyao Wang
|
9c2b48ff5d
fix(eval): SWE-Bench instance with upper-case instance id (#4649)
|
il y a 1 an |
Xingyao Wang
|
6d19c93d19
[eval] add evaluation workflow (#4489)
|
il y a 1 an |
Xingyao Wang
|
1f23dc89b6
fix(eval): add runtime.connect to all eval harness (#4565)
|
il y a 1 an |
Xingyao Wang
|
b23c7aab5a
[eval] stop set sid in eval (#4311)
|
il y a 1 an |
Xingyao Wang
|
1109637efb
Update instruction for new version of eval runtime-api (#4128)
|
il y a 1 an |
Xingyao Wang
|
b13ed017d8
[eval] add git patch post-processing for SWE-Bench eval_infer (#3980)
|
il y a 1 an |
Xingyao Wang
|
5d7f2fd4ae
[eval] Allow evaluation of SWE-Bench patches on `RemoteRuntime` (#3927)
|
il y a 1 an |