Xingyao Wang
|
b3bdc44292
mkdir `infer_logs` instead of `logs` (#2382)
|
1 year ago |
Xingyao Wang
|
11a2d1682d
Minor SWE-Bench inference config tweak (#2381)
|
1 year ago |
Robert
|
7fc57650f3
BioCoder integration (#2076)
|
1 year ago |
finaltrip
|
05b84df9cb
chore: fix some comments (#2234)
|
1 year ago |
RainRat
|
ed6dcc8381
fix typos (#2187)
|
1 year ago |
Xingyao Wang
|
01ef90205d
Add CodeActSWEAgent to remove browsing & github + improvements on agentskills (#2105)
|
1 year ago |
Xingyao Wang
|
5114230e53
Some SWE-Bench infer fixes and improvements (#2065)
|
1 year ago |
Xingyao Wang
|
602ffcdffb
Implement `agentskills` for OpenDevin to helpfully improve edit AND including more useful tools/skills (#1941)
|
1 year ago |
Xingyao Wang
|
6ff50ed369
Fix SWE-Bench evaluation due to setuptools version (#1995)
|
1 year ago |
Boxuan Li
|
4add8a5595
SWE-bench: Allow selection of tasks (#1935)
|
1 year ago |
Boxuan Li
|
b845a38169
Small improvements & fixes to SWE-Bench (#1874)
|
1 year ago |
Xingyao Wang
|
b2fdb963b6
Add detailed tutorial for adding new evaluation benchmarks (#1827)
|
1 year ago |
Xingyao Wang
|
2406b901df
feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468)
|
1 year ago |