Commit History

Author SHA1 Message Date
  Boxuan Li 6f235937cf Evaluation time travel: allow evaluation on a specific version (#2356) 1 year ago
  Xingyao Wang 01ef90205d Add CodeActSWEAgent to remove browsing & github + improvements on agentskills (#2105) 1 year ago
  Xingyao Wang 5114230e53 Some SWE-Bench infer fixes and improvements (#2065) 1 year ago
  Xingyao Wang 602ffcdffb Implement `agentskills` for OpenDevin to helpfully improve edit AND including more useful tools/skills (#1941) 1 year ago
  Boxuan Li b845a38169 Small improvements & fixes to SWE-Bench (#1874) 1 year ago
  Xingyao Wang b2fdb963b6 Add detailed tutorial for adding new evaluation benchmarks (#1827) 1 year ago
  Xingyao Wang e31f8b8322 automatically get agent version for eval (#1844) 1 year ago
  Xingyao Wang 2406b901df feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468) 1 year ago