Histórico de commits

Autor SHA1 Mensagem Data
  Graham Neubig cab7a288ca Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597) 1 ano atrás
  Boxuan Li 6f235937cf Evaluation time travel: allow evaluation on a specific version (#2356) 1 ano atrás
  Xingyao Wang 01ef90205d Add CodeActSWEAgent to remove browsing & github + improvements on agentskills (#2105) 1 ano atrás
  Xingyao Wang 5114230e53 Some SWE-Bench infer fixes and improvements (#2065) 1 ano atrás
  Xingyao Wang 602ffcdffb Implement `agentskills` for OpenDevin to helpfully improve edit AND including more useful tools/skills (#1941) 1 ano atrás
  Boxuan Li b845a38169 Small improvements & fixes to SWE-Bench (#1874) 1 ano atrás
  Xingyao Wang b2fdb963b6 Add detailed tutorial for adding new evaluation benchmarks (#1827) 1 ano atrás
  Xingyao Wang e31f8b8322 automatically get agent version for eval (#1844) 1 ano atrás
  Xingyao Wang 2406b901df feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468) 1 ano atrás