Xingyao Wang 53390d9885 Fix issue #4583: [Bug]: Unable to pull the full SWE-Bench test set (#4813) 1 year ago
..
docker 53390d9885 Fix issue #4583: [Bug]: Unable to pull the full SWE-Bench test set (#4813) 1 year ago
eval 6d19c93d19 [eval] add evaluation workflow (#4489) 1 year ago
setup 50c13aad98 [Eval] Improve SWE-Bench Eval harness: multi-run support & entry script simplification (#4396) 1 year ago
cleanup_remote_runtime.sh 245334e89d [eval] improve update output script for swe-bench (#4180) 1 year ago
eval_infer.sh 8d6eda3623 fix eval_infer.sh to correctly copy SWE-Bench logs (#4111) 1 year ago
eval_infer_remote.sh 245334e89d [eval] improve update output script for swe-bench (#4180) 1 year ago
run_infer.sh 966da7b7c8 feat(agent, CodeAct 2.2): native CodeAct support for Browsing (#4667) 1 year ago