| .. |
|
EDA
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
agent_bench
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
biocoder
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
bird
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
gaia
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
gorilla
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
gpqa
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
humanevalfix
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
logic_reasoning
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
miniwob
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
mint
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
ml_bench
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
regression
|
e89cc8f19b
Feat: add stream output to exec_run (#1625)
|
1 éve |
|
static
|
b2fdb963b6
Add detailed tutorial for adding new evaluation benchmarks (#1827)
|
1 éve |
|
swe_bench
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
toolqa
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
utils
|
feabc97aba
Evaluation time travel: build sandbox on the fly (#2491)
|
1 éve |
|
webarena
|
cab7a288ca
Add NUM_WORKERS variable to run_infer.sh scripts for configurable woker settings (#2597)
|
1 éve |
|
README.md
|
ebafb702e5
Add ML-Bench Evaluation with OpenDevin (#2015)
|
1 éve |
|
TUTORIAL.md
|
41564c2eac
Use :main instead of :latest (#2539)
|
1 éve |
|
__init__.py
|
2406b901df
feat(SWE-Bench environment) integrate SWE-Bench sandbox (#1468)
|
1 éve |