Hannes Rudolph
|
3f0a6971ca
feat(web-evals): add task log viewing, export failed logs, and new run options (#9637)
|
hace 1 mes |
Chris Estreich
|
0e1b23d09c
Bare metal evals fixes (#8224)
|
hace 3 meses |
Chris Estreich
|
99448fc913
Minor fixes for local (non-Docker) evals (#5604)
|
hace 5 meses |
Steven T. Cramer
|
fca4bea7c3
Update evals Docker setup to work on Windows. (#4656)
|
hace 6 meses |
Chris Estreich
|
083ac9333a
Revert "chore(deps): update postgres docker tag to v17" (#4557)
|
hace 6 meses |
renovate[bot]
|
cf7c4a13a0
chore(deps): update postgres docker tag to v17 (#4350)
|
hace 6 meses |
Chris Estreich
|
8d5dab3518
GHA evals (#4472)
|
hace 6 meses |
John Richmond
|
736c4b1810
Improve eval docker ports (#4489)
|
hace 6 meses |
Chris Estreich
|
73ed9f2b26
Docker cleanup script (#4469)
|
hace 6 meses |
Chris Estreich
|
52673b3721
Harden evals with retry logic + centralize logs on Docker host (#4440)
|
hace 6 meses |
Chris Estreich
|
cb5b9c3718
Improve Docker setup for evals (#4327)
|
hace 6 meses |
Chris Estreich
|
d87f890556
Move evals into pnpm workspace, switch from SQLite to Postgres (#4278)
|
hace 7 meses |