Files
letta-code/benchmarks/terminal_bench/regression-tasks.txt
2026-03-13 14:26:38 -07:00

7 lines
246 B
Plaintext

# Terminal-Bench regression task subset for Letta Code
# These tasks are run on a schedule to detect regressions.
# Criteria: fast (<10 min), diverse capabilities, deterministic.
# Adjust based on known Letta Code pass rates.
cancel-async-tasks