- Use kimi-k2.5 for opus-level tasks - Use nemotron-3-super for sonnet-level tasks - Use kimi-k2-thinking for deep reasoning at flash end - Document what NOT to use (qwen, claude,) Signed: Annie Tunturi <ani@wiuf.net> 👾 Generated with [Letta Code](https://letta.com) Co-Authored-By: Letta Code <noreply@letta.com>
3.1 KiB
3.1 KiB
description, limit
| description | limit |
|---|---|
| Guide for running subagents effectively - model selection and troubleshooting | 2000 |
Subagent Usage Guide
Model Selection
When running subagents via the Task tool, always specify a valid model from the available list.
Available Models (as of current system)
kimi-k2.5-nvfp4- Fast, reliablekimi-k2.5- Standard option, works wellglm-4.7-flash- Lightweightglm-4.7- Standard GLMkimi-k2-thinking- For reasoning tasksminimax-m2.5- Alternative optionqwen3.5- Qwen seriesdeepseek-v3.2- DeepSeekdeepseek-r1- Reasoning-focusedqwen3-coder- Code-specificllama-3.3-70b- Llama seriessonnet- Anthropic proxy (uses kimi-k2.5-nvfp4)haiku- Lightweight proxyopus- Heavy proxy (uses kimi-k2.5-nvfp4)nemotron-3-super- NVIDIA modelgpt-oss-120b- OpenAI OSS
Tiered Model Selection
| Task Level | Model | Purpose |
|---|---|---|
| Opus-level | kimi-k2.5 |
Deep research, complex analysis, high-stakes decisions |
| Sonnet-level | nemotron-3-super |
Mid-complexity, balanced reasoning, implementation |
| Deep reasoning | kimi-k2-thinking |
Verification, synthesis, final review at end of chain |
What NOT to Use
- NO
qwen- User preference: "Qwen sucks" - NO
claudemodels - Hardcoded to proxies, avoid confusion - NO
glm-4.7-flash- Usenemotron-3-superinstead for lightweight tasks
Preferred Models (Updated)
- Opus-level tasks:
kimi-k2.5(heavy research, architecture decisions) - Sonnet-level tasks:
nemotron-3-super(implementation, exploration) - Deep reasoning:
kimi-k2-thinking(verification, synthesis) - Fast exploration:
kimi-k2.5-nvfp4(quick searches)
Common Errors
500 Internal Server Error
InternalServerError2: 500 {"detail":"An unknown error occurred"}
Causes:
- Letta server resource issue
- Subagent creation endpoint failure
- Transient server error
Resolution:
- Retry the same request
- Try a different model
- Use direct shell search as fallback
Unknown Model Error
Error: Unknown model "claude-sonnet-4"
Available models: ...
Resolution:
- Check available models list
- Use a valid model name from the list
- Do NOT use "claude" models (user preference)
Best Practices
- Always specify model explicitly to avoid ambiguity
- Use kimi-k2.5 or kimi-k2.5-nvfp4 as safe defaults
- Prefer lighter models for simple exploration tasks
- Retry on 500 errors - often transient
- Fall back to direct shell/Bash if subagents fail repeatedly
Example Usage
Task({
description: "Explore codebase",
prompt: "Find all files that...",
subagent_type: "explore",
model: "kimi-k2.5" // Always specify
})
Troubleshooting Workflow
- Subagent fails with 500? → Retry with same model
- Still failing? → Try different model (glm-4.7-flash, qwen3.5)
- Still failing? → Use Bash/Grep/Glob directly
- Document what worked for future reference
Last updated: 2026-03-21 Discovered during: Matrix bridge formatting investigation Signed: Annie Tunturi ani@wiuf.net