Per-Agent Model Selection
Try this to prioritize quality for the session:
Have all agents use Opus for the rest of this session
Try this to optimize costs:
Switch to Haiku — I'm trying to save costs
Try this to balance quality and budget:
Use Sonnet for code, Haiku for everything else
Squad adjusts model selection based on your directive. Agents writing code get quality models (Sonnet/Opus), agents doing docs/logs get cost-optimized models (Haiku). You can override anytime.
How It Works
Squad routes each agent to the right model based on what they’re doing — not a one-size-fits-all default. The governing principle: cost first, unless code is being written.
How It Works
Model selection uses a layered system. First match wins:
- User Override — You said “use opus” or “save costs”? Done. Session-wide directives persist until contradicted.
- Charter Preference — The agent’s charter specifies a
## Modelsection with a preferred model. - Task-Aware Auto-Selection — The coordinator checks what the agent is actually doing:
| Task Output | Model | Tier |
|---|---|---|
| Writing code (implementation, refactoring, tests, bug fixes) | claude-sonnet-4.5 |
Standard |
| Writing prompts or agent designs | claude-sonnet-4.5 |
Standard |
| Non-code work (docs, planning, triage, changelogs) | claude-haiku-4.5 |
Fast |
| Visual/design work requiring image analysis | claude-opus-4.5 |
Premium |
- Default — If nothing matched,
claude-haiku-4.5. Cost wins when in doubt.
Role-to-Model Mapping
| Role | Default Model | Why |
|---|---|---|
| Core Dev / Backend / Frontend | claude-sonnet-4.5 |
Writes code — quality first |
| Tester / QA | claude-sonnet-4.5 |
Writes test code |
| Lead / Architect | auto (per-task) | Mixed: code review vs. planning |
| Prompt Engineer | auto (per-task) | Prompt design is like code |
| DevRel / Writer | claude-haiku-4.5 |
Docs — not code |
| Scribe / Logger | claude-haiku-4.5 |
Mechanical file ops |
| Git / Release | claude-haiku-4.5 |
Changelogs, tags, version bumps |
| Designer / Visual | claude-opus-4.5 |
Vision capability required |
16-Model Catalog
Squad supports 16 models across three tiers:
- Premium: claude-opus-4.6, claude-opus-4.6-fast, claude-opus-4.5
- Standard: claude-sonnet-4.5, gpt-5.2-codex, claude-sonnet-4, gpt-5.2, gpt-5.1-codex, gpt-5.1, gpt-5, gemini-3-pro-preview
- Fast/Cheap: claude-haiku-4.5, gpt-5.1-codex-mini, gpt-4.1, gpt-5-mini, gpt-5.1-codex-mini
Fallback Chains
If a model is unavailable (plan restriction, rate limit, deprecation), Squad silently retries with the next in chain:
Premium: claude-opus-4.6 → claude-opus-4.6-fast → claude-opus-4.5 → claude-sonnet-4.5
Standard: claude-sonnet-4.5 → gpt-5.2-codex → claude-sonnet-4 → gpt-5.2
Fast: claude-haiku-4.5 → gpt-5.1-codex-mini → gpt-4.1 → gpt-5-mini
Never falls back UP in tier — a fast task won’t land on a premium model.
User Overrides
Tell the coordinator what you want:
"use opus for this"— one-off premium"always use haiku"— session-wide cost savings"use gpt-5.2-codex for Fenster"— agent-specific override
Sample Prompts
use opus for this architecture work
Override to premium model for a single high-stakes task.
always use haiku to save costs
Set session-wide preference for the cheapest model tier.
what model did Kane use for that last task?
Check which model was actually used for a completed task.
use gpt-5.2-codex for all backend work
Set a specific model for tasks in a particular domain.
switch back to automatic model selection
Clear any session-wide overrides and return to task-aware auto-selection.