fleet2.eduz.ai

Agent & model fleet — which model for which job, what's available, what it costs. Today: static catalog. v1: live session aggregator.

v0: static model catalog. Session-level live data (active runs, token spend, cost roll-up) needs a backend that reads ~/.claude/projects/-home-pk/*.jsonl — that's v1 work, and will be auth-gated (session content is private). For now: use this page as the routing reference when picking a model for a task.

Claude family (primary fleet)

Anthropic Claude 4.X — what's installed in this universe. Knowledge cutoff: January 2026.

modeltieridstrengthstypical use
Opus 4.8 flagship claude-opus-4-8 Deepest reasoning, longest-context judgment, most reliable on multi-step planning Architecture, CTO-mode, design reviews, ambiguous problems
Opus 4.7 flagship (prev) claude-opus-4-7 Same family as 4.8, slightly older; what this session is running on Same as 4.8 when 4.8 unavailable
Sonnet 4.6 balanced claude-sonnet-4-6 Good reasoning at ~5x cheaper than Opus; great for most coding work Default for app-level coding, refactors, ports where judgment isn't critical
Haiku 4.5 fast/cheap claude-haiku-4-5-20251001 Lowest cost, fast TTFB, still capable for mechanical work Subagent dispatch (file copies, mechanical ports, batched lookups). Used heavily in this repo's build.
Fable 5 special claude-fable-5 Specialized variant Niche use cases

Non-Claude providers (registered in Doppler)

Other model providers we have keys for. Stored in Doppler workplaces below, fetched on demand.

provideraccount / sourceDoppler scopetier notetypical use
DeepSeekvia API keyeduz1-core/deepseek/prdVery cheap; strong codingMechanical-but-tricky code, when Haiku is too brittle
Geminiprasoonky@gmail.com (AI Studio)eduz1b-core/gemini/prdFree tier (rate-limited) — verified via -FreeTier quota IDsFree-tier experiments; long-context tasks within the rate limit
GLM(zhipu)eduz1-core/glm/prdZ-family model seriesAlternative when Western models are throttled
OpenRouteraggregatoreduz1-core/openrouter/prdOne key, many modelsTesting models we don't have direct accounts with
Mimo(Xiaomi)eduz1-core/mimo/prdOpen-source small-model familySelf-hosted experiments
RunPodself-hosted computeeduz1-core/runpod/prdGPU pods for OSS model servingWhen we want full control of the model + data

Routing rules of thumb

For this repo specifically: Opus drives the conversation; Haiku does the heavy file copies and README ports (cheap, parallel). DeepSeek + Gemini are available for future cost-sensitive batch work.

v1 ideas (auth-gated)

  1. Live session list — read ~/.claude/projects/-home-pk/*.jsonl, render sessions (model, start time, message count, last activity).
  2. Token roll-up — sum subagent_tokens across sessions to surface daily / weekly spend.
  3. Fleet "now" — which agents are actively running (parallel forks, background tasks).
  4. Cost projection — extrapolate spend at current burn rate.

All gated by auth-allowlist once that's wired across the 2-series.