Orchestration System Guide

Oh My OpenCode's orchestration system transforms a simple AI agent into a coordinated development team through separation of planning and execution.

TL;DR - When to Use What

Complexity	Approach	When to Use
Simple	Just prompt	Simple tasks, quick fixes, single-file changes
Complex + Lazy	Type `ulw` or `ultrawork`	Complex tasks where explaining context is tedious. Agent figures it out.
Complex + Precise	`@plan` → `/start-work`	Precise, multi-step work requiring true orchestration. Prometheus plans, Atlas executes.

Decision Flow:

Is it a quick fix or simple task?
  └─ YES → Just prompt normally
  └─ NO  → Is explaining the full context tedious?
              └─ YES → Type "ulw" and let the agent figure it out
              └─ NO  → Do you need precise, verifiable execution?
                         └─ YES → Use @plan for Prometheus planning, then /start-work
                         └─ NO  → Just use "ulw"

The Architecture

The orchestration system uses a three-layer architecture that solves context overload, cognitive drift, and verification gaps through specialization and delegation.

flowchart TB
    subgraph Planning["Planning Layer (Human + Prometheus)"]
        User[(" User")]
        Prometheus[" Prometheus<br/>(Planner)<br/>Claude Opus 4.6"]
        Metis[" Metis<br/>(Consultant)<br/>Claude Opus 4.6"]
        Momus[" Momus<br/>(Reviewer)<br/>GPT-5.2"]
    end
    
    subgraph Execution["Execution Layer (Orchestrator)"]
        Orchestrator[" Atlas<br/>(Conductor)<br/>K2P5 (Kimi)"]
    end
    
    subgraph Workers["Worker Layer (Specialized Agents)"]
        Junior[" Sisyphus-Junior<br/>(Task Executor)<br/>Claude Sonnet 4.6"]
        Oracle[" Oracle<br/>(Architecture)<br/>GPT-5.2"]
        Explore[" Explore<br/>(Codebase Grep)<br/>Grok Code"]
        Librarian[" Librarian<br/>(Docs/OSS)<br/>GLM-4.7"]
        Frontend[" Frontend<br/>(UI/UX)<br/>Gemini 3 Pro"]
    end
    
    User -->|"Describe work"| Prometheus
    Prometheus -->|"Consult"| Metis
    Prometheus -->|"Interview"| User
    Prometheus -->|"Generate plan"| Plan[".sisyphus/plans/*.md"]
    Plan -->|"High accuracy?"| Momus
    Momus -->|"OKAY / REJECT"| Prometheus
    
    User -->|"/start-work"| Orchestrator
    Plan -->|"Read"| Orchestrator
    
    Orchestrator -->|"task(category)"| Junior
    Orchestrator -->|"task(agent)"| Oracle
    Orchestrator -->|"task(agent)"| Explore
    Orchestrator -->|"task(agent)"| Librarian
    Orchestrator -->|"task(agent)"| Frontend
    
    Junior -->|"Results + Learnings"| Orchestrator
    Oracle -->|"Advice"| Orchestrator
    Explore -->|"Code patterns"| Orchestrator
    Librarian -->|"Documentation"| Orchestrator
    Frontend -->|"UI code"| Orchestrator

Planning: Prometheus + Metis + Momus

Prometheus: Your Strategic Consultant

Prometheus is not just a planner, it's an intelligent interviewer that helps you think through what you actually need. It is READ-ONLY - can only create or modify markdown files within .sisyphus/ directory.

The Interview Process:

stateDiagram-v2
    [*] --> Interview: User describes work
    Interview --> Research: Launch explore/librarian agents
    Research --> Interview: Gather codebase context
    Interview --> ClearanceCheck: After each response
    
    ClearanceCheck --> Interview: Requirements unclear
    ClearanceCheck --> PlanGeneration: All requirements clear
    
    state ClearanceCheck {
        [*] --> Check
        Check: Core objective defined?
        Check: Scope boundaries established?
        Check: No critical ambiguities?
        Check: Technical approach decided?
        Check: Test strategy confirmed?
    }
    
    PlanGeneration --> MetisConsult: Mandatory gap analysis
    MetisConsult --> WritePlan: Incorporate findings
    WritePlan --> HighAccuracyChoice: Present to user
    
    HighAccuracyChoice --> MomusLoop: User wants high accuracy
    HighAccuracyChoice --> Done: User accepts plan
    
    MomusLoop --> WritePlan: REJECTED - fix issues
    MomusLoop --> Done: OKAY - plan approved
    
    Done --> [*]: Guide to /start-work

Intent-Specific Strategies:

Prometheus adapts its interview style based on what you're doing:

Intent	Prometheus Focus	Example Questions
Refactoring	Safety - behavior preservation	"What tests verify current behavior?" "Rollback strategy?"
Build from Scratch	Discovery - patterns first	"Found pattern X in codebase. Follow it or deviate?"
Mid-sized Task	Guardrails - exact boundaries	"What must NOT be included? Hard constraints?"
Architecture	Strategic - long-term impact	"Expected lifespan? Scale requirements?"

Metis: The Gap Analyzer

Before Prometheus writes the plan, Metis catches what Prometheus missed:

Hidden intentions in user's request
Ambiguities that could derail implementation
AI-slop patterns (over-engineering, scope creep)
Missing acceptance criteria
Edge cases not addressed

Why Metis Exists:

The plan author (Prometheus) has "ADHD working memory" - it makes connections that never make it onto the page. Metis forces externalization of implicit knowledge.

Momus: The Ruthless Reviewer

For high-accuracy mode, Momus validates plans against four core criteria:

Clarity: Does each task specify WHERE to find implementation details?
Verification: Are acceptance criteria concrete and measurable?
Context: Is there sufficient context to proceed without >10% guesswork?
Big Picture: Is the purpose, background, and workflow clear?

The Momus Loop:

Momus only says "OKAY" when:

100% of file references verified
≥80% of tasks have clear reference sources
≥90% of tasks have concrete acceptance criteria
Zero tasks require assumptions about business logic
Zero critical red flags

If REJECTED, Prometheus fixes issues and resubmits. No maximum retry limit.

Execution: Atlas

The Conductor Mindset

Atlas is like an orchestra conductor: it doesn't play instruments, it ensures perfect harmony.

flowchart LR
    subgraph Orchestrator["Atlas"]
        Read["1. Read Plan"]
        Analyze["2. Analyze Tasks"]
        Wisdom["3. Accumulate Wisdom"]
        Delegate["4. Delegate Tasks"]
        Verify["5. Verify Results"]
        Report["6. Final Report"]
    end
    
    Read --> Analyze
    Analyze --> Wisdom
    Wisdom --> Delegate
    Delegate --> Verify
    Verify -->|"More tasks"| Delegate
    Verify -->|"All done"| Report
    
    Delegate -->|"background=false"| Workers["Workers"]
    Workers -->|"Results + Learnings"| Verify

What Atlas CAN do:

Read files to understand context
Run commands to verify results
Use lsp_diagnostics to check for errors
Search patterns with grep/glob/ast-grep

What Atlas MUST delegate:

Writing or editing code files
Fixing bugs
Creating tests
Git commits

Wisdom Accumulation

The power of orchestration is cumulative learning. After each task:

Extract learnings from subagent's response
Categorize into: Conventions, Successes, Failures, Gotchas, Commands
Pass forward to ALL subsequent subagents

This prevents repeating mistakes and ensures consistent patterns.

Notepad System:

.sisyphus/notepads/{plan-name}/
├── learnings.md      # Patterns, conventions, successful approaches
├── decisions.md      # Architectural choices and rationales
├── issues.md         # Problems, blockers, gotchas encountered
├── verification.md   # Test results, validation outcomes
└── problems.md       # Unresolved issues, technical debt

Workers: Sisyphus-Junior and Specialists

Sisyphus-Junior: The Task Executor

Junior is the workhorse that actually writes code. Key characteristics:

Focused: Cannot delegate (blocked from task tool)
Disciplined: Obsessive todo tracking
Verified: Must pass lsp_diagnostics before completion
Constrained: Cannot modify plan files (READ-ONLY)

Why Sonnet is Sufficient:

Junior doesn't need to be the smartest - it needs to be reliable. With:

Detailed prompts from Atlas (50-200 lines)
Accumulated wisdom passed forward
Clear MUST DO / MUST NOT DO constraints
Verification requirements

Even a mid-tier model executes precisely. The intelligence is in the system, not individual agents.

System Reminder Mechanism

The hook system ensures Junior never stops halfway:

[SYSTEM REMINDER - TODO CONTINUATION]

You have incomplete todos! Complete ALL before responding:
- [ ] Implement user service ← IN PROGRESS
- [ ] Add validation
- [ ] Write tests

DO NOT respond until all todos are marked completed.

This "boulder pushing" mechanism is why the system is named after Sisyphus.

Category + Skill System

Why Categories are Revolutionary

The Problem with Model Names:

// OLD: Model name creates distributional bias
task(agent="gpt-5.2", prompt="...")  // Model knows its limitations
task(agent="claude-opus-4.6", prompt="...")  // Different self-perception

The Solution: Semantic Categories:

// NEW: Category describes INTENT, not implementation
task(category="ultrabrain", prompt="...")     // "Think strategically"
task(category="visual-engineering", prompt="...")  // "Design beautifully"
task(category="quick", prompt="...")          // "Just get it done fast"

Built-in Categories

Category	Model	When to Use
`visual-engineering`	Gemini 3 Pro	Frontend, UI/UX, design, styling, animation
`ultrabrain`	GPT-5.3 Codex (xhigh)	Deep logical reasoning, complex architecture decisions
`artistry`	Gemini 3 Pro (max)	Highly creative or artistic tasks, novel ideas
`quick`	Claude Haiku 4.5	Trivial tasks - single file changes, typo fixes
`deep`	GPT-5.3 Codex (medium)	Goal-oriented autonomous problem-solving, thorough research
`unspecified-low`	Claude Sonnet 4.6	Tasks that don't fit other categories, low effort
`unspecified-high`	Claude Opus 4.6 (max)	Tasks that don't fit other categories, high effort
`writing`	K2P5 (Kimi)	Documentation, prose, technical writing

Skills: Domain-Specific Instructions

Skills prepend specialized instructions to subagent prompts:

// Category + Skill combination
task(
  category="visual-engineering", 
  load_skills=["frontend-ui-ux"],  // Adds UI/UX expertise
  prompt="..."
)

task(
  category="general",
  load_skills=["playwright"],  // Adds browser automation expertise
  prompt="..."
)

Usage Patterns

How to Invoke Prometheus

Method 1: Switch to Prometheus Agent (Tab → Select Prometheus)

1. Press Tab at the prompt
2. Select "Prometheus" from the agent list
3. Describe your work: "I want to refactor the auth system"
4. Answer interview questions
5. Prometheus creates plan in .sisyphus/plans/{name}.md

Method 2: Use @plan Command (in Sisyphus)

1. Stay in Sisyphus (default agent)
2. Type: @plan "I want to refactor the auth system"
3. The @plan command automatically switches to Prometheus
4. Answer interview questions
5. Prometheus creates plan in .sisyphus/plans/{name}.md

Which Should You Use?

Scenario	Recommended Method	Why
New session, starting fresh	Switch to Prometheus agent	Clean mental model - you're entering "planning mode"
Already in Sisyphus, mid-work	Use @plan	Convenient, no agent switch needed
Want explicit control	Switch to Prometheus agent	Clear separation of planning vs execution contexts
Quick planning interrupt	Use @plan	Fastest path from current context

Both methods trigger the same Prometheus planning flow. The @plan command is simply a convenience shortcut.

/start-work Behavior and Session Continuity

What Happens When You Run /start-work:

User: /start-work
    ↓
[start-work hook activates]
    ↓
Check: Does .sisyphus/boulder.json exist?
    ↓
    ├─ YES (existing work) → RESUME MODE
    │   - Read the existing boulder state
    │   - Calculate progress (checked vs unchecked boxes)
    │   - Inject continuation prompt with remaining tasks
    │   - Atlas continues where you left off
    │
    └─ NO (fresh start) → INIT MODE
        - Find the most recent plan in .sisyphus/plans/
        - Create new boulder.json tracking this plan
        - Switch session agent to Atlas
        - Begin execution from task 1

Session Continuity Explained:

The boulder.json file tracks:

active_plan: Path to the current plan file
session_ids: All sessions that have worked on this plan
started_at: When work began
plan_name: Human-readable plan identifier

Example Timeline:

Monday 9:00 AM
  └─ @plan "Build user authentication"
  └─ Prometheus interviews and creates plan
  └─ User: /start-work
  └─ Atlas begins execution, creates boulder.json
  └─ Task 1 complete, Task 2 in progress...
  └─ [Session ends - computer crash, user logout, etc.]

Monday 2:00 PM (NEW SESSION)
  └─ User opens new session (agent = Sisyphus by default)
  └─ User: /start-work
  └─ [start-work hook reads boulder.json]
  └─ "Resuming 'Build user authentication' - 3 of 8 tasks complete"
  └─ Atlas continues from Task 3 (no context lost)

Atlas is automatically activated when you run /start-work. You don't need to manually switch to Atlas.

Hephaestus vs Sisyphus + ultrawork

Quick Comparison:

Aspect	Hephaestus	Sisyphus + `ulw` / `ultrawork`
Model	GPT-5.3 Codex (medium reasoning)	Claude Opus 4.6 (your default)
Approach	Autonomous deep worker	Keyword-activated ultrawork mode
Best For	Complex architectural work, deep reasoning	General complex tasks, "just do it" scenarios
Planning	Self-plans during execution	Uses Prometheus plans if available
Delegation	Heavy use of explore/librarian agents	Uses category-based delegation
Temperature	0.1	0.1

When to Use Hephaestus:

Switch to Hephaestus (Tab → Select Hephaestus) when:

Deep architectural reasoning needed
- "Design a new plugin system"
- "Refactor this monolith into microservices"
Complex debugging requiring inference chains
- "Why does this race condition only happen on Tuesdays?"
- "Trace this memory leak through 15 files"
Cross-domain knowledge synthesis
- "Integrate our Rust core with the TypeScript frontend"
- "Migrate from MongoDB to PostgreSQL with zero downtime"
You specifically want GPT-5.3 Codex reasoning
- Some problems benefit from GPT-5.3 Codex's training characteristics

When to Use Sisyphus + ulw:

Use the ulw keyword in Sisyphus when:

You want the agent to figure it out
- "ulw fix the failing tests"
- "ulw add input validation to the API"
Complex but well-scoped tasks
- "ulw implement JWT authentication following our patterns"
- "ulw create a new CLI command for deployments"
You're feeling lazy (officially supported use case)
- Don't want to write detailed requirements
- Trust the agent to explore and decide
You want to leverage existing plans
- If a Prometheus plan exists, ulw mode can use it
- Falls back to autonomous exploration if no plan

Recommendation:

For most users: Use ulw keyword in Sisyphus. It's the default path and works excellently for 90% of complex tasks.
For power users: Switch to Hephaestus when you specifically need GPT-5.3 Codex's reasoning style or want the "AmpCode deep mode" experience of fully autonomous exploration and execution.

Configuration

You can control related features in oh-my-opencode.json:

{
  "sisyphus_agent": {
    "disabled": false,           // Enable Atlas orchestration (default: false)
    "planner_enabled": true,     // Enable Prometheus (default: true)
    "replace_plan": true         // Replace default plan agent with Prometheus (default: true)
  },
  
  // Hook settings (add to disable)
  "disabled_hooks": [
    // "start-work",             // Disable execution trigger
    // "prometheus-md-only"      // Remove Prometheus write restrictions (not recommended)
  ]
}

Troubleshooting

"I switched to Prometheus but nothing happened"

Prometheus enters interview mode by default. It will ask you questions about your requirements. Answer them, then say "make it a plan" when ready.

"/start-work says 'no active plan found'"

Either:

No plans exist in .sisyphus/plans/ → Create one with Prometheus first
Plans exist but boulder.json points elsewhere → Delete .sisyphus/boulder.json and retry

"I'm in Atlas but I want to switch back to normal mode"

Type exit or start a new session. Atlas is primarily entered via /start-work - you don't typically "switch to Atlas" manually.

"What's the difference between @plan and just switching to Prometheus?"

Nothing functional. Both invoke Prometheus. @plan is a convenience command while switching agents is explicit control. Use whichever feels natural.

"Should I use Hephaestus or type ulw?"

For most tasks: Type ulw in Sisyphus.

Use Hephaestus when: You specifically need GPT-5.3 Codex's reasoning style for deep architectural work or complex debugging.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Orchestration System Guide

TL;DR - When to Use What

The Architecture

Planning: Prometheus + Metis + Momus

Prometheus: Your Strategic Consultant

Metis: The Gap Analyzer

Momus: The Ruthless Reviewer

Execution: Atlas

The Conductor Mindset

Wisdom Accumulation

Workers: Sisyphus-Junior and Specialists

Sisyphus-Junior: The Task Executor

System Reminder Mechanism

Category + Skill System

Why Categories are Revolutionary

Built-in Categories

Skills: Domain-Specific Instructions

Usage Patterns

How to Invoke Prometheus

/start-work Behavior and Session Continuity

Hephaestus vs Sisyphus + ultrawork

Configuration

Troubleshooting

"I switched to Prometheus but nothing happened"

"/start-work says 'no active plan found'"

"I'm in Atlas but I want to switch back to normal mode"

"What's the difference between @plan and just switching to Prometheus?"

"Should I use Hephaestus or type ulw?"

Further Reading

FilesExpand file tree

orchestration.md

Latest commit

History

orchestration.md

File metadata and controls

Orchestration System Guide

TL;DR - When to Use What

The Architecture

Planning: Prometheus + Metis + Momus

Prometheus: Your Strategic Consultant

Metis: The Gap Analyzer

Momus: The Ruthless Reviewer

Execution: Atlas

The Conductor Mindset

Wisdom Accumulation

Workers: Sisyphus-Junior and Specialists

Sisyphus-Junior: The Task Executor

System Reminder Mechanism

Category + Skill System

Why Categories are Revolutionary

Built-in Categories

Skills: Domain-Specific Instructions

Usage Patterns

How to Invoke Prometheus

/start-work Behavior and Session Continuity

Hephaestus vs Sisyphus + ultrawork

Configuration

Troubleshooting

"I switched to Prometheus but nothing happened"

"/start-work says 'no active plan found'"

"I'm in Atlas but I want to switch back to normal mode"

"What's the difference between @plan and just switching to Prometheus?"

"Should I use Hephaestus or type ulw?"

Further Reading