Introduction
Level of harness
- Prompt management
- Tool orchestrator
- Memory management
- Error handling
- Input/output formatting
- Guardrails
How to implement
Flows
- You give the agent a task
- Agent reads harness files
- Agent executes, harness governs every step:
  - Instructions: what to do, in what order
  - Scope: one feature at a time, no overreach
  - State: progress log, feature list, git history
  - Verification: tests, lint, type-check, smoke runs
  - Lifecycle: init at start, clean state at end
- Agent stops only when verification passes
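The rule "stop only when verification passes" can be sketched as a small gate that runs every check and reports success only when all of them exit cleanly. This is a minimal illustration, not a prescribed implementation: the function names run_checks and verification_passes are hypothetical, and the check commands are stand-ins for whatever test, lint, type-check, and smoke-run commands a project actually uses.

```python
import subprocess
import sys
from typing import Dict, Sequence


def run_checks(checks: Dict[str, Sequence[str]]) -> Dict[str, bool]:
    """Run each check command; a check passes when it exits with status 0."""
    results = {}
    for name, command in checks.items():
        completed = subprocess.run(list(command), capture_output=True, text=True)
        results[name] = completed.returncode == 0
    return results


def verification_passes(checks: Dict[str, Sequence[str]]) -> bool:
    """True only if at least one check is configured and every check passes."""
    if not checks:
        # Assumption of this sketch: "no checks" must not count as "verified".
        return False
    return all(run_checks(checks).values())


if __name__ == "__main__":
    # Stand-in commands so the sketch runs anywhere Python is installed.
    # In a real project these would be the project's own check commands.
    checks = {
        "tests": [sys.executable, "-c", "raise SystemExit(0)"],
        "lint": [sys.executable, "-c", "raise SystemExit(1)"],
    }
    print(run_checks(checks))
    print(verification_passes(checks))
```

Treating an empty check list as a failure is a design choice of this sketch: it keeps a misconfigured harness from letting the agent declare the work done without any evidence.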
Session 1:
- Agent reads instructions
- Agent runs init.sh
- Agent works on one feature
- Agent verifies before claiming done
- Agent updates progress log
- Agent commits clean state
Session 2:
- Agent reads progress log
- Agent picks up exactly where it left off
- Agent continues the unfinished feature
- You review, not rescue
Result: agent does the work, you verify the result
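The handoff between sessions depends on the progress log being easy to write and easy to read back. The sketch below shows one possible way to do that. The entry layout (a "## Session N" heading followed by "- feature:" and "- status:" lines) is an assumption made for illustration, as are the names Entry, format_entry, and last_entry; the notes above do not specify a format for progress.md.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Entry:
    session: int
    feature: str
    status: str  # hypothetical values such as "in progress" or "done"


def format_entry(entry: Entry) -> str:
    """Render one session entry in the assumed progress-log layout."""
    return (
        f"## Session {entry.session}\n"
        f"- feature: {entry.feature}\n"
        f"- status: {entry.status}\n"
    )


def last_entry(log_text: str) -> Optional[Entry]:
    """Return the most recent session entry, or None if the log has none."""
    heading = "## Session "
    current = None
    for line in log_text.splitlines():
        if line.startswith(heading):
            # A new heading starts a new entry and replaces the previous one.
            current = {"session": int(line[len(heading):])}
        elif current is not None and line.startswith("- ") and ": " in line:
            key, value = line[2:].split(": ", 1)
            current[key] = value
    if current is None:
        return None
    return Entry(current["session"], current["feature"], current["status"])


if __name__ == "__main__":
    log = format_entry(Entry(1, "login", "done"))
    log += format_entry(Entry(2, "search", "in progress"))
    print(last_entry(log))
```

At the start of Session 2, reading the last entry tells the agent which feature was open and whether it still needs work, which is what lets it continue instead of starting over.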
PROJECT ROOT
├── AGENTS.md <-- the agent's operating manual
├── init.sh <-- runs install + verify + start
├── feature_list.json <-- what features exist, which are done
├── progress.md <-- what happened each session
└── src/ <-- actual code
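As an illustration of how feature_list.json could enforce "one feature at a time", the sketch below picks the first unfinished feature and marks a feature as done. The schema (a JSON array of objects with "name" and "done" keys), the example feature names, and the helpers next_feature and mark_done are all assumptions; the notes above only say that the file records which features exist and which are done.

```python
import json
from typing import List, Optional


def next_feature(features: List[dict]) -> Optional[str]:
    """Return the name of the first feature not marked done, or None."""
    for feature in features:
        if not feature.get("done", False):
            return feature["name"]
    return None


def mark_done(features: List[dict], name: str) -> List[dict]:
    """Return a new list with the named feature marked done."""
    return [dict(f, done=True) if f["name"] == name else f for f in features]


# Hypothetical file contents, inlined so the sketch is self-contained.
EXAMPLE = """[
  {"name": "login", "done": true},
  {"name": "search", "done": false},
  {"name": "export", "done": false}
]"""

features = json.loads(EXAMPLE)

if __name__ == "__main__":
    print(next_feature(features))
    print(next_feature(mark_done(features, "search")))
```

In a real harness the list would be read from and written back to feature_list.json, with the "done" flag set only after verification succeeds.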