Introduction

Level of harness

Prompt management
Tool orchestrator
Memory management
Error handling
Input/output formatting
Guardrails

How to implement

Flows

You —> give task
Agent reads harness files
Agent executes, harness governs every step:
Instructions: what to do, in what order
Scope: one feature at a time, no overreach
State: progress log, feature list, git history
Verification: tests, lint, type-check, smoke runs
Lifecycle: init at start, clean state at end
Agent stops only when verification passes

Session 1: agent reads instructions agent runs init.sh agent works on one feature agent verifies before claiming done agent updates progress log agent commits clean state

Session 2: agent reads progress log agent picks up exactly where it left off agent continues the unfinished feature you review, not rescue

Result: agent does the work, you verify the result

PROJECT ROOT
  ├── AGENTS.md              <-- the agent's operating manual
  ├── init.sh                <-- runs install + verify + start
  ├── feature_list.json      <-- what features exist, which are done
  ├── progress.md            <-- what happened each session
  └── src/                   <-- actual code

AI harness

Introduction

Level of harness

How to implement