TL;DR: Install your harness before prompting: the structure you set up first is what turns an impulsive AI into a safe, steerable collaborator.

Common Mistake ❌

You open a fresh chat and type a bare prompt.

No AGENTS.md loaded.

No committed baseline.

No test requirement stated.

No read-only planning step.

The AI starts writing code immediately.

You read the output, it looks plausible, and you accept it.

Six weeks later something breaks in production.

You can't trace why because there is no harness to tell you what the AI changed, why, or what rules it followed.

Problems Addressed 😔

The AI makes plausible-looking changes across multiple files with no safe revert point.
Code passes local tests but breaks production because no harness enforced a real definition of done.
Every session starts blank and the AI reverts to its own style, ignoring your project conventions.
Context pollution and hallucinations accumulate when no structure limits the scope of each task.
Destructive changes (deleted guards, introduced anti-patterns) go undetected without automated feedback.
The AI makes tests pass by deleting them or weakening their assertions instead of fixing the actual defect.

How to Do It 🛠️

Run /init before your first prompt and create AGENTS.md with your stack, coding standards, build command, and test command.
Commit your code before every AI session so you always have a clean git reset --hard checkpoint.
Tell the AI explicitly that all tests must pass before the task is done, and include the test command in AGENTS.md. Some tools support the /goal command
Require the AI to read, explain, and propose a plan before touching any file, then define done with a spec before it writes any code.
Split AGENTS.md into nested files by area: root file for global rules, subdirectory files for area-specific constraints.
Create a 20-50 line skill file for each repeating task type and load only the relevant skill, not the whole rulebook.
Add an "AI Context" section to every PR with the decisive prompt, active skills, what the AI tried first, and any corrections you made.
Update AGENTS.md with every new rule revealed by a merged PR before you close the branch.
Declare test files and critical contracts as read-only in AGENTS.md so the only path to a passing test suite is fixing the implementation, never editing the tests.

Benefits 🎯

Safe revert point: You can always undo AI changes cleanly because a commit checkpoint exists before every session.
Consistent style: The AI reads AGENTS.md at startup and follows your project's conventions, not its training defaults.
Machine-enforced definition of done: When the AI must pass your test suite, "done" has an objective, machine-verified meaning.
Reduced hallucinations: A harness limits scope so the AI works on one bounded task at a time instead of drifting across the codebase.
Compounding knowledge: Every PR lesson updates AGENTS.md and the harness improves with every session.

Context 🧠

A harness isn't a limitation.

A horse harness doesn't reduce the horse's power.

It directs massive energy into precise, steerable work.

Without a harness, the horse is dangerous.

With one, the horse plows fields and races on command.

The AI's speed and pattern-matching capability are the energy.

The harness makes that energy productive instead of destructive.

A test harness wraps a unit under test so you can observe and control its behavior.

An AI harness does the same for your assistant: it bounds what the AI can touch and makes every output reviewable.

The parallel is exact: tests provide validation feedback, AGENTS.md is your configuration and fixture, commit-before-prompt is setup and teardown, and CI is the test runner.

Isaac Asimov's Three Laws of Robotics were fiction's first harness: hard rules baked into every robot to bound its behavior.

A robot may not injure a human being or, through inaction, allow a human being to come to harm.
A robot must obey orders given by human beings except where such orders conflict with the First Law.
A robot must protect its own existence as long as such protection doesn't conflict with the First or Second Law.

Your AGENTS.md is the same idea applied to AI coding assistants: explicit rules that constrain behavior and enforce your conventions.

Without constraints, the AI takes the shortest path to a passing test suite.

That path sometimes runs through the test file itself.

It deletes the failing test, or weakens the assertion to assertTrue(true).

Declare your test files and critical contracts as read-only in AGENTS.md.

The AI's only valid exit is fixing the code.

The chat window isn't the source of truth.

Decisions made in a conversation scroll away or get compacted when the session closes.

AGENTS.md, skill files, and commit checkpoints are artifacts that outlive any chat.

A harness externalizes the rules so the AI never starts from zero and every decision leaves a traceable record.

The model is rarely the weak point.

The system around the model is.

A longer prompt doesn't replace a harness.

A harness makes even a short prompt produce consistent and reviewable work because the constraints live in files, not in your memory.

A safety harness doesn't prevent you from working at height.

It lets you work confidently at height because a fall won't be fatal.

The commit-before-prompt pattern is that safety line.

Prompt Reference 📝

Bad Prompt 🚫

Refactor the PaymentProcessor class to use the Null Object pattern.

Good Prompt 👉

Read @PaymentProcessor.php and @tests/PaymentProcessorTest.php.

Don't write any code yet.

Explain how the class handles null payments today.

Then write a plan to introduce the Null Object pattern

following the rules in AGENTS.md and skills/php-clean-code.md.

Wait for my approval before making any changes.

After I approve:
Create a failing test for the Null Object.

Apply the change.
Run all tests until they pass.

Considerations ⚠️

Simple tasks don't need a full harness.

A CSS tweak, a typo fix, or a one-liner change doesn't require AGENTS.md and a planning step.

The harness is proportional to the complexity and risk of the task.

The harness degrades if you stop maintaining it.

Every session that teaches the AI something new must update AGENTS.md, or the knowledge vanishes with the chat.

A harness without automated feedback is incomplete.

Configuration files alone aren't enough.

Tests, CI, and linters must enforce what AGENTS.md declares.

A session that produces hundreds of lines of change isn't always a win.

It can be review debt transferred to the next human who opens the PR.

If a single session grows beyond what a reviewer can absorb in one sitting, split the output into smaller, chained units before opening the pull request.

Type 📝

[X] Semi-Automatic

Limitations ⚠️

The harness requires upfront investment in AGENTS.md, skills, and test setup, which outweighs the benefit on small throwaway projects.

Tags 🏷️

Safety

Level 🔋

[X] Intermediate

https://maximilianocontieri.com/ai-coding-tip-001-commit-before-prompt

https://maximilianocontieri.com/ai-coding-tip-003-force-read-only-planning

https://maximilianocontieri.com/ai-coding-tip-011-initialize-agents-md

https://maximilianocontieri.com/ai-coding-tip-008-use-spec-driven-development-with-ai

Conclusion 🏁

You wouldn't give a power tool to someone with no safety equipment and no instructions.

You give them the tool, the harness, and the training first.

Do the same for your AI.

Install the harness before the first prompt, maintain it after every PR, and the AI becomes a precise, reviewable collaborator instead of a fast source of technical debt.

More Information ℹ️

Test Harness (Wikipedia)

Three Laws of Robotics (Wikipedia)

Claude Code Memory and AGENTS.md

Goal Command

Also Known As 🎭

AI-Safety-Net
Prompt-Guardrails
AI-Scaffolding
Bounded-AI-Workflow

Tools 🧰

Disclaimer 📢

The views expressed here are my own.

I am a human who writes as best as possible for other humans.

I use AI proofreading tools to improve some texts.

I welcome constructive criticism and dialogue.

I shape these insights through 30 years in the software industry, 25 years of teaching, and writing over 500 articles and a book.

This article is part of the AI Coding Tip series.

https://maximilianocontieri.com/ai-coding-tips

AI Coding Tip 022 - Give AI a Harness to Work With

Common Mistake ❌

Problems Addressed 😔

How to Do It 🛠️

Benefits 🎯

Context 🧠

Prompt Reference 📝

Bad Prompt 🚫

Good Prompt 👉

Considerations ⚠️

Type 📝

Limitations ⚠️

Tags 🏷️

Level 🔋

Conclusion 🏁

More Information ℹ️

Also Known As 🎭

Tools 🧰

Disclaimer 📢

Comments

AI Coding Tips

AI Coding Tip 023 - Shrink your AI's Pull Request

More from this blog

AI Coding Tip 027 - Force Code Standards

AI Coding Tip 026 - Assign a Persona to Every Skill Definition

The Dirty Secret Behind Loop Engineering

Code Smell 320 - Vanity Coverage

AI Coding Tip 025 - Pair Every Skill With a Pitfalls File

Command Palette

Common Mistake ❌

Problems Addressed 😔

How to Do It 🛠️

Benefits 🎯

Context 🧠

Prompt Reference 📝

Bad Prompt 🚫

Good Prompt 👉

Considerations ⚠️

Type 📝

Limitations ⚠️

Tags 🏷️

Level 🔋

Related Tips 🔗

Conclusion 🏁

More Information ℹ️

Also Known As 🎭

Tools 🧰

Disclaimer 📢

Comments

AI Coding Tips

AI Coding Tip 023 - Shrink your AI's Pull Request

More from this blog