Build note

Claude Code Workflow

Repo instructions, skills, MCP, and a GitHub Actions review loop.

The problem

Coding agents without guardrails wander the repo, ignore conventions, and produce changes no one reviews. A team needs repo-level instructions, scoped skills, and an automated review loop to make agent-assisted work safe.

System architecture

A repo carries a CLAUDE.md operator runbook, a set of scoped skills, and MCP servers exposing internal tools; a GitHub Action runs an agent review on each pull request so changes are checked against the conventions before merge.

Workflow

  • Repo instructions and conventions are codified in a CLAUDE.md runbook.

  • Scoped skills package repeatable tasks the agent can invoke.

  • MCP servers expose internal tools to the agent under auth.

  • A developer makes agent-assisted changes on a branch.

  • A GitHub Action runs an agent review on the pull request.

  • Findings post back as comments before the change is merged.

Stack

  • Claude Code as the coding agent

  • A CLAUDE.md runbook for repo conventions

  • Scoped skills for repeatable tasks

  • MCP servers for internal tool access

  • GitHub Actions for the review loop

What gets logged

  • Which skill or MCP tool an agent invoked

  • The branch and pull request the change landed on

  • The review action's findings per pull request

  • Conventions the change passed or violated

  • Approvals and merge decisions

Where evals run

The review runs in CI on every pull request, checking changes against the repo conventions and surfacing findings before merge; a baseline set of review cases guards the reviewer itself from regressing.

Failure modes

  • The agent ignores a convention — the CI review flags it and blocks merge.

  • An MCP tool is called without authorization — access is gated so the call is refused.

  • A skill goes stale against the codebase — skills are versioned in-repo and updated with the conventions they encode.

  • The reviewer over-flags — findings are advisory comments, and a baseline set keeps its signal honest.

What this demo proves

That coding agents can be made repeatable and reviewable across a team — codified conventions, scoped skills, and an automated review gate instead of ad-hoc prompting.

Newsletter

One letter, every week. Working systems — not hot takes.

Build logs, agentic engineering decisions, agent failures, evals, and what survives real users. Sent weekly, never more.

Weekly. No spam. Unsubscribe anytime.