CLI coding agents over ACP

Hand a real coding job to a purpose-built CLI coding agent — protoCLI (proto), Claude Code, Codex, Gemini CLI — and get the result back. A coding agent carries its own file access, shell, repo-map, and edit/verify loop, so it reads/edits/runs code in a repo far better than a generic tool loop.

You reach one through the unified delegate registry (ADR 0025) as an acp delegate: delegate_to(target, query). protoAgent is the ACP client; proto --acp (or another CLI's ACP mode) is the matching server, driven over the Agent Client Protocol — JSON-RPC 2.0 over the child's stdin/stdout.
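As a rough illustration of the wire format, the sketch below frames an ACP initialize request as one line of JSON-RPC 2.0. The newline-delimited framing and the protocolVersion / clientCapabilities field names are assumptions drawn from the public ACP schema, not from protoAgent's client, and frame_request is a hypothetical helper.

```python
import json


def frame_request(request_id: int, method: str, params: dict) -> bytes:
    """Serialize one JSON-RPC 2.0 request as a single newline-terminated line."""
    message = {"jsonrpc": "2.0", "id": request_id, "method": method, "params": params}
    # Compact separators keep the message on one line; the trailing newline
    # marks the end of the message on the child's stdin (assumed framing).
    return (json.dumps(message, separators=(",", ":")) + "\n").encode("utf-8")


# Assumed shape of the first message a client sends.
initialize = frame_request(
    0,
    "initialize",
    {
        "protocolVersion": 1,
        # No client-served fs/terminal capability is advertised.
        "clientCapabilities": {
            "fs": {"readTextFile": False, "writeTextFile": False},
            "terminal": False,
        },
    },
)
```

The same helper would frame session/new and session/prompt; only the method and params change.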

History

This used to be a standalone coding_agent plugin contributing a code_with tool (ADR 0024). That tool was retired in favour of delegate_to with an acp delegate, which does the same job through one tool, alongside a2a/openai delegates and a console panel. The ACP client mechanics described here are unchanged; delegate_to reuses them.

Security: a coding agent gets file + shell access in its workdir (confined to that directory — see Permission posture). Declare it deliberately, and prefer a scoped/throwaway workdir.

Configure an `acp` delegate

Coding agents run as local subprocesses, so they're declared in YAML (not in-app Settings — each grants local authority and deserves a deliberate edit):

yaml

# config/langgraph-config.yaml
delegates:
  - name: proto                 # the name you pass to delegate_to(target=…)
    type: acp
    description: Coding agent — implements a change in a repo.
    command: proto              # binary on PATH
    args: ["--acp"]             # ACP server mode
    workdir: ~/dev/my-repo      # session cwd — the confinement boundary
    permissions: allowlist      # auto | allowlist | readonly
    # env: { SOME_KEY: value }  # optional extra env, merged over the process env
    # timeout_s: 900            # optional per-call timeout (seconds)
    # allow_kinds: []           # override: kinds to allow
    # deny_kinds: [execute, delete]   # override: kinds to deny

The delegates registry is enabled by default — there's no plugin to turn on. Declaring (or editing) the delegates list hot-reloads on Save & Reload: the first delegate you add registers delegate_to for the next turn, no restart.

Other coding agents

Any agent that speaks ACP works — just point command/args at it:

yaml

delegates:
  - { name: proto,       type: acp, command: proto,       args: ["--acp"],              workdir: ~/dev/my-repo }
  - { name: claude-code, type: acp, command: claude-code, args: [],                     workdir: ~/dev/my-repo }   # alias → claude-agent-acp
  - { name: codex,       type: acp, command: codex-acp,   args: [],                     workdir: ~/dev/my-repo }   # @zed-industries/codex-acp adapter (codex has no native ACP)
  - { name: opencode,    type: acp, command: opencode,    args: ["acp"],                workdir: ~/dev/my-repo }
  - { name: copilot,     type: acp, command: copilot,     args: ["--acp"],              workdir: ~/dev/my-repo }
  - { name: gemini,      type: acp, command: gemini,      args: ["--experimental-acp"], workdir: ~/dev/my-repo }

The binary must be installed and on the PATH of the process running protoAgent. The delegates panel's Test button performs a real ACP initialize handshake rather than a missing-binary check, so a wrong launch command fails the probe instead of showing green, and a misconfigured delegate surfaces at Test, not at first dispatch. The probe resolves the command against the same PATH the spawn uses (the process PATH with the delegate's env PATH overlaid), so probe and dispatch never disagree.
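The PATH overlay can be pictured with a short sketch. It is not the probe's actual code: resolve_command is a hypothetical helper that merges the delegate env over the process env and resolves the command against the resulting PATH, which is the behaviour the paragraph above describes.

```python
import os
import shutil
from typing import Mapping, Optional


def resolve_command(command: str,
                    delegate_env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Resolve `command` against the PATH a spawn with the merged env would see."""
    merged = dict(os.environ)
    merged.update(delegate_env or {})  # per-delegate env overlays the process env
    # shutil.which searches the given PATH string and returns None when
    # nothing executable matches.
    return shutil.which(command, path=merged.get("PATH", os.defpath))
```

A None result corresponds to the "binary not on PATH" failure; a resolved path still has to pass the initialize handshake before the probe goes green.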

macOS desktop app & PATH

A GUI app launched from Finder/Dock/launchd inherits only launchd's minimal PATH (/usr/bin:/bin:/usr/sbin:/sbin), not your login-shell PATH — so Homebrew (/opt/homebrew/bin), nvm, Volta, and asdf installs (where npx/node/ACP adapters live) are invisible, and a command: npx delegate fails with binary not on PATH (#1299). The desktop build now hands the bundled server your real login-shell PATH, so this works out of the box. If you still hit it (an unusual shell setup), either set an absolute command (/opt/homebrew/bin/npx) or add a PATH to the delegate env — both pass the probe too. The web app (terminal-launched server) is unaffected.

No Node installed at all? Provision a managed one

All of the above finds a Node you already have. If you have none — a common case for a fresh desktop install — the npx-based agents (Claude Code, Codex) and npx-based MCP servers have nothing to launch. Provision a managed Node runtime once (ADR 0085):

bash

protoagent runtime install-node    # downloads a pinned Node into ~/.protoagent/runtime/node

It's a box-shared, hash-verified download. The server picks it up on the next start (an already-running server hot-adopts it), and protoagent runtime list shows Node status. A user Node install always takes precedence, so this only fills the gap. npx -y still fetches the adapter itself on first launch, then caches it.

Claude Code has no native ACP mode. Drive it through the claude-agent-acp adapter: install it (npm i -g @agentclientprotocol/claude-agent-acp) and use the claude-code alias above — it maps to command: claude-agent-acp with no args, so you don't have to know the incantation. (The older @zed-industries/claude-code-acp is deprecated — it was renamed to @agentclientprotocol/claude-agent-acp.) Setting command: claude directly does not work — claude isn't an ACP server, and the probe will tell you so.

Nested Claude: the adapter launches the claude binary, which refuses to start nested inside another Claude Code session (Error: Claude Code cannot be launched inside another Claude Code session). protoAgent now strips the nested-session markers (CLAUDECODE and the whole CLAUDE_CODE_* family) from the ACP launch env automatically (#1296), so launching protoAgent from within a claude session — the dogfooding case — works without the manual env -u … dance. (Partial strips were the footgun: missing just one of CLAUDE_CODE_SESSION_ID / CLAUDE_CODE_ENTRYPOINT / … still tripped the guard, so the agent respawned every ~2 min with no surfaced error.) A value you set explicitly in the delegate env still wins.
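A minimal sketch of the stripping rule described above, assuming the launch env is built by filtering the inherited environment and then overlaying the delegate env. build_launch_env is a hypothetical name, not protoAgent's function.

```python
from typing import Dict, Mapping, Optional


def build_launch_env(base: Mapping[str, str],
                     delegate_env: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Drop nested-Claude markers from `base`, then overlay the delegate's env."""
    env = {
        key: value
        for key, value in base.items()
        # Strip the whole family, not a hand-picked subset of markers.
        if key != "CLAUDECODE" and not key.startswith("CLAUDE_CODE_")
    }
    env.update(delegate_env or {})  # an explicitly set delegate value still wins
    return env
```

Matching on the prefix rather than a list of known names is what avoids the partial-strip footgun: a marker nobody thought to list is removed too.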

Codex has no native ACP mode either. Recent codex CLI (≥ 0.13x) dropped the acp subcommand — it speaks MCP natively, not ACP, so command: codex, args: ["acp"] no longer works (the probe fails). Drive it through the @zed-industries/codex-acp adapter: install it (npm i -g @zed-industries/codex-acp) → command: codex-acp, or run it zero-install with command: npx, args: ["-y", "@zed-industries/codex-acp"] (the form the ACP-runtime and MCP guides use).

opencode (opencode acp) and GitHub Copilot CLI (copilot --acp) ship native ACP servers — point command/args straight at them, no adapter needed.

Use it

The lead agent calls delegate_to; configured delegates appear in the tool's description:

delegate_to(target="proto", query="Add a GET /healthz route to server/, wire it
into the app, and run the tests. Report what you changed.")

Notes for whoever writes the query:

- The coding agent does not see this conversation, so make query a self-contained brief: the goal, the relevant files if known, and the definition of done ("run the tests", "and lint").
- The delegate works in its configured workdir. To target a different tree, declare another delegate or, programmatically, dispatch a workdir-scoped copy (the board loop does this per feature; see below).
- The call blocks until the turn finishes (coding is slow), up to timeout_s.
- Follow-up calls reuse the cached session, so you can iterate (delegate_to("proto", "now also add a test for it")).

Permission posture

A coding agent works in its configured workdir and uses its own file/shell access there; protoAgent advertises no client-served fs/terminal capability. When the coding agent asks to do something risky it sends a session/request_permission, which protoAgent answers with the delegate's permission policy:

| `permissions` | Behaviour |
| --- | --- |
| `auto` (default) | Allow everything; the agent self-governs within its workdir. |
| `allowlist` | Allow all action kinds except `execute` and `delete` (override with `allow_kinds` / `deny_kinds`). |
| `readonly` | Allow only read-like kinds (`read`, `search`, `fetch`, …); deny edits, shell, and deletes. |

Action kinds come from the ACP request (toolCall.kind: read / edit / execute / delete / fetch / move / search / …).
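The table can be read as a small decision function. The sketch below is one interpretation, not the shipped policy: the read-like set is limited to the three kinds named above, and the override precedence (an explicit deny beats an explicit allow, and both beat the default) is an assumption. decide is a hypothetical name.

```python
from typing import Iterable

READ_LIKE = frozenset({"read", "search", "fetch"})   # assumed subset of read-like kinds
ALLOWLIST_DENY = frozenset({"execute", "delete"})    # default denials for `allowlist`


def decide(permissions: str, kind: str,
           allow_kinds: Iterable[str] = (),
           deny_kinds: Iterable[str] = ()) -> bool:
    """Return True to allow the requested action kind, False to deny it."""
    if permissions == "auto":
        return True
    if permissions == "readonly":
        return kind in READ_LIKE
    if permissions == "allowlist":
        # Assumed precedence: explicit deny, then explicit allow, then default.
        if kind in set(deny_kinds):
            return False
        if kind in set(allow_kinds):
            return True
        return kind not in ALLOWLIST_DENY
    raise ValueError(f"unknown permissions mode: {permissions!r}")
```

Because the answer depends only on the mode and the kind, the same request always gets the same reply, which is what makes the policy deterministic where live approval is not available.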

Per-action live HITL (approve each individual edit/shell command as the agent works) is not available — it would require pausing a blocking subprocess session mid-turn. Use permissions: readonly/allowlist for deterministic per-action control. With no container isolation, the workdir is the sandbox: scope it to a throwaway checkout (or a disposable git worktree) for untrusted runs.

Environment

The subprocess inherits protoAgent's environment (plus any per-delegate env), minus the nested-Claude markers (CLAUDECODE / CLAUDE_CODE_*) — see the caveat above. Run protoAgent under an account whose ambient credentials you're willing to lend the coding agent, or scope the workdir to a throwaway checkout.

In a container, wired to a gateway

The setup above assumes the coder binary is already on PATH. That holds for a local run, but a containerized protoAgent starts from a bare image with no coder and no model credentials. Add two things to your deploy. Both belong in your own Dockerfile and entrypoint, not in the template; COPY . /opt/protoagent/ already ships your config:

1. Bake the coder into the image. For proto (a Node CLI), that's Node + one npm i -g; the other adapters install the same way (@agentclientprotocol/claude-agent-acp, @zed-industries/codex-acp, …):

dockerfile

ARG PROTOCLI_VERSION=latest
RUN curl -fsSL https://deb.nodesource.com/setup_20.x | bash - \
    && apt-get install -y --no-install-recommends nodejs \
    && rm -rf /var/lib/apt/lists/* \
    && npm install -g "@protolabsai/proto@${PROTOCLI_VERSION}" \
    && proto --version   # fail the build if it didn't land

2. Point the coder at your gateway, not a cloud key. A CLI coder normally wants its own provider API key. To reuse the same OpenAI-compatible gateway protoAgent already uses — one key, one bill, local models available — write the coder's config at entrypoint rather than baking it: the sandbox $HOME is typically a tmpfs mount that would shadow a baked file, and writing at start keeps it idempotent and env-tunable. proto reads ~/.proto/settings.json; the shape differs per CLI but the idea is the same (base URL → your gateway, key from an env var):

# entrypoint.sh — before `exec … python -m server`
if command -v proto >/dev/null 2>&1; then
    GATEWAY_URL="${CODER_GATEWAY_URL:-http://gateway:4000/v1}"
    mkdir -p "$HOME/.proto"
    cat > "$HOME/.proto/settings.json" <<JSON
{ "modelProviders": { "openai": [
    { "id": "my/coder-model", "baseUrl": "${GATEWAY_URL}", "envKey": "OPENAI_API_KEY" }
  ] },
  "security": { "auth": { "selectedType": "openai" } },
  "model": { "name": "my/coder-model" } }
JSON
fi

Because the ACP child inherits protoAgent's environment (see above), OPENAI_API_KEY — the gateway key protoAgent already has — flows straight through, and so does anything else the coder needs from its shell (e.g. a GH_TOKEN for git push / gh pr create from its workdir — run by the coder itself in the default mode, or by the framework's git harness under manage_git: true, §Managed git below). No second secret store.

The workdir still has to be a real, writable checkout of the repo the coder edits; provision it however you like (bake a clone, or git clone it at entrypoint with a token). To keep the token out of the persisted .git/config, set it through a global url.<base>.insteadOf rewrite in the tmpfs $HOME, with https://x-access-token:$TOK@github.com/ as the <base>: it is written fresh each boot and never stored in the volume.

Parallel builds: a worktree-backed coder pool

One coder in one workdir is sequential: a second delegate_to into the same directory while the first is mid-edit will collide (shared working tree + index + branch). An orchestrator that wants to build several independent things at once (a lead fanning issues out to a crew) needs each concurrent coder in its own working tree.

The clean way is a pool of coders over git worktrees: linked worktrees share one clone's .git object store but have an isolated working dir, index, and checked-out branch — exactly the isolation concurrent coders need, without N full clones.

1. Provision the worktrees at entrypoint (cap N = your concurrency budget):

git clone https://github.com/you/repo /work/repo         # the base clone (on main)
for i in $(seq 1 "${CODER_POOL:-3}"); do
    git -C /work/repo worktree add --force -B "pool-$i" "/work/wt-$i" origin/main
done
# recreate them fresh each boot — worktrees hold no state you keep (coders push to origin)

2. Declare one coder per worktree — same binary, distinct workdir:

yaml

delegates:
  - { name: coder-1, type: acp, command: proto, args: ["--acp"], workdir: /work/wt-1, manage_git: true }
  - { name: coder-2, type: acp, command: proto, args: ["--acp"], workdir: /work/wt-2, manage_git: true }
  - { name: coder-3, type: acp, command: proto, args: ["--acp"], workdir: /work/wt-3, manage_git: true }

3. Fan out. The agent issues several delegate_to(coder-N, …) calls in one turn (the tool node runs a turn's tool calls concurrently), or several delegate_to(…, background=True) calls — each lands on a free coder in its own worktree. The pool size is the cap; extra work queues.

Two caveats worth planning for: worktrees don't share node_modules/build caches (install per-worktree, or share a package store), and two parallel PRs that touch the same file will conflict at merge time (normal parallel-dev friction — rebase the loser), not at build time.
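The "pool size is the cap; extra work queues" behaviour from step 3 can be sketched with a queue of free coder names. This is an illustration under assumptions, not the framework's scheduler: fan_out is a hypothetical helper, and dispatch stands in for whatever coroutine actually calls delegate_to.

```python
import asyncio
from typing import Awaitable, Callable, List


async def fan_out(coders: List[str], jobs: List[str],
                  dispatch: Callable[[str, str], Awaitable[str]]) -> List[str]:
    """Run `jobs` over a fixed pool of coders; extra jobs wait for a free one."""
    free: asyncio.Queue = asyncio.Queue()
    for name in coders:
        free.put_nowait(name)

    async def run(job: str) -> str:
        coder = await free.get()        # waits while every coder is busy
        try:
            return await dispatch(coder, job)
        finally:
            free.put_nowait(coder)      # hand the worktree back to the pool

    # gather preserves job order in its results, whichever coder ran each job.
    return list(await asyncio.gather(*(run(job) for job in jobs)))
```

Returning the coder in a finally block matters: a failed dispatch must not shrink the pool for the jobs still waiting.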

Managed git: the framework owns branch/commit/push/PR (ADR 0076)

By default the coder owns its own git lifecycle — fine for a single supervised coder in a disposable checkout. At pool scale it is the reliability ceiling: coders invent colliding branch names (linked worktrees refuse the same branch twice), report "done" without ever pushing, open duplicate PRs when one item is fanned to several coders, and git add -A their scratch into the diff. Every one of those is a deterministic step an LLM was asked to perform.

manage_git: true on an acp delegate moves the whole lifecycle into the framework (plugins/coding_agent/git_harness.py); the coder is told to edit files and run tests only. Per dispatch, the harness:

- derives a stable work-item id (delegate_to(…, item_id="issue-42"), or a hash of the query text when omitted) and claims it: a second dispatch of an in-flight item (any coder) is refused instead of duplicated, and an already-open PR for the item's branch short-circuits before the coder even runs;
- mints the branch deterministically (<branch_prefix>/<slug>-<id7>, prefix defaults to the delegate name) and cuts it from fresh origin/<base_branch>, never local HEAD;
- after the coder finishes: refuses to commit on the base branch (work stays recoverable in the worktree, with no completion theater), scans the diff for secrets, commits on the coder's behalf, rebases onto fresh base (a conflict is reported and pushed as-is, not fatal), pushes with --force-with-lease, verifies the remote SHA actually moved, and opens the PR idempotently (re-runs reuse the existing PR).

The lifecycle tolerates a coder that did partial git anyway (its commits are adopted, not duplicated), and the run's outcome (branch, verified push, PR URL, or the exact reason nothing was published) is appended to the coder's reply.
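To make the deterministic branch name concrete, here is a sketch of how <branch_prefix>/<slug>-<id7> could be derived. The hash function, the slug source (the query text), and the slug length cap are all assumptions; work_item_id and branch_name are hypothetical helpers, not the harness's API.

```python
import hashlib
import re
from typing import Optional


def work_item_id(query: str, item_id: Optional[str] = None) -> str:
    """An explicit item_id wins; otherwise derive a stable id from the query text."""
    return item_id or hashlib.sha256(query.encode("utf-8")).hexdigest()


def branch_name(prefix: str, query: str,
                item_id: Optional[str] = None, slug_len: int = 40) -> str:
    """Build <prefix>/<slug>-<id7> from the inputs alone, with no randomness."""
    item = work_item_id(query, item_id)
    # id7: seven hex characters of a hash of the item id (hash choice assumed).
    id7 = hashlib.sha256(item.encode("utf-8")).hexdigest()[:7]
    # Slug: lowercase the query and collapse non-alphanumeric runs to hyphens.
    slug = re.sub(r"[^a-z0-9]+", "-", query.lower()).strip("-")[:slug_len].strip("-")
    return f"{prefix}/{slug or 'work'}-{id7}"
```

Since the name is a pure function of the prefix and the work item, a re-dispatch of the same item lands on the same branch, which is what lets the harness find an existing PR instead of opening a second one.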

yaml

delegates:
  - name: coder-1
    type: acp
    command: proto
    args: ["--acp"]
    workdir: /work/wt-1
    manage_git: true       # framework-owned git lifecycle
    base_branch: main      # branches cut from origin/<base>; PRs target it
    # branch_prefix: wt-1  # optional; defaults to the delegate name

The PR step needs gh on PATH and a GH_TOKEN/GITHUB_TOKEN (the same container env as above). Without them the branch is still pushed and verified — the reply just reports the PR step's failure instead of a URL.

How it works

delegate_to(target="proto", query=…)
  → AcpAdapter.dispatch (plugins/delegates/adapters.py)
      → AcpClient (plugins/coding_agent/acp_client.py)
          → spawn `command args` in workdir, JSON-RPC 2.0 over its stdio:
            initialize → session/load(saved id) or session/new(cwd) → session/prompt(query)
          ← session/update {agent_message_chunk}  → accumulated into the answer
          ← session/update {agent_thought_chunk}  → surfaced as the reasoning trace
          ← session/update {tool_call, title}       → narrated (logged)
          ← session/request_permission              → answered by the policy
  → returns the agent's final message text
            … session/cancel on abort · session/close on teardown

One AcpClient (subprocess + session) is cached per launch+policy signature (the key includes workdir), so follow-up calls reuse the session. A caller that dispatches into a transient, per-call workdir (for example, using dataclasses.replace to point a delegate at a disposable git worktree) should call AcpAdapter.teardown(d) in a finally block to reap that worktree's subprocess; a plain cache drop forgets the handle but leaves the process alive.
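The teardown-in-finally pattern for a per-call workdir looks roughly like this. Delegate and Adapter are hypothetical stand-ins: the real delegate config has more fields, and the actual dispatch/teardown signatures of AcpAdapter (including whether they are async) are assumed here.

```python
import dataclasses
from typing import Protocol


@dataclasses.dataclass(frozen=True)
class Delegate:                 # hypothetical stand-in for the delegate config
    name: str
    command: str
    workdir: str


class Adapter(Protocol):        # assumed shape of the adapter surface
    def dispatch(self, delegate: Delegate, query: str) -> str: ...
    def teardown(self, delegate: Delegate) -> None: ...


def dispatch_in_worktree(adapter: Adapter, base: Delegate,
                         worktree: str, query: str) -> str:
    """Dispatch into a per-call workdir and always reap its subprocess."""
    scoped = dataclasses.replace(base, workdir=worktree)  # workdir-scoped copy
    try:
        return adapter.dispatch(scoped, query)
    finally:
        # Runs on success and on failure, so the worktree's process never leaks.
        adapter.teardown(scoped)
```

The base delegate is left untouched, so its cached long-lived session keeps serving ordinary delegate_to calls.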

Sessions survive a restart

The sessionId is persisted per launch signature (under ~/.protoagent/acp_sessions/). On the next start, if the agent advertises the ACP loadSession capability, the client issues session/load for the saved thread (silently replaying its history to reattach) instead of starting fresh, so a crash, a CI bounce, or a re-dispatch continues the same coding thread rather than losing its context. A stale or unknown id falls back to a fresh session/new. The ACP protocolVersion is negotiated at initialize; the client closes the connection if the agent counters with a version it doesn't speak.
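The load-or-new decision can be sketched as follows. The rpc callable and RpcError are hypothetical, and the params (sessionId, cwd, mcpServers) follow the public ACP schema as an assumption rather than protoAgent's client code.

```python
from typing import Any, Callable, Dict, Optional


class RpcError(Exception):
    """Hypothetical error raised when the agent rejects a request."""


# Hypothetical transport: send one request, return its result object.
Rpc = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def open_session(rpc: Rpc, agent_capabilities: Dict[str, Any],
                 cwd: str, saved_id: Optional[str]) -> str:
    """Reattach to `saved_id` when the agent supports it, else start fresh."""
    if saved_id and agent_capabilities.get("loadSession"):
        try:
            rpc("session/load", {"sessionId": saved_id, "cwd": cwd, "mcpServers": []})
            return saved_id
        except RpcError:
            pass  # stale or unknown id: fall through to a new session
    return rpc("session/new", {"cwd": cwd, "mcpServers": []})["sessionId"]
```

Whatever id comes back is the one to persist for the next start.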

Eval it

A gated eval case (acp_delegation) verifies end-to-end delegation against a live agent. It's skipped unless you opt in — configure an acp delegate, then:

bash

export EVAL_CODING_AGENT=1
python -m evals.runner --tasks acp_delegation

It drives a real A2A turn that asks the agent to use delegate_to, and asserts (via the audit channel) that the tool fired. Without EVAL_CODING_AGENT set it SKIPs, so it never breaks the default board. See Eval your fork.

See Delegates for the registry + panel, Plugins for the plugin model, and ADR 0024 / ADR 0025 for the design rationale.
