py-agent-ctrl

Unified Python bridge for CLI coding agents.

This repo exposes:

a Python API under py_agent_ctrl
a unified ctrlagent CLI
direct CLI bridges for claude, codex, opencode, pi, and gemini

Scope and Non-goals

py-agent-ctrl is a Python-first control facade for existing CLI coding agents. It does not implement Agent Client Protocol (ACP), expose an ACP agent/client runtime, replace the provider CLIs, or try to be a general-purpose agent framework. ACP remains useful as a design reference for structured events, capabilities, permissions, cancellation, and hardening, but the runtime contract here is direct CLI execution through installed provider tools.

The codebase follows the clean layout described in docs/dev/architecture.md:

apps/ for runnable shells
libs/ for importable code
resources/ for passive assets
docs/ for documentation
tests/ for unit, integration, feature, and regression coverage

Development

Use uv only:

uv sync --extra dev
uv run python -m ruff format --check .
uv run python -m ruff check .
uv run python -m mypy libs/py_agent_ctrl apps
uv run python -m pytest -q -m "not live"
uv run python -m pytest -m "not live" --cov=py_agent_ctrl --cov-report=term-missing
uv run python -m compileall -q apps libs tests
uv build --wheel
uv run python -m py_agent_ctrl.cli agents list

Prefer uv run python -m ... for Python tools. That form avoids stale virtualenv console-script shebangs and matches the optional workflow command shape. See docs/dev/quality.md for the full local quality lane, including the isolated wheel install smoke. Use docs/dev/release.md before publishing, tagging, or treating a revision as release-ready.

Python API

The execution and streaming examples below invoke external provider CLIs. The selected CLI must be installed and authenticated before running them.

Execute (blocking)

from py_agent_ctrl import AgentCtrl

response = (
    AgentCtrl.claude_code()
    .with_model("claude-sonnet-4-5")
    .with_permission_mode("bypassPermissions")
    .execute("Summarize this repository.")
)

print(response.text)
print(response.session_id)
print(response.usage)       # TokenUsage with input/output/cache tokens
print(response.tool_calls)  # list[ToolCall]

Stream (real-time)

from py_agent_ctrl import AgentCtrl, AgentReasoningEvent, AgentTextEvent, AgentToolCallEvent

result = AgentCtrl.gemini().yolo().stream("Explain the architecture.")

for event in result:
    if isinstance(event, AgentTextEvent):
        print(event.text, end="", flush=True)
    elif isinstance(event, AgentReasoningEvent):
        print(f"\n[reasoning] {event.text}")
    elif isinstance(event, AgentToolCallEvent):
        print(f"\n[tool: {event.tool_call.name}]")

print(f"\nexit code: {result.exit_code}")

Streams may also emit richer normalized events such as AgentPlanUpdateEvent, AgentUsageEvent, AgentWarningEvent, and AgentFileChangeEvent when a provider exposes that information.

Other bridges

from py_agent_ctrl import AgentCtrl

AgentCtrl.make("codex").execute("Review the tests.")
AgentCtrl.codex().with_sandbox("workspace-write").execute("Review the tests.")
AgentCtrl.open_code().with_agent("coder").execute("Refactor the payment flow.")
AgentCtrl.pi().with_thinking("high").execute("Create an implementation plan.")
AgentCtrl.gemini().plan_mode().execute("Inspect the architecture.")

Callbacks and wiretaps

from py_agent_ctrl import AgentCtrl

response = (
    AgentCtrl.codex()
    .on_text(lambda text: print(text, end=""))
    .on_tool_call(lambda tool_call: print(f"\n[tool: {tool_call.name}]"))
    .on_complete(lambda response: print(f"\nexit code: {response.exit_code}"))
    .execute("Summarize this repository.")
)

For streaming responses, on_text receives deduplicated text deltas. The underlying stream(...) iterator still yields the original normalized events, and on_event sees those original events before text-specific filtering.

Error handling

from py_agent_ctrl import AgentCtrl, BinaryNotFoundError, ProcessTimeoutError

try:
    response = AgentCtrl.claude_code().execute("Hello.")
except BinaryNotFoundError as e:
    print(f"Install the agent first: {e.install_hint}")

The core error taxonomy also includes AgentExecutionError, WorkingDirectoryNotFoundError, ProcessStartError, ProcessTimeoutError, ProcessFailedError, JsonDecodeFailureError, and ProviderParseFailureError. Current bridge execution returns normalized AgentResponse / ProcessOutput diagnostics where possible and keeps BinaryNotFoundError for binary preflight failures.

CLI

The execute, stream, resume, and continue commands invoke external provider CLIs. The selected CLI must be installed and authenticated before running those commands. The agents list and agents capabilities commands are safe local metadata checks.

uv run ctrlagent agents list
uv run ctrlagent agents capabilities --agent claude-code
uv run ctrlagent execute --agent claude-code "Summarize this repository."
uv run ctrlagent stream --agent gemini "Explain the architecture."
uv run ctrlagent resume --agent codex --session thread_123 "Continue."
uv run ctrlagent continue --agent gemini "Proceed."

Supported Agents

Agent	Python Facade	CLI Name
Claude Code	`AgentCtrl.claude_code()`	`claude-code`
Codex	`AgentCtrl.codex()`	`codex`
OpenCode	`AgentCtrl.opencode()` / `AgentCtrl.open_code()`	`opencode`
Pi	`AgentCtrl.pi()`	`pi`
Gemini	`AgentCtrl.gemini()`	`gemini`

Migration

This repo is a clean cut from the old flat Claude-only layout.

old root package: agent_ctrl
new root package: py_agent_ctrl
old client: ClaudeCodeClient
new entrypoint: AgentCtrl

See:

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.beads		.beads
.github/workflows		.github/workflows
apps/cli		apps/cli
docs		docs
libs/py_agent_ctrl		libs/py_agent_ctrl
research		research
resources/skills/upgrade		resources/skills/upgrade
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

py-agent-ctrl

Scope and Non-goals

Development

Python API

Execute (blocking)

Stream (real-time)

Other bridges

Callbacks and wiretaps

Error handling

CLI

Supported Agents

Migration

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

py-agent-ctrl

Scope and Non-goals

Development

Python API

Execute (blocking)

Stream (real-time)

Other bridges

Callbacks and wiretaps

Error handling

CLI

Supported Agents

Migration

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages