- Python SDK: instruments agent actions, enforces local and synced policies, writes the local ledger, and optionally uploads telemetry.
- Platform API: ingests telemetry, stores runs/actions/policies, serves governance APIs, and broadcasts real-time events.
- Web Console: gives operators visibility and control over runs, interventions, approvals, policies, replay, analytics, and agents.
Repository Map
Runtime Flow
Core Boundaries
SDK
The SDK owns runtime instrumentation and local enforcement:@guarded_actionand integration adapters wrap actions.- Policy checks happen before action execution.
- Ledger writes happen after execution.
- Telemetry failures should not crash user code.
- Policy violations intentionally block execution.
- Remote sync is optional and backgrounded.
remote, llm, integrations, or all.
Platform API
The platform owns shared state and governance workflows:- Ingests runs, actions, interventions, replay summaries, and compliance metadata.
- Stores policy state and exposes policy sync APIs to SDK clients.
- Serves approval, ledger, agent, analytics, replay, MCP, and compliance APIs.
- Emits WebSocket events for the console.
- Applies authentication and tenant boundaries before returning organization data.
Console
The console owns operator workflows:- Explore runs and actions.
- Review interventions and approval requests.
- Manage API keys, policies, integrations, and settings.
- Trigger and inspect replay/determinism workflows.
- Surface analytics, agents, and compliance/audit state.
docs/console/. API behavior belongs in docs/platform/.
Data Model
The durable product model is:- Organization / tenant: isolation boundary for users, agents, policies, and telemetry.
- Agent: logical autonomous system or integration source.
- Run: one execution session for an agent.
- Action: one instrumented tool/function/LLM call within a run.
- Policy: organization-level control rules synced to SDK clients.
- Intervention: a policy or guardrail decision that blocked, required approval, or otherwise changed execution.
- Approval: human decision record for sensitive actions.
- Ledger entry: audit representation of action execution and metadata.
- Replay summary: local replay outcome, divergence data, determinism score, and estimated savings.
Design Principles
- Runtime authority first: Agent Sentinel decides whether an autonomous action may proceed, not merely whether it should be logged.
- Local-first SDK, platform-backed governance: agents can write local ledgers and enforce local policy; organization policy and audit state are centralized in the platform.
- Fail-open telemetry, fail-closed enforcement: logging/sync failures should not crash agents, but policy violations block by design.
- Explicit tenant boundaries: platform and console behavior must preserve organization isolation.
- Evidence over claims: public docs and marketing claims must match shipped behavior or clearly label roadmap items.
Where Architecture Changes Go
Update this page when changing the product shape, runtime flow, service boundaries, or durable data model. Do not create a second architecture document. If a proposal needs temporary detail, put it underdocs/archive/ after the decision is made and fold the final decision back here.