Agent Sentinel - Agent Sentinel

Agent Sentinel helps you observe, control cost, and enforce safety policies across AI agent runs—without breaking your agent when telemetry fails.

What you get

Python SDK

Instrumentation: @guarded_action decorator for automatic telemetry
Policy enforcement: Budgets, allow/deny lists, rate limits
Guardrails: Built-in PII detection, content moderation (incl. Gemini semantic), loop protection, idempotency replay
Human-in-the-loop: Approval workflows for sensitive actions
Intervention tracking: See what Sentinel blocks — with Gemini-enriched explanations
Remote sync: Background sync to platform
Replay mode: Zero-cost deterministic replay for debugging
LLM integrations: Transparent instrumentation for OpenAI, Anthropic, Grok, Gemini
Framework support: Native LangChain, CrewAI, AutoGen, and LangGraph integrations
MCP client: Model Context Protocol for LLM-platform interaction
Error handling: Comprehensive error types with recovery strategies

Platform

Agent discovery: Automatic registration and metrics
Interventions: Track policy enforcement, with Gemini-enriched reasons and remediation hints
Approvals: Human-in-the-loop approval workflows
Kill switch: Emergency org-scoped halt for an agent / run / mission
Prose policies: Compile English / YAML / JSON to the policy IR via POST /policies/compile
Policy versioning: Immutable version history, side-by-side diff, one-click rollback
Evals & benchmarks: Submit benchmark results, compare runs for regressions, generate adversarial scenarios
Activity Ledger: Complete audit trail with advanced filtering
WebSocket real-time: Live event streaming
MCP endpoints: LLM-native API for autonomous operations
Stats & analytics: Cost tracking and performance metrics
Replay analysis: Org-wide local-replay history, determinism scoring, divergence detection
Compliance: EU AI Act Article 14 support

Web Console

Real-time dashboard: Live monitoring with WebSocket updates
Interventions: Typed guardrail panels (PII / moderation / loop / idempotency) with enriched reasons
Approvals inbox: Review and approve sensitive actions
Kill Switch (/kill-switch): Emergency controls promoted to top-level
Runs explorer: Execution history with replay simulation
Replay Hub (/replay): Org-wide ledger of local-replay submissions and savings
Activity Ledger: Complete audit trail with export
Agent management: Fleet overview with performance metrics
Evals (/evals): Benchmark scorecards, regression diff, scenario generator (Gemini-powered)
Analytics: Cost dashboards, per-action token + latency drill-down
MCP Explorer (/mcp): Browse the platform’s tools / resources / prompts
Settings: Policy builder with prose authoring, version diff/rollback, evidence DAG viewer

Quickstart

Get started in 10 minutes - instrument your agent and sync to the platform.

Python SDK

Full SDK documentation - instrumentation, policies, approvals, LLM integrations.

Platform API

Platform documentation - agents, interventions, approvals, real-time updates.

Web Console

Web UI guide - dashboard, interventions, approvals, runs, analytics.

How Agent Sentinel fits into your agent

At runtime your agent code calls tools / functions. Agent Sentinel wraps those calls:

Before execution: policy engine checks allow/deny lists, budgets, and rate limits.
After execution: a ledger entry is appended (inputs, outputs, cost, duration, outcome).
Optional: a background sync thread batches ledger entries to the platform (/api/v1/ingest/).

┌────────────────────────────┐
│ Your agent code            │
│  @guarded_action(...)      │
└──────────────┬─────────────┘
               │ (policy check + timing)
               ▼
┌────────────────────────────┐
│ Local ledger (.jsonl)      │
│  .agent-sentinel/ledger…   │
└──────────────┬─────────────┘
               │ (optional background sync)
               ▼
┌────────────────────────────┐
│ Platform                   │
│  POST /api/v1/ingest/      │
└──────────────┬─────────────┘
               ▼
┌────────────────────────────┐
│ Postgres (runs + actions)  │
└────────────────────────────┘

Key features

Guardrails

PII, moderation, loop protection, idempotency — first-class runtime primitives.

Interventions

Track what Agent Sentinel blocks, with Gemini-enriched reasons.

Prose policies

Compile plain English, YAML, or JSON into the policy IR.

Kill switch

Emergency org-scoped halt for any agent, run, or mission.

Evals & benchmarks

CI-gated scorecards, regression tracking, Gemini scenario generation.

Replay hub

Org-wide history of local-replay submissions and dollars saved.

Core concepts

Actions & runs

The core data model: runs contain actions; actions have cost and outcome.

Policies

Budgets, allow/deny lists, rate limits, and remote policy sync.

Activity Ledger

Complete audit trail with compliance metadata and export.

Agent Sentinel is fail-open for telemetry (ledger writes and sync should never crash your agent), but policy enforcement is fail-closed by design (violations raise).

​What you get

​Python SDK

​Platform

​Web Console

Quickstart

Python SDK

Platform API

Web Console

​How Agent Sentinel fits into your agent

​Key features

Guardrails

Interventions

Prose policies

Kill switch

Evals & benchmarks

Replay hub

​Core concepts

Actions & runs

Policies

Activity Ledger

What you get

Python SDK

Platform

Web Console

How Agent Sentinel fits into your agent

Key features

Core concepts