Guardrails
For creating custom guardrails, see Creating Custom Guardrails. For the underlying safety primitives, see Safety Primitives.
Guardrails Architecture
AgentOS guardrails internals — input/output dispatcher, ALLOW/SANITIZE/BLOCK/FLAG verdicts, two-phase scanning, fail-open and fail-closed semantics
Creating Custom Guardrails
This guide walks you through everything you need to create, package, test, and deploy a custom guardrail for AgentOS. By the end you will understand the full lifecycle -- from implementing the IGuardrailService interface to publishing a self-contained extension pack.
Safety Primitives
Six operational safety primitives that wrap every AgentOS LLM call: killswitch, cost guard, circuit breaker, stuck detection, action audit log. Prevent runaway loops, money fires, and zombie agents — independently or as one guard chain via wrapLLMCallback().
Sandbox Security
This page is sourced from the monorepo and is not available in this build.