Code Safety

Regex-based OWASP Top 10 code safety scanning for LLM-generated code. Detects insecure patterns in streaming markdown output and tool call arguments using language-aware rule sets. Pure regex -- no models, no dependencies, less than 1ms per scan.

Package: @framers/agentos-ext-code-safety

Overview

The Code Safety extension provides two modes of operation:

Passive protection via a built-in guardrail that automatically scans code blocks in agent output and intercepts tool calls with code arguments
Active capability via an agent-callable tool (scan_code) for on-demand code safety scanning

It detects:

Injection -- eval(), exec(), os.system(), dynamic code execution
SQL injection -- string concatenation in SQL queries, format-string injection
XSS -- innerHTML, document.write, dangerouslySetInnerHTML
Path traversal -- unsanitized ../ in file paths, user-input path construction
Hardcoded secrets -- AWS keys, passwords, tokens, private keys in code
Weak cryptography -- MD5/SHA1 for passwords, Math.random() for security
Insecure deserialization -- pickle.loads(), yaml.load() without SafeLoader
SSRF -- unvalidated URL construction from user input
Incorrect permissions -- chmod 777, world-writable files

All detection is stateless and synchronous via compiled regex patterns. No ML models, no network calls, no external dependencies.

Installation

npm install @framers/agentos-ext-code-safety

No additional dependencies required.

Usage

Direct factory usage

import { AgentOS } from '@framers/agentos';
import { createCodeSafetyPack } from '@framers/agentos-ext-code-safety';

const codeSafetyPack = createCodeSafetyPack({
  severityActions: {
    critical: 'block',
    high: 'block',
    medium: 'flag',
    low: 'flag',
  },
});

const agent = new AgentOS();
await agent.initialize({
  ...config,
  manifest: { packs: [{ factory: () => codeSafetyPack }] },
});

Manifest-based loading

await agent.initialize({
  manifest: {
    packs: [
      {
        package: '@framers/agentos-ext-code-safety',
        options: {
          disabledRules: ['weak-crypto-hash'], // allow MD5 for non-security use
          guardrailScope: 'output',
        },
      },
    ],
  },
});

Language-Aware Patterns

Rules have per-language regex variants. The universal '*' key matches in any language. Language-specific keys add patterns that only apply when that language is detected.

When scanning Python code, both '*' and 'python' patterns run. When the language cannot be determined, only universal '*' patterns apply.

Supported Languages

Python, JavaScript/TypeScript, SQL, Ruby, PHP, Bash, Go, Java.

Language Detection

Fence tag -- extracted from markdown code fence (e.g., ```python)
Keyword heuristics -- when no fence tag is present, detected via patterns:
- Python: def/import/from...import
- JavaScript/TypeScript: function/const/=>/require
- SQL: SELECT/INSERT/CREATE TABLE
- Ruby: class...< /.each do/puts
- PHP: <?php/$variable =
- Bash: shebang/if [ ... ]
- Go: func/package/import (
- Java: public class/System.out/import java

Code Fence Extraction

The guardrail extracts code blocks from markdown text using a lightweight fence parser:

Here is some code:

` ` `python
import os
os.system("rm -rf /")
` ` `

This text is not scanned.

Only content inside triple-backtick fences is scanned. Non-code text passes through with zero overhead.

Default Rules (25 rules)

Injection (5 rules)

Rule ID	Severity	What It Detects
`code-injection-eval`	critical	`eval()`, `exec()`, `compile()`, `new Function()`, `setTimeout("...")`
`command-injection-system`	critical	`os.system()`, `subprocess.call(shell=True)`, `system()`
`command-injection-backtick`	high	Backtick execution in Ruby/Perl/Bash, `$(...)` command substitution
`code-injection-template`	medium	Template literals inside eval/Function, f-strings in exec
`unsafe-reflection`	medium	`Class.forName(userInput)`, `getattr(obj, userInput)`

SQL Injection (3 rules)

Rule ID	Severity	What It Detects
`sql-injection-concat`	critical	String concatenation in SQL: `"SELECT * FROM " + var`, f-strings, template literals
`sql-injection-format`	high	`.format()` or `%s` in SQL strings
`sql-injection-keywords`	high	`' OR 1=1`, `UNION SELECT`, `; DROP TABLE` in string literals

XSS (3 rules)

Rule ID	Severity	What It Detects
`xss-innerhtml`	high	`.innerHTML =`, `v-html=`, `[innerHTML]=`
`xss-document-write`	high	`document.write()`, `document.writeln()`
`xss-dangerously-set`	medium	`dangerouslySetInnerHTML` (flagged for review, not blocked)

Path Traversal (2 rules)

Rule ID	Severity	What It Detects
`path-traversal-dotdot`	high	`../` repeated 2+ times in string literals, `..\\`
`path-traversal-user-input`	medium	`path.join(userInput`, `os.path.join(request.`, `open(user`

Secrets (4 rules)

| Rule ID | Severity | What It Detects | | ----------------------- | -------- | ------------------------------------------------------------------ | --- | ---------------------- | | hardcoded-aws-key | critical | AKIA[0-9A-Z]{16} pattern | | hardcoded-password | high | password = "...", passwd =, secret = "..." | | hardcoded-token | high | Bearer eyJ, token = "eyJ...", api_key = "..." with 20+ chars | | hardcoded-private-key | critical | -----BEGIN (RSA | EC | DSA)?PRIVATE KEY----- |

Crypto (2 rules)

Rule ID	Severity	What It Detects
`weak-crypto-hash`	medium	`md5()`, `sha1()`, `hashlib.md5`, `hashlib.sha1` in password context
`insecure-random`	medium	`Math.random()` in security context, `random.random()` for tokens

Deserialization (2 rules)

Rule ID	Severity	What It Detects
`insecure-pickle`	critical	`pickle.loads()`, `pickle.load()`, `cPickle.loads()`
`insecure-yaml`	high	`yaml.load()` without `Loader=SafeLoader`

SSRF (2 rules)

Rule ID	Severity	What It Detects
`ssrf-unvalidated-url`	high	`fetch(userInput)`, `requests.get(user_`, `urllib.request.urlopen(user`
`ssrf-redirect-follow`	medium	`redirect: 'follow'` with dynamic URL

Permissions (2 rules)

Rule ID	Severity	What It Detects
`world-writable`	high	`chmod 777`, `os.chmod(path, 0o777)`
`insecure-tmp`	low	`open('/tmp/...)` without `tempfile` or `mkstemp`

Tool Call Scanning

The guardrail intercepts TOOL_CALL_REQUEST chunks for code-executing tools. When a tool call targets a registered code-executing tool, the code argument is extracted and scanned before execution.

Default Tool Mapping

Tool Name	Argument Key	Default Language
`shell_execute`	`command`	`bash`
`run_sql`	`query`	`sql`
`write_file`	`content`	auto-detect
`create_file`	`content`	auto-detect
`edit_file`	`content`	auto-detect

Custom Tool Mapping

const pack = createCodeSafetyPack({
  codeExecutingTools: ['shell_execute', 'run_sql', 'my_custom_tool'],
  codeArgumentMapping: {
    my_custom_tool: { argKey: 'script', language: 'python' },
  },
});

Custom Rules

Add custom rules using the ICodeSafetyRule interface:

interface ICodeSafetyRule {
  id: string;
  name: string;
  description: string;
  category: CodeSafetyCategory;
  severity: 'critical' | 'high' | 'medium' | 'low';
  patterns: Record<string, RegExp[]>; // '*' = universal, 'python' = Python-only, etc.
  action?: 'flag' | 'block'; // override severity-to-action mapping
}

Example:

const pack = createCodeSafetyPack({
  customRules: [
    {
      id: 'my-custom-rule',
      name: 'Unsafe Database Connection',
      description: 'Detects hardcoded database connection strings',
      category: 'secrets',
      severity: 'high',
      patterns: {
        '*': [/mongodb:\/\/[^:]+:[^@]+@/i, /postgres:\/\/[^:]+:[^@]+@/i],
      },
    },
  ],
});

Configuration

`CodeSafetyPackOptions`

Option	Type	Default	Description
`customRules`	`ICodeSafetyRule[]`	`[]`	Custom rules to add alongside defaults.
`includeDefaultRules`	`boolean`	`true`	Whether to include the default ~25 rules. Set to `false` for custom-only.
`disabledRules`	`string[]`	`[]`	Rule IDs to disable from the default set.
`severityActions`	`Partial<Record<severity, action>>`	`{ critical: 'block', high: 'block', medium: 'flag', low: 'flag' }`	Action mapping by severity.
`codeExecutingTools`	`string[]`	`['shell_execute', 'run_sql', 'write_file', 'create_file', 'edit_file']`	Tools whose arguments should be scanned.
`codeArgumentMapping`	`Record<string, { argKey, language }>`	see defaults above	Maps tool names to code argument key and language.
`guardrailScope`	`'input' \| 'output' \| 'both'`	`'output'`	Defaults to output-only (scans agent responses and tool calls).

Agent Tools

`scan_code`

On-demand code safety scanning. Lets agents proactively check code before writing to files, executing, or presenting to users.

Agent: Let me check this code for security issues before writing it.
-> scan_code({ code: "eval(user_input)", language: "python" })
<- {
    safe: false,
    violations: [
      {
        ruleId: 'code-injection-eval',
        ruleName: 'Dynamic Code Execution',
        category: 'injection',
        severity: 'critical',
        matchedText: 'eval(user_input)',
        language: 'python',
        action: 'block'
      }
    ],
    blocksScanned: 1,
    summary: "1 critical issue: Dynamic Code Execution"
  }

Memory Impact

Component	Memory	When Loaded
Default rules (~25 compiled regexes)	~10KB	Pack load
Per-stream fence buffer	~1KB per active stream	First TEXT_DELTA
Total (100 streams)	~110KB	--

Negligible compared to PII (~115MB), ML classifiers (~98MB), and topicality (~1.7MB). The lightest guardrail in the suite.

Graceful Degradation

Condition	Behavior
No rules configured (empty set)	Guardrail is a no-op, returns null
Regex throws (malformed custom rule)	That rule skipped, warning logged, other rules continue
No code fences in text	Zero-cost passthrough (no scanning performed)
Unknown language	Only universal `'*'` patterns applied
Tool call for non-code-executing tool	Skipped (not in `codeExecutingTools` list)

Overview​

Installation​

Usage​

Direct factory usage​

Manifest-based loading​

Language-Aware Patterns​

Supported Languages​

Language Detection​

Code Fence Extraction​

Default Rules (25 rules)​

Injection (5 rules)​

SQL Injection (3 rules)​

XSS (3 rules)​

Path Traversal (2 rules)​

Secrets (4 rules)​

Crypto (2 rules)​

Deserialization (2 rules)​

SSRF (2 rules)​

Permissions (2 rules)​

Tool Call Scanning​

Default Tool Mapping​

Custom Tool Mapping​

Custom Rules​

Configuration​

CodeSafetyPackOptions​

Agent Tools​

scan_code​

Memory Impact​

Graceful Degradation​

Related Documentation​