awaf v1.0  ·  Open Specification

Production-ready
AI agents.
Measurably.

awaf is an open framework for evaluating AI agent architecture across 10 pillars. Score your agent. Find the gaps. Ship with confidence.

Read the Spec
pip install awaf Copied!
0 Not Ready High Risk Needs Work Near Ready Production Ready 100

10 Pillars. 3 Tiers.

A complete architectural model for agent systems, from foundational requirements to agent-native concerns that have no cloud equivalent. Read the intro post →

Foundation
Prerequisite

Foundation

Agents must own their domain end-to-end with independent tools, context, and data. A vertically sliced agent owns its domain: its tools, its context, its data.

0 – 100
FAIL < 40
Cloud WAF Adapted
1.0×

Operational Excellence

SLOs, playbooks, and postmortems. Determines whether the other pillars remain effective in production.

Security

Enforced in code, not prompts. Credentials must never enter the agent. Blast radius must be explicitly bounded.

Reliability

Designed for failure, not just uptime. Chain boundaries as fault domains. Fail-loud behavior and circuit breakers at the MCP layer. Checkpoint/resume for multi-step runs.

Performance Efficiency

Optimizes execution speed and resource usage across agent operations.

Cost Optimization

Tracks every token and tool call. Session budgets and loop detection from day one. Hard stop at 100% budget. Non-negotiable. Prevents solutions that cost more than the problems they solve.

Sustainability

Long-term viability and environmental considerations adapted from cloud WAF principles.

Agent-Native
1.5×

Reasoning Integrity

Addresses silent, confident failures — the worst failure type. Agents can hallucinate arguments, select wrong tools, or derail without visible errors. Requires evals covering tool selection, argument accuracy, and chain-of-thought faithfulness.

Controllability

Human control through code-level enforcement, not prompts. Any in-flight agent must be externally stoppable. Requires pause, notify, and resume/abort primitives.

Context Integrity

Manages agent perception of reality. Prevents stale context from corrupting reasoning. Requires external content sanitization through MCP and active lifecycle management for long sessions. The agent must understand its own knowledge limitations.

The Ecosystem

Spec-first. Multiple implementations. Community-owned.

awaf is community-owned.

The spec is open. Implementations are open.

If you build agents in production, your patterns belong here.