WSO2 API Platform Policy Hub

The Policy Hub is the official, curated collection of gateway policies for the WSO2 API Platform. It gives you a growing library of production-ready, versioned policy implementations that you can apply to any API or MCP server without writing custom code.

Policies are self-contained units of behavior that plug into the gateway's request and response pipeline. Rather than building each concern from scratch, you compose policies: stacking authentication on top of rate limiting on top of a guardrail, for example. If your requirements exceed what the hub provides, you can author and ship your own policies alongside hub ones.

The Policy Hub is powered by an open source repository on GitHub. Every policy is versioned independently, the full source and documentation live in the repo, and contributions are welcome. For more information about contributing policies, see the Policy Development Guide.

How Policies Work

A policy attaches either to an API as a whole, covering all operations, or to a specific operation, and runs in the request phase, the response phase, or both. Multiple policies chain together on the same API, each processing the message in sequence before handing it to the next. The gateway evaluates the chain at runtime.
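This request/response chaining can be sketched in a few lines of Python. The class and method names below are invented for illustration; they are not the platform's actual policy interface:

```python
# Minimal sketch of a gateway policy chain (hypothetical names,
# not the WSO2 API Platform's actual policy interface).

class Policy:
    def on_request(self, ctx):
        """Return None to continue the chain, or a dict to short-circuit."""
        return None

    def on_response(self, ctx):
        return None

class ApiKeyAuth(Policy):
    def __init__(self, valid_keys):
        self.valid_keys = valid_keys

    def on_request(self, ctx):
        if ctx.get("api_key") not in self.valid_keys:
            return {"status": 401, "body": "invalid API key"}

class SetHeader(Policy):
    def __init__(self, name, value):
        self.name, self.value = name, value

    def on_response(self, ctx):
        ctx.setdefault("headers", {})[self.name] = self.value

def run_chain(policies, ctx):
    # Request phase: the first policy to return a response short-circuits
    # the chain, so the request never reaches the backend.
    for p in policies:
        early = p.on_request(ctx)
        if early is not None:
            return early
    ctx["status"] = 200          # pretend the upstream answered
    for p in policies:           # response phase
        p.on_response(ctx)
    return ctx

chain = [ApiKeyAuth({"secret"}), SetHeader("X-Gateway", "policy-hub")]
print(run_chain(chain, {"api_key": "secret"})["status"])  # 200
print(run_chain(chain, {"api_key": "nope"})["status"])    # 401
```

The key property is that each policy is independent: the chain order, not the policy code, decides how concerns compose.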

For more information, see the API Platform Policies overview.

Policy Categories

Security

Authenticate and authorize traffic before it reaches your backend.

API Key Auth: Protects APIs with pre-shared API keys
Basic Auth: Enforces HTTP Basic Authentication
JWT Auth: Validates JWT access tokens against one or more JWKS providers
CORS: Handles preflight requests and injects CORS response headers
Subscription Validation: Confirms the caller holds an active subscription for the target API
Rate Limit — Basic: Caps requests per time window
Rate Limit — Advanced: Multi-dimensional quotas with GCRA or fixed-window algorithms, Redis backend support, and weighted limiting
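The GCRA option mentioned for the advanced rate limiter can be illustrated with a minimal in-memory limiter. This is a sketch of the algorithm itself, not the policy's configuration; the class and parameter names are invented:

```python
import time

class GCRALimiter:
    """Generic Cell Rate Algorithm (GCRA) sketch: admits `rate` requests
    per `period` seconds, allowing bursts of up to `burst` requests."""

    def __init__(self, rate, period, burst):
        self.interval = period / rate                  # emission interval T
        self.tolerance = self.interval * (burst - 1)   # burst tolerance tau
        self.tat = 0.0                                 # theoretical arrival time

    def allow(self, now=None):
        now = time.monotonic() if now is None else now
        tat = max(self.tat, now)
        if tat - now > self.tolerance:
            return False                               # over limit: reject
        self.tat = tat + self.interval                 # charge one emission slot
        return True

lim = GCRALimiter(rate=1, period=1.0, burst=3)
print([lim.allow(now=0.0) for _ in range(4)])  # [True, True, True, False]
```

GCRA needs only one stored timestamp per limited key, which is why it suits a shared backend such as Redis: the whole state fits in a single value that can be updated atomically.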

Guardrails

Control what goes into and out of AI models. Guardrail policies block unsafe prompts in the request pipeline before they reach a model. In the response pipeline, they filter unsafe or non-compliant output before it reaches the caller.

AWS Bedrock Guardrail: Validates content against AWS Bedrock Guardrails
Azure Content Safety: Screens content against Azure Content Safety API
Semantic Prompt Guard: Blocks or allows prompts based on semantic similarity to configured allow/deny phrases
Semantic Tool Filtering: Filters MCP tools to only those semantically relevant to the user query
PII Masking: Masks or redacts PII from request/response bodies using configurable regex patterns
JSON Schema Guardrail: Enforces a JSON Schema on request or response payloads
Regex Guardrail: Validates content against a regular expression
URL Guardrail: Validates URLs found in request or response bodies
Content Length Guardrail: Enforces byte-length limits on payloads
Word Count Guardrail: Enforces word-count limits on payloads
Sentence Count Guardrail: Enforces sentence-count limits on payloads
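Regex-based PII masking of the kind the PII Masking guardrail describes can be sketched as follows. The patterns here are deliberately simplistic examples; a production guardrail would use vetted patterns per PII type and locale:

```python
import re

# Illustrative patterns only, not a complete or locale-aware PII catalogue.
PII_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def mask_pii(text, replacement="*****"):
    """Replace every match of every configured pattern in the payload."""
    for pattern in PII_PATTERNS.values():
        text = pattern.sub(replacement, text)
    return text

print(mask_pii("Contact alice@example.com, SSN 123-45-6789"))
# Contact *****, SSN *****
```

Applied in the request pipeline this redacts PII before it reaches a model; in the response pipeline it scrubs model output before it reaches the caller.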

AI and LLM

Policies designed specifically for the characteristics of AI APIs — cost, tokens, latency, and prompt structure.

LLM Cost: Calculates the monetary cost of each LLM call and stores it for downstream policies
LLM Cost-Based Rate Limit: Enforces monetary budget quotas on LLM usage
Token-Based Rate Limit: Caps usage by token count rather than request count
Semantic Cache: Caches LLM responses using vector similarity, returning cached results for semantically equivalent prompts
Prompt Decorator: Injects system instructions or context into prompts at the gateway layer
Prompt Template: Applies configurable templates to transform prompts before they reach the model
Prompt Compressor: Compresses prompt text to reduce token usage before upstream calls
Model Round Robin: Distributes requests evenly across a pool of AI model endpoints
Model Weighted Round Robin: Distributes requests across model endpoints according to configured weights
Respond: Returns an immediate response without forwarding to the upstream backend
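The vector-similarity lookup behind a semantic cache can be sketched as below. A real implementation would embed each prompt with an embedding model and likely use an approximate-nearest-neighbor index; here the embeddings are passed in directly and the search is a linear scan:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

class SemanticCache:
    """Stores (embedding, response) pairs; a lookup hits when cosine
    similarity to a stored prompt embedding exceeds the threshold."""

    def __init__(self, threshold=0.9):
        self.threshold = threshold
        self.entries = []

    def get(self, embedding):
        best, best_sim = None, 0.0
        for vec, response in self.entries:
            sim = cosine(embedding, vec)
            if sim >= self.threshold and sim > best_sim:
                best, best_sim = response, sim
        return best

    def put(self, embedding, response):
        self.entries.append((embedding, response))

cache = SemanticCache(threshold=0.9)
cache.put([1.0, 0.0, 0.1], "Paris is the capital of France.")
print(cache.get([0.99, 0.0, 0.12]))  # near-identical prompt: cache hit
print(cache.get([0.0, 1.0, 0.0]))    # unrelated prompt: None
```

The threshold is the key tuning knob: too low and dissimilar prompts share answers, too high and paraphrases never hit the cache.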

MCP (Model Context Protocol)

A dedicated policy tier for securing and controlling MCP servers — the emerging standard for exposing tools, resources, and prompts to AI agents.

MCP Authentication: Secures MCP server traffic per the MCP specification authorization profile
MCP Authorization: Validates access to MCP tools, resources, and prompts using JWT claims or OAuth scopes
MCP Access Control: Controls which tools, resources, and prompts a caller can reach using allow/deny lists
MCP Rewrite: Defines user-facing tool names and maps them to backend capability names
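Allow/deny filtering of the kind MCP Access Control describes can be sketched with glob patterns. The pattern syntax and precedence here (deny wins, empty allow list means allow all) are illustrative assumptions, not the policy's documented configuration:

```python
from fnmatch import fnmatch

def filter_tools(tools, allow=None, deny=None):
    """Return only the tool names a caller may reach.
    Deny patterns win over allow; no allow list means everything passes."""
    def permitted(name):
        if deny and any(fnmatch(name, p) for p in deny):
            return False
        if allow:
            return any(fnmatch(name, p) for p in allow)
        return True
    return [t for t in tools if permitted(t)]

tools = ["read_file", "write_file", "delete_file", "search_docs"]
print(filter_tools(tools, allow=["*_file", "search_*"], deny=["delete_*"]))
# ['read_file', 'write_file', 'search_docs']
```

The same shape applies to MCP resources and prompts: the gateway filters the capability list before an agent ever sees it.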

Transformation

Modify requests and responses in flight.

Request Rewrite: Rewrites path, query parameters, and HTTP method before forwarding upstream
Host Rewrite: Overrides the Host/:authority header sent to the upstream
Dynamic Endpoint: Routes a request to a named upstream at runtime
Set Headers: Adds or overwrites headers on requests or responses
Remove Headers: Strips specified headers from requests or responses
JSON/XML Mediator: Converts payloads between JSON and XML formats
Interceptor Service: Calls a user-defined HTTP service during the request and/or response phase
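The header policies reduce to simple map operations, with the wrinkle that HTTP header names are case-insensitive. A minimal sketch (function names invented):

```python
def set_headers(headers, additions):
    """Add or overwrite headers; names compare case-insensitively."""
    out = {k.lower(): v for k, v in headers.items()}
    for name, value in additions.items():
        out[name.lower()] = value
    return out

def remove_headers(headers, names):
    """Strip the named headers, whatever their casing."""
    drop = {n.lower() for n in names}
    return {k: v for k, v in headers.items() if k.lower() not in drop}

h = {"Content-Type": "application/json", "X-Debug": "1"}
print(set_headers(h, {"X-Trace-Id": "abc"}))
print(remove_headers(h, ["x-debug"]))  # {'Content-Type': 'application/json'}
```

Case-insensitive matching matters in practice: a policy that removes `X-Debug` should also strip `x-debug` sent by a client.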

Logging, Analytics, and Monitoring

Log Message: Logs request/response payload and headers
Analytics Header Filter: Controls which headers appear in analytics data using allow or deny mode

Common Use Cases

AI guardrails: Stack prompt-injection detection, PII masking, and a semantic prompt guard on an LLM API to enforce safety and compliance requirements.

LLM cost governance: Combine the LLM Cost policy with LLM Cost-Based Rate Limit to define cost constraints that the gateway enforces transparently.
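The arithmetic behind this pairing is straightforward: price each call from its token counts, then charge it against a running budget. The model name and per-million-token prices below are hypothetical, and real gateway policies track budgets per time window and per consumer:

```python
# Hypothetical prices, USD per 1M tokens; real prices vary by provider/model.
PRICES = {
    "model-a": {"input": 3.00, "output": 15.00},
}

def llm_call_cost(model, input_tokens, output_tokens):
    """Monetary cost of one LLM call from its token counts."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

class CostBudget:
    """Rejects calls once accumulated spend would exceed the limit."""

    def __init__(self, limit_usd):
        self.limit, self.spent = limit_usd, 0.0

    def charge(self, cost):
        if self.spent + cost > self.limit:
            return False          # budget exhausted: reject the call
        self.spent += cost
        return True

cost = llm_call_cost("model-a", input_tokens=1200, output_tokens=400)
print(f"${cost:.4f}")  # $0.0096
```

In the chained-policy model, the cost policy computes `cost` and stores it in the message context; the budget policy downstream reads it and admits or rejects.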

MCP server security: Apply MCP Authentication, MCP Authorization, and MCP Access Control together to fully secure an MCP server: authenticate the caller, validate their OAuth scopes, and restrict which tools they can invoke.

Semantic caching: Place the Semantic Cache policy on a high-traffic LLM API to serve cached responses for semantically equivalent questions, cutting both latency and cost.

Multi-model routing: Use Model Weighted Round Robin to distribute traffic across model endpoints by weight.
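One common way to implement weighted distribution is smooth weighted round-robin, the variant popularized by nginx for upstream selection; whether the hub policy uses this exact scheme is not specified here, so treat this as an illustrative sketch with invented endpoint names:

```python
class WeightedRoundRobin:
    """Smooth weighted round-robin: each pick boosts every endpoint by its
    weight, selects the current maximum, then debits it by the total weight.
    This interleaves picks instead of bursting all of one endpoint first."""

    def __init__(self, weights):
        self.weights = dict(weights)            # endpoint -> weight
        self.current = {k: 0 for k in weights}  # running scores

    def next(self):
        total = sum(self.weights.values())
        for k in self.current:
            self.current[k] += self.weights[k]
        best = max(self.current, key=self.current.get)
        self.current[best] -= total
        return best

rr = WeightedRoundRobin({"model-large": 3, "model-small": 1})
picks = [rr.next() for _ in range(4)]
print(picks)  # a 3:1 mix across every window of four picks
```

Over any four consecutive picks the 3:1 ratio holds exactly, which keeps latency and load predictable compared with random weighted choice.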

Independent Versioning

Every policy in the hub is versioned on its own track. When a new version ships, prior versions remain fully available, so running deployments do not break when a policy is updated.

Extending the Hub

The API Platform CLI can build a custom gateway image that bundles any combination of hub policies and policies you write yourself. This lets you fill any remaining gaps with local implementations and, if you choose, contribute them back to the Policy Hub.