AI Prompt Guard

The ai.promptGuard policy inspects LLM request prompts and/or model responses before they are forwarded. Guards can be applied to request (incoming user prompts) and response (model output) independently. Multiple guards can be chained in each list. ai.promptGuard is part of the ai policy, which marks a route as LLM traffic:

binds:
- port: 3000
  listeners:
  - routes:
    - policies:
        ai:
          promptGuard:
            request:
            - regex:
                action: reject
                rules:
                - pattern: SSN
            response:
            - bedrockGuardrails:
                guardrailIdentifier: my-guardrail
                guardrailVersion: DRAFT
                region: us-west-2

Guard types

Each item in request[] or response[] is exactly one of the following guard types:

Regex
Webhook
OpenAI Moderation
Bedrock Guardrails
Google Model Armor

Inspect content using regular expression patterns. Supports both custom patterns and built-in named patterns for common PII types.

object

Regex-based content guard.

Show regex fields

string

required

Action to take when a rule matches. Valid values: reject.

action: reject

object[]

required

A list of match rules. Each rule is either a builtin named pattern or a custom pattern.

Show rule types

string

A built-in named pattern. Currently supported values include email.

rules:
- builtin: email

string

A custom regular expression string to match against message content.

rules:
- pattern: SSN
- pattern: Social Security

Example — reject requests containing PII:

ai:
  promptGuard:
    request:
    - regex:
        action: reject
        rules:
        - pattern: SSN
        - pattern: Social Security
      rejection:
        status: 400
        headers:
          set:
            content-type: "application/json"
        body: |
          {
            "error": {
              "message": "Request rejected: Content contains sensitive information",
              "type": "invalid_request_error",
              "code": "content_policy_violation"
            }
          }
    - regex:
        action: reject
        rules:
        - builtin: email
      rejection:
        status: 400
        body: '{"error": {"message": "Contains email address"}}'

Forward content to an external HTTP service for inspection. The service returns an allow or deny decision.

object

Webhook-based content guard.

Show webhook fields

object

required

The external service to forward content to. One of service, host, or backend must be set.

Show target options

string

Hostname or IP address (with port) of the webhook service.

object

Reference to a named service.

Show service fields

string

Namespace of the service.

string

Hostname of the service.

integer

Port of the service.

string

Explicit backend reference. The backend must be defined in the top-level backends list.

object[]

A list of request headers to forward to the webhook service. Each entry matches by header name and optional value.

Show forwardHeaderMatches fields

string

required

Header name to match.

string

Exact value to match.

string

Regex value to match.

Example:

ai:
  promptGuard:
    request:
    - webhook:
        target:
          host: "guard-service:8080"
        forwardHeaderMatches:
        - name: x-user-id

Use the OpenAI Moderation API to classify content.

object

OpenAI Moderation API-based content guard.

Show openAIModeration fields

string

default:"omni-moderation-latest"

The moderation model to use. Defaults to omni-moderation-latest.

model: omni-moderation-latest

object

Backend connection policies for the OpenAI API (TLS, auth, headers, etc.). Supports the same policy fields as other backend connections.

Example:

ai:
  promptGuard:
    request:
    - openAIModeration:
        model: omni-moderation-latest

Use AWS Bedrock Guardrails to evaluate content.

object

AWS Bedrock Guardrails-based content guard.

Show bedrockGuardrails fields

string

required

The unique identifier of the Bedrock guardrail to invoke.

guardrailIdentifier: bedrock-guardrail-identifier

string

required

The version of the guardrail (e.g. DRAFT or a version number).

guardrailVersion: DRAFT

string

required

The AWS region where the guardrail is deployed.

region: us-west-2

object

Backend policies for AWS authentication. When omitted, implicit AWS credential chain authentication is used.

Example:

ai:
  promptGuard:
    request:
    - bedrockGuardrails:
        guardrailIdentifier: bedrock-guardrail-identifier
        guardrailVersion: DRAFT
        region: us-west-2
    response:
    - bedrockGuardrails:
        guardrailIdentifier: bedrock-guardrail-identifier
        guardrailVersion: DRAFT
        region: us-west-2

Use Google Cloud Model Armor to evaluate content.

object

Google Cloud Model Armor-based content guard.

Show googleModelArmor fields

string

required

The Model Armor template ID to use.

templateId: model-armor-template-id

string

required

The GCP project ID where Model Armor is configured.

projectId: my-gcp-project

string

default:"us-central1"

The GCP region. Defaults to us-central1.

location: us-central1

object

Backend policies for GCP authentication. When omitted, implicit GCP credential chain authentication is used.

Example:

ai:
  promptGuard:
    request:
    - googleModelArmor:
        templateId: model-armor-template-id
        projectId: model-armor-project-id
        location: us-central1
    response:
    - googleModelArmor:
        templateId: model-armor-template-id
        projectId: model-armor-project-id
        location: us-central1

Rejection configuration

Each guard entry can include a rejection block that customizes the HTTP response returned when the guard denies a request.

object

Configures the HTTP response sent to the client when the guard rejects a request.

Show rejection fields

integer

HTTP status code to return. Defaults to 403.

rejection:
  status: 400

string

Response body to send. Supports multi-line strings.

rejection:
  body: |
    {"error": {"message": "Request rejected"}}

object

Headers to add, set, or remove from the rejection response.

Show rejection.headers fields

object

Headers to add to the rejection response.

object

Headers to set on the rejection response (overrides existing values).

rejection:
  headers:
    set:
      content-type: "application/json"

string[]

Header names to remove from the rejection response.

Guard chaining

Multiple guards can be listed under request[] or response[]. Guards are evaluated in order. If any guard rejects the content, the associated rejection response is returned immediately and subsequent guards are not evaluated.

ai:
  promptGuard:
    request:
    - regex:                  # evaluated first
        action: reject
        rules:
        - pattern: SSN
      rejection:
        status: 400
        body: 'Rejected: contains SSN'
    - regex:                  # evaluated second (only if first passes)
        action: reject
        rules:
        - builtin: email
      rejection:
        status: 400
        body: 'Rejected: contains email'

The ai.promptGuard policy only applies to routes that process LLM traffic. The parent ai policy must be set on the route for prompt guard to take effect.

Overview

Resources

Policies

CEL Reference

Guard types

Rejection configuration

Guard chaining

​Guard types

​Rejection configuration

​Guard chaining

Guard types

Rejection configuration

Guard chaining