UGC Moderation for Apps

The developer-first platform for content moderation

Content filtering, user reporting, user blocking, appeals, policy versioning, and immutable audit trails - one API, production-ready in hours, not quarters.

Free tier: 15,000 decisions/mo - enough to moderate a 5,000-user app. No credit card required.

Policy versioning
Appeals workflow
Decision replay
Compliance exports

More than a filter API

An OpenAI moderation call gives you scores. Vettly gives you the full trust-and-safety stack around those scores.

Feature | Build In-House | Generic Moderation API | Vettly
Content filtering | | | Yes, multi-provider
User reporting | | | Yes, built-in
User blocking | | | Yes, built-in
Audit trail | | | Yes, immutable + exportable
Policy templates | | | Yes, pre-built
Appeals workflow | | | Yes, built-in
Policy version pinning | | | Yes, every decision
Decision replay | | | Yes, test new policies on old data

DECISION LIFECYCLE

Every decision is traceable, contestable, and replayable

A regex gives you a yes/no. An LLM call gives you a score. Vettly gives you a decision record that tracks policy version, appeal outcome, reviewer action, and retention status - so when trust, legal, or App Review asks, you have the answer.

1. Intake of content and context
2. Policy evaluation under a specific version
3. Action taken and recorded
4. History stored with retention rules
5. Appeal or review, if contested
6. Search, export, replay whenever you need
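
To make the lifecycle concrete, here is a minimal sketch of a decision record that is appended to rather than rewritten. The field names (`events`, `type`) and the helper functions are hypothetical illustrations, not Vettly's actual schema, and retention handling is left out.

```javascript
// Hypothetical decision record. Field names are illustrative assumptions,
// not Vettly's real schema.
function createDecision({ id, content, context, policyVersion, action }) {
  return Object.freeze({
    id,
    content,        // step 1: intake of content...
    context,        // ...and context
    policyVersion,  // step 2: evaluated under a specific version
    action,         // step 3: action taken and recorded
    events: Object.freeze([Object.freeze({ type: "decided", action })]),
  });
}

// Step 5: appeals and reviews append an event and return a new record,
// so the original decision is never modified.
function appendEvent(decision, event) {
  return Object.freeze({
    ...decision,
    events: Object.freeze([...decision.events, Object.freeze({ ...event })]),
  });
}

// The most recent event that carries an action determines the current outcome.
function currentAction(decision) {
  for (let i = decision.events.length - 1; i >= 0; i--) {
    if (decision.events[i].action) return decision.events[i].action;
  }
  return decision.action;
}

const original = createDecision({
  id: "dec_example",
  content: "hello",
  context: { userId: "u1" },
  policyVersion: "1.0",
  action: "block",
});
const appealed = appendEvent(original, { type: "appeal_opened" });
const resolved = appendEvent(appealed, { type: "appeal_resolved", action: "allow" });
```

Because every step returns a new frozen object, `original` still shows the first outcome after the appeal is resolved, which is the property an audit trail depends on.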

HOW IT WORKS

Three integration steps. Six workflows behind the scenes.

You integrate three endpoints. Vettly handles filtering, reporting, blocking, appeals, audit trails, and policy versioning behind them.

1. Add one moderation call

Call /v1/check before content is published.

Policy evaluation, graduated actions, category scoring, and full decision logging.

2. Turn on report + block

Enable user reporting and blocking endpoints.

Report routing, resolution tracking, mutual-block enforcement, and appeals workflow.

3. Ship with audit evidence

Every decision logged with the policy version that produced it.

Decision replay, compliance exports, retention rules, and searchable history.
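
As a rough sketch of step 1, the snippet below builds a request for `/v1/check` and gates publishing on the returned `action`. The base URL, the bearer-token header, and the choice of which actions count as publishable are assumptions made for illustration. Only the endpoint path, the `content` and `contentType` fields, and the `action` field come from this page.

```javascript
// Assumption: placeholder host, not Vettly's real API base URL.
const BASE_URL = "https://api.example.com";

// Build the HTTP request for a pre-publish moderation check.
function buildCheckRequest(apiKey, content, contentType = "text") {
  return {
    url: `${BASE_URL}/v1/check`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`, // assumed auth scheme
      },
      body: JSON.stringify({ content, contentType }),
    },
  };
}

// Assumption: this app publishes on allow, warn, and flag, and holds
// everything else, including any action it does not recognize.
const PUBLISHABLE = new Set(["allow", "warn", "flag"]);

function shouldPublish(decision) {
  return PUBLISHABLE.has(decision.action);
}

// Usage in a runtime with fetch (for example Node 18+):
//   const { url, init } = buildCheckRequest(process.env.VETTLY_API_KEY, text);
//   const decision = await (await fetch(url, init)).json();
//   if (shouldPublish(decision)) publishPost(text); // publishPost is your own function
```

Treating unknown actions as not publishable keeps the gate conservative if new action types are introduced later.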

Get free API key

No credit card required. 15,000 decisions/mo - enough to moderate a 5,000-user app through launch.

WHO IT'S FOR

Built for teams shipping user-generated content.

Define your content policy in plain English. Vettly enforces it across text, images, and video with decisions your team can trace, explain, and defend.

Social & Community Apps

Chat, comments, and user profiles. Filter text and images before they appear, add report buttons, and let users block each other.

  • Chat messages & comments
  • User profiles & bios
  • Photo & video sharing

Marketplace & Review Apps
AI & Chatbot Apps
Kids & Education Apps

TRY IT LIVE

See it in action

Edit the text, click Analyze, and see the real API response.

This demo uses a rate-limited API key. For production use, sign up for a free account.

Input
Policy (YAML)
name: community-safe
version: "1.0"
rules:
  - category: hate_speech
    threshold: 0.6
    action: block
  - category: harassment
    threshold: 0.6
    action: flag
  - category: violence
    threshold: 0.7
    action: warn
  - category: spam
    threshold: 0.5
    action: flag
Response
{
  "decisionId": "dec_xxxxxxxx",
  "action": "allow",
  "safe": true,
  "policyVersion": "[email protected]",
  "categories": [],
  "latency_ms": "---"
}
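
One way to read the policy and response above is that each rule compares a category score against its threshold, and the strictest triggered action becomes the decision. The sketch below follows that reading. The severity ordering, the strict `>` comparison, and the shape of the `scores` input are assumptions, not documented Vettly behavior.

```javascript
// The demo policy from this page, expressed as a plain object.
const policy = {
  name: "community-safe",
  version: "1.0",
  rules: [
    { category: "hate_speech", threshold: 0.6, action: "block" },
    { category: "harassment", threshold: 0.6, action: "flag" },
    { category: "violence", threshold: 0.7, action: "warn" },
    { category: "spam", threshold: 0.5, action: "flag" },
  ],
};

// Assumed severity order, used to pick one action when several rules fire.
const SEVERITY = { allow: 0, warn: 1, flag: 2, block: 3 };

// scores: hypothetical map of category name to a 0..1 model score.
function evaluate(policy, scores) {
  // A rule triggers when its category scores strictly above the threshold.
  const triggered = policy.rules.filter(
    (rule) => (scores[rule.category] ?? 0) > rule.threshold
  );
  const action = triggered.reduce(
    (worst, rule) => (SEVERITY[rule.action] > SEVERITY[worst] ? rule.action : worst),
    "allow"
  );
  return {
    action,
    safe: action === "allow",
    policyVersion: policy.version,
    categories: triggered.map((rule) => rule.category),
  };
}

const clean = evaluate(policy, {});
const mixed = evaluate(policy, { harassment: 0.9, violence: 0.8 });
```

With no scores above any threshold, the result has `action: "allow"` and an empty `categories` list, the same shape as the sample response. When both harassment and violence trigger, the stricter `flag` wins over `warn`.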

What happens next in the backend

Evidence Retained
Appeal / Review
Audit
Export
Replay

Consistent API

Same JSON structure for every request. Text, images, video, one predictable format.

Built for Speed

Optimized for low-latency responses. See real performance in the playground above.

Your Data, Your Control

We don't train on your content. Configure retention policies to match your requirements.

Production Infrastructure

Deployed on enterprise-grade cloud infrastructure with monitoring and alerting.

WHAT YOU GET

Everything your moderation stack needs, nothing extra

Filtering, reporting, blocking, and auditable decision history in one API key.

Content Filtering

Screen text, images, and video against your policy before content goes live. Blocks hate speech, sexual content, violence, and spam.

User Reporting

Let users report offensive content with one API call. Reports are tracked, assigned, and resolved with a full audit trail.

User Blocking

Add users to a blocklist so they can't contact or interact with the reporter. Built into the API.
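
A blocklist like this usually has to be enforced in both directions. The sketch below is an in-memory illustration of that idea under the assumption that a block by either user stops all interaction between the pair. The `BlockList` class and its method names are hypothetical, not part of the Vettly API.

```javascript
// Hypothetical in-memory blocklist, for illustration only.
class BlockList {
  constructor() {
    this.blocks = new Map(); // blockerId -> Set of blocked user ids
  }

  block(blockerId, blockedId) {
    if (!this.blocks.has(blockerId)) this.blocks.set(blockerId, new Set());
    this.blocks.get(blockerId).add(blockedId);
  }

  unblock(blockerId, blockedId) {
    this.blocks.get(blockerId)?.delete(blockedId);
  }

  hasBlocked(blockerId, otherId) {
    return this.blocks.get(blockerId)?.has(otherId) ?? false;
  }

  // Interaction is allowed only if neither user has blocked the other.
  canInteract(a, b) {
    return !this.hasBlocked(a, b) && !this.hasBlocked(b, a);
  }
}

const blocklist = new BlockList();
blocklist.block("reporter", "offender");
```

Checking `canInteract` before delivering a message or showing a profile covers both the blocker and the blocked user with a single call.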

Audit Trail

Every decision is pinned to the policy version that produced it. Replay past decisions against new policies, export for legal holds, and trace appeals from dispute to resolution.

Policy Templates

Pre-built starter policies for common moderation use cases. Start in minutes, customize when you need to.

Appeals Workflow

Handle disputes and overturn decisions. Users can contest blocks, and moderators can review with full context.

POLICIES IN PLAIN ENGLISH

Rules you can actually read

Policies are explicit. They are written in versioned configuration, not buried in model behavior. Old decisions are never reinterpreted. Policy changes apply forward, not retroactively. This is how automated systems earn trust.

community-safe.yml
name: community-safe
description: UGC safety baseline
rules:
  - category: sexual
    threshold: 0.6
    action: reject
  - category: hate
    threshold: 0.5
    action: reject
  - category: suggestive
    threshold: 0.8
    action: flag

  • Block explicit content: reject sexual content scoring above 60%.
  • Block hate speech: reject content scoring above 50% confidence for hate speech.
  • Flag borderline content: flag suggestive content scoring above 80% for human review.
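
The claim that policy changes apply forward and never retroactively can be sketched as a versioned policy store plus a replay function that produces new results without touching stored decisions. The version numbers, the lowered `suggestive` threshold in version 1.1, and the first-matching-rule behavior are assumptions made for this example.

```javascript
// Published policy versions are frozen and can never be overwritten.
const versions = new Map();

function publishPolicy(policy) {
  if (versions.has(policy.version)) {
    throw new Error(`version ${policy.version} already published`);
  }
  const rules = Object.freeze(policy.rules.map((r) => Object.freeze({ ...r })));
  versions.set(policy.version, Object.freeze({ ...policy, rules }));
}

// Each decision is pinned to the version that produced it.
// Assumption: the first rule whose threshold is exceeded decides the action.
function decide(policyVersion, scores) {
  const policy = versions.get(policyVersion);
  if (!policy) throw new Error(`unknown policy version: ${policyVersion}`);
  const hit = policy.rules.find((r) => (scores[r.category] ?? 0) > r.threshold);
  return Object.freeze({ policyVersion, scores, action: hit ? hit.action : "allow" });
}

// Replay evaluates old inputs under another version and returns a comparison.
// Stored decisions are left exactly as they were.
function replay(decisions, newVersion) {
  return decisions.map((original) => ({
    original,
    replayed: decide(newVersion, original.scores),
  }));
}

const baseRules = [
  { category: "sexual", threshold: 0.6, action: "reject" },
  { category: "hate", threshold: 0.5, action: "reject" },
];
publishPolicy({ version: "1.0", rules: [...baseRules, { category: "suggestive", threshold: 0.8, action: "flag" }] });
// Hypothetical stricter revision used only to demonstrate replay.
publishPolicy({ version: "1.1", rules: [...baseRules, { category: "suggestive", threshold: 0.7, action: "flag" }] });

const past = [decide("1.0", { suggestive: 0.75 })];
const report = replay(past, "1.1");
```

Here the stored decision stays `allow` under version 1.0, while the replayed result under 1.1 is `flag`, so a team can see what a stricter threshold would have changed before rolling it out.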

Custom Prompts

Pro+

Ask AI anything about your images. Write rules in plain English like "Is this counterfeit?" or "Does this food look undercooked?"

customPrompt: "Does this show counterfeit luxury goods?"

BUILT WITH VETTLY

One API call. Six workflows.

This Discord bot calls one endpoint. Behind that endpoint: policy evaluation, graduated actions, decision logging, appeal handling, compliance exports, and policy versioning. All automatic.

Vettly Bot (Online)

BadActor, Today at 2:34 PM
I hate everyone in this server you all suck

Vettly Bot (BOT), Today at 2:34 PM
Decision: BLOCK
Policy rule triggered: harassment > 0.80

GoodUser, Today at 2:35 PM
Hey everyone! Just finished the tutorial, this is awesome 🎉

Vettly Bot (BOT), Today at 2:35 PM
Decision: ALLOW
Governed by policy [email protected]

Protected · Real-time

One API call. Every decision recorded.

This bot took an afternoon to build. The decision logic? Just one line of code. Try it on your server or check out the source to build your own.

// Every call produces a decision record
const result = await vettly.check({ content, contentType: "text" })

  • Full Discord bot in ~200 lines
  • Text and image evaluation
  • Policy versioning per server
  • Open source on GitHub

NEW: OPENCLAW GUARDRAILS

🦞 Stop risky agent actions before they execute

Use Vettly as the guardrail control plane for OpenClaw. Enforce fail-closed runtime decisions, approval thresholds, blocked command patterns, and policy rollback in one workflow.

In-path authorization for shell, file, network, and env access.

Fail-closed defaults so outages do not silently bypass safety.

Presets, policy history, restore, and operating metrics in dashboard.

How it works

1. Vet skills before install.

2. Authorize runtime actions in-path.

3. Enforce allow, warn, flag, or block.

4. Track metrics and rollback policy versions.
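
The fail-closed behavior described above can be sketched as a wrapper that blocks whenever the authorizer errors out or returns something unrecognized. The blocked patterns, the `reason` strings, and the synchronous `remoteAuthorize` callback are hypothetical stand-ins. A real integration would presumably make an asynchronous call to the action-authorize endpoint with a timeout.

```javascript
// Hypothetical blocked command patterns, checked locally before any remote call.
const BLOCKED_PATTERNS = [
  /\brm\s+-rf\s+\//,               // recursive delete from the filesystem root
  /\bcurl\b.*\|\s*(sh|bash)\b/,    // piping a download straight into a shell
];

const VALID_ACTIONS = new Set(["allow", "warn", "flag", "block"]);

// remoteAuthorize: stand-in for the call to the guardrail service.
function authorizeAction(command, remoteAuthorize) {
  if (BLOCKED_PATTERNS.some((pattern) => pattern.test(command))) {
    return { action: "block", reason: "blocked_pattern" };
  }
  try {
    const decision = remoteAuthorize(command);
    if (!decision || !VALID_ACTIONS.has(decision.action)) {
      // Fail closed: a malformed answer is treated as a block.
      return { action: "block", reason: "invalid_response" };
    }
    return decision;
  } catch (err) {
    // Fail closed: an outage or timeout never turns into an implicit allow.
    return { action: "block", reason: "authorizer_unavailable" };
  }
}

const duringOutage = authorizeAction("ls -la", () => {
  throw new Error("service unreachable");
});
```

The important design choice is that every path other than a well-formed response ends in `block`, so an outage cannot silently bypass the guardrail.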

Core endpoints

POST /v1/openclaw/guardrails/skill-vetting
POST /v1/openclaw/guardrails/action-authorize
GET /v1/openclaw/guardrails/metrics?days=30

SIMPLE PRICING

Simple, predictable pricing

Start free. Upgrade when you need more volume, longer history, or advanced workflows.

Every plan includes decision IDs, policy versioning, and searchable history.

Free

Enough for a 5,000-user app

$0 forever
  • 15,000 decisions / month
  • Text, images & video
  • Starter policy templates
  • Decision IDs & audit trail
  • 24-hour decision history
  • Policy versioning

Starter

Ship your MVP without hitting limits

$19/month
  • 100,000 decisions / month
  • Everything in Free
  • 1-year decision history
  • Custom policy rules
  • Decision webhooks
Popular

Growth

Ship publicly with stronger protection

$49/month
  • 300,000 decisions / month
  • Everything in Starter
  • Spam & scam detection
  • Advanced risk controls
  • 3-year moderation logs

Pro

Custom rules, priority processing, compliance

$149/month
  • 1,500,000 decisions / month
  • Everything in Growth
  • Custom moderation rules
  • Priority processing
  • Compliance & audit support
  • Priority support

Enterprise

Custom infrastructure at scale

Custom
  • Everything in Pro
  • Unlimited decisions
  • Multi-provider routing
  • Batch moderation API
  • SLA guarantees
  • Dedicated support

All plans include: decision IDs, policy versioning, searchable history, and a unified API for text, image, and video.

Overage pricing (all paid plans): text $0.0001/unit · image $0.001/unit · video $0.01/unit, prorated and billed monthly.
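
As a worked example of the overage rates, the sketch below totals the cost of units that fall beyond a plan's monthly allowance. It assumes the overage units per content type are already known, since the page does not say how the included quota is split across text, image, and video, and it ignores proration. Rates are kept in whole micro-dollars to avoid floating-point rounding.

```javascript
// Overage rates from the pricing note, in micro-dollars per unit:
// text $0.0001, image $0.001, video $0.01.
const OVERAGE_MICRO_USD = { text: 100, image: 1000, video: 10000 };

// overageUnits: units beyond the plan's included quota, per content type.
function overageUsd(overageUnits) {
  let totalMicroUsd = 0;
  for (const [type, units] of Object.entries(overageUnits)) {
    const rate = OVERAGE_MICRO_USD[type];
    if (rate === undefined) throw new Error(`unknown content type: ${type}`);
    totalMicroUsd += units * rate;
  }
  return totalMicroUsd / 1_000_000;
}

// 50,000 text + 2,000 image + 100 video overage units:
// $5.00 + $2.00 + $1.00 of overage.
const extra = overageUsd({ text: 50000, image: 2000, video: 100 });
const starterTotal = 19 + extra; // Starter base price plus overage
```

Under these assumptions, a Starter account with that overage would pay $19 plus $8, or $27 for the month.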