Perf Platform Documentation

Welcome to Perf - the intelligent AI runtime orchestrator that optimizes your LLM applications for cost, quality, and reliability while providing unified access to text, image, audio, and video generation.

What is Perf?

Perf is an AI infrastructure layer that sits between your application and AI providers (OpenAI, Anthropic, Google, Stability AI, Runway, and more). We provide:
  • Unified API - One API for text, images, audio, and video generation
  • Intelligent Orchestration - Automatically select the optimal model based on task, budget, and quality requirements
  • Schema Enforcement - Validate and auto-repair LLM outputs against your JSON schemas
  • Continuous Learning - Performance improves as we learn from millions of inferences

Why Perf?

The Problem

Building production AI applications is complex:
  • Fragmented APIs - Different providers have different APIs, authentication, and response formats
  • Cost Uncertainty - Costs can vary by 30x or more across providers and models
  • Unreliable Outputs - LLMs can return malformed JSON, hallucinate, or refuse requests
  • Manual Optimization - Teams spend weeks tuning model selection and prompts
  • Provider Lock-in - Switching providers requires significant code changes

The Solution

Perf provides:
  • One API, All Modalities - Text, images, audio, video through OpenAI-compatible endpoints
  • Intelligent Orchestration - Automatically select the best model for each request
  • Schema Enforcement - Define JSON schemas; we validate and auto-repair outputs
  • Cost Control - Enforce budgets and automatically optimize spend
  • Zero Lock-in - Switch models/providers without code changes

Key Features

Multi-Modal Generation

  • Text - Chat completions with intelligent model selection
  • Images - DALL-E 3, Stable Diffusion 3, Flux, Ideogram, Imagen
  • Audio - Text-to-speech (TTS) and speech-to-text (Whisper)
  • Video - Veo 3, Runway Gen-3, Luma Dream Machine, Pika
  • Voice Agents - Real-time conversational AI agents with custom instructions, a knowledge base, and content safety

Smart Model Selection

  • Automatic task classification (extraction, reasoning, code, vision, audio)
  • Complexity-aware model selection
  • Per-customer preference learning
  • Real-time provider health monitoring

Schema Enforcement

  • Define JSON schemas for structured outputs
  • Automatic validation and repair
  • Type coercion and format correction
  • Per-project default schemas
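
As a rough illustration of what validation with type coercion can involve, the sketch below checks a parsed model output against a very small subset of JSON Schema (only `type`, `properties`, and `required`) and coerces string values into integers, numbers, or booleans where that is unambiguous. It is a client-side sketch under those assumptions, not Perf's implementation; `coerce_to_schema` and the example schema are hypothetical.

```python
import json


def coerce_to_schema(value, schema):
    """Coerce `value` to a tiny JSON Schema subset, or raise ValueError.

    Hypothetical helper for illustration; handles only `type`,
    `properties`, and `required`.
    """
    expected = schema.get("type")
    if expected == "object":
        if not isinstance(value, dict):
            raise ValueError("expected an object")
        missing = [k for k in schema.get("required", []) if k not in value]
        if missing:
            raise ValueError(f"missing required fields: {missing}")
        props = schema.get("properties", {})
        # Coerce known properties; pass unknown ones through unchanged.
        return {
            k: coerce_to_schema(v, props[k]) if k in props else v
            for k, v in value.items()
        }
    if expected == "integer":
        if isinstance(value, int) and not isinstance(value, bool):
            return value
        if isinstance(value, str):
            return int(value.strip())  # ValueError on non-numeric text
        raise ValueError(f"cannot coerce {value!r} to integer")
    if expected == "number":
        if isinstance(value, (int, float)) and not isinstance(value, bool):
            return value
        if isinstance(value, str):
            return float(value.strip())
        raise ValueError(f"cannot coerce {value!r} to number")
    if expected == "boolean":
        if isinstance(value, bool):
            return value
        if isinstance(value, str) and value.strip().lower() in ("true", "false"):
            return value.strip().lower() == "true"
        raise ValueError(f"cannot coerce {value!r} to boolean")
    if expected == "string":
        if isinstance(value, str):
            return value
        raise ValueError(f"cannot coerce {value!r} to string")
    return value  # no type constraint in the schema


schema = {
    "type": "object",
    "required": ["name", "age"],
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
        "active": {"type": "boolean"},
    },
}

# A model output whose types are wrong but recoverable.
raw_output = '{"name": "Ada", "age": "36", "active": "TRUE"}'
repaired = coerce_to_schema(json.loads(raw_output), schema)
print(repaired)
```

Outputs that cannot be coerced (a missing required field, or non-numeric text where an integer is expected) raise `ValueError`, which is the point at which a caller could retry the request.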

Policy Enforcement

  • Routing Policies - Control model selection, set cost limits, block providers
  • Content Policies - Detect and redact PII, filter sensitive terms
  • Compliance - HIPAA-ready PII detection, audit logging
  • Governance - Policy templates for common use cases
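
To make the idea of a content policy that detects and redacts PII concrete, here is a minimal sketch using two regular expressions (email addresses and US-style Social Security numbers). The patterns, labels, and the `redact_pii` helper are assumptions made for illustration; they say nothing about how Perf's detection works, and real PII detection needs far broader coverage.

```python
import re

# Deliberately simple, illustrative patterns (not Perf's detectors).
PII_PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def redact_pii(text):
    """Replace each match with a [LABEL] placeholder.

    Returns the redacted text and the list of labels that matched.
    """
    found = []
    for label, pattern in PII_PATTERNS.items():
        text, count = pattern.subn(f"[{label}]", text)
        if count:
            found.append(label)
    return text, found


redacted, labels = redact_pii("Contact jane.doe@example.com, SSN 123-45-6789.")
print(redacted)
print(labels)
```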

Tools Library

  • Web Search - Real-time web search for up-to-date information
  • Documents/RAG - Upload and query documents with semantic search
  • Memory - Persistent conversation context across sessions
  • Coming Soon - External actions (Slack, GitHub, Jira, and more)

Cost Optimization

  • Per-request budget constraints
  • Automatic downgrade to a cheaper model when a request would otherwise exceed its budget
  • Up to 60% cost savings compared with GPT-4
  • Transparent per-call billing
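
One way to picture a per-request budget with automatic downgrade is as choosing the highest-quality model whose estimated cost still fits the budget. The sketch below assumes exactly that rule. The model names, prices, and quality ranks are made-up placeholders, and `pick_model` is a hypothetical helper, not part of the Perf API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelOption:
    name: str                 # placeholder name, not a real model ID
    usd_per_1k_tokens: float  # placeholder price
    quality_rank: int         # higher is better (placeholder)


# Hypothetical catalog, for illustration only.
CATALOG = [
    ModelOption("large-model", 0.030, 3),
    ModelOption("medium-model", 0.003, 2),
    ModelOption("small-model", 0.001, 1),
]


def pick_model(estimated_tokens, budget_usd, catalog=CATALOG):
    """Return the best-ranked model whose estimated cost fits the budget."""
    affordable = [
        m for m in catalog
        if m.usd_per_1k_tokens * estimated_tokens / 1000 <= budget_usd
    ]
    if not affordable:
        raise ValueError("no model fits the budget")
    return max(affordable, key=lambda m: m.quality_rank)


# A generous budget keeps the top model; a tight one triggers a downgrade.
print(pick_model(estimated_tokens=2000, budget_usd=0.10).name)
print(pick_model(estimated_tokens=2000, budget_usd=0.01).name)
```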

Production-Ready

  • Automatic failover across providers
  • Quality validation with retry logic
  • Real-time dashboards and logs
  • Enterprise observability
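
Failover with retry logic generally follows a simple pattern: try the preferred provider a bounded number of times, then move on to the next one. The sketch below shows that pattern with stand-in provider functions. `call_with_failover`, the provider names, and the retry count are assumptions made for illustration, not a description of Perf's internals.

```python
def call_with_failover(providers, request, max_attempts_per_provider=2):
    """Try each (name, callable) in order; return the first success.

    Returns a (provider_name, result) tuple, or raises RuntimeError
    listing every failure if all providers fail.
    """
    errors = []
    for name, call in providers:
        for attempt in range(1, max_attempts_per_provider + 1):
            try:
                return name, call(request)
            except Exception as exc:  # broad on purpose: any failure moves on
                errors.append(f"{name} attempt {attempt}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


# Stand-in providers for the example; real ones would make network calls.
def flaky_primary(request):
    raise ConnectionError("primary unavailable")


def healthy_backup(request):
    return f"echo: {request}"


used, result = call_with_failover(
    [("primary", flaky_primary), ("backup", healthy_backup)],
    "Hello!",
)
print(used, result)
```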

Quick Start

1. Get Your API Key

Sign up at dashboard.withperf.pro/sign-up and generate your API key.

2. Make Requests

# Text generation
curl https://api.withperf.pro/v1/chat \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Hello!"}]}'

# Image generation
curl https://api.withperf.pro/v1/images/generations \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"prompt": "A sunset over mountains", "model": "dall-e-3"}'
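
The text-generation request above can also be built from Python using only the standard library. In this sketch the endpoint, headers, and payload are taken from the curl example; the `build_chat_request` helper is a hypothetical name, and the response format is not described on this page, so the sending step is left commented out.

```python
import json
import urllib.request

API_URL = "https://api.withperf.pro/v1/chat"  # endpoint from the curl example


def build_chat_request(api_key, user_message):
    """Build (but do not send) a POST request mirroring the curl example."""
    body = json.dumps(
        {"messages": [{"role": "user", "content": user_message}]}
    ).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_chat_request("YOUR_API_KEY", "Hello!")

# To actually send it (requires a valid key and network access):
# with urllib.request.urlopen(request, timeout=30) as response:
#     print(json.load(response))  # response shape is not documented here
```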

3. View Analytics

Monitor usage, costs, and performance in the Dashboard.

API Reference

Text Generation

Voice Agents

Media Generation

Governance & Quality

Agentic Tools

  • Tools API - Web search, documents/RAG, conversation memory

Analytics

Documentation

Getting Started

Platform Guide

Advanced

Support