Supported Models
Perf supports the latest and most capable models from leading AI providers. We automatically route your requests to the optimal model based on task type, quality requirements, and cost constraints.
Current Model Portfolio
OpenAI Models
GPT-5.2 (Latest)
Best for: Coding, agentic tasks, complex problem solving
GPT-5 mini
Best for: Fast, cost-efficient general-purpose tasks
GPT-5 nano
Best for: Maximum speed and cost efficiency
GPT-4.1
Best for: Non-reasoning tasks, general intelligence
Anthropic Claude Models
Claude Opus 4.5 (Latest)
Best for: Maximum intelligence with practical performance
Claude Sonnet 4.5
Best for: Complex agents and coding tasks
Claude Haiku 4.5
Best for: Speed and cost efficiency with near-frontier intelligence
Google Gemini Models
Gemini 3 Pro (Preview)
Best for: World’s best multimodal understanding
Gemini 3 Flash (Preview)
Best for: Speed, scale, and frontier intelligence
Gemini 2.5 Flash (Stable)
Best for: Production-ready large-scale processing
Gemini 2.5 Pro (Stable)
Best for: State-of-the-art thinking and reasoning
Gemini 2.5 Flash-Lite (Stable)
Best for: Maximum cost efficiency
Mistral AI Models
Mistral Large
Best for: European data residency, reasoning, multilingual
Alibaba Qwen Models
Qwen 2.5 72B
Best for: Cost-effective reasoning, Asian language support
Meta Llama Models
Llama 3.1 405B
Best for: Open source, self-hosting, customization
Llama 3.1 70B
Best for: Balanced open-source performance
Model Comparison Matrix
| Feature | GPT-5.2 | Claude Opus 4.5 | Claude Sonnet 4.5 | Gemini 3 Pro | Gemini 2.5 Pro |
|---|---|---|---|---|---|
| Reasoning | Excellent | Excellent | Excellent | Excellent | Excellent |
| Speed | Medium | Medium | Fast | Medium | Medium |
| Context Window (tokens) | 400K | 200K | 200K/1M | 1M | 1M |
| Max Output (tokens) | 64K | 64K | 64K | 65K | 65K |
| Multimodal | Yes | No | No | Yes | No |
| Training Data Cutoff | Jul 2025 | Aug 2025 | Jul 2025 | Current | Current |
| Best For | Coding, Agents | Max Intelligence | Balanced | Multimodal | Reasoning |
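As a rough illustration of how the context and output limits in the matrix might be used, the sketch below filters models by whether a request fits. It is not part of any Perf SDK: the `LIMITS` table and `models_that_fit` helper are hypothetical names, "K" is assumed to mean 1,000 tokens, the lower 200K figure is used for Claude Sonnet 4.5, and the requested output is assumed to count against the context window.

```python
# Limits transcribed from the comparison matrix.
# Assumption: "K" means 1,000 tokens and "M" means 1,000,000 tokens.
LIMITS = {
    "GPT-5.2": {"context": 400_000, "max_output": 64_000},
    "Claude Opus 4.5": {"context": 200_000, "max_output": 64_000},
    # The matrix lists 200K/1M; the conservative 200K value is used here.
    "Claude Sonnet 4.5": {"context": 200_000, "max_output": 64_000},
    "Gemini 3 Pro": {"context": 1_000_000, "max_output": 65_000},
    "Gemini 2.5 Pro": {"context": 1_000_000, "max_output": 65_000},
}


def models_that_fit(prompt_tokens, output_tokens):
    """Return models whose limits can hold the prompt plus the requested output.

    Assumption: prompt and output share the same context window.
    """
    return [
        name
        for name, lim in LIMITS.items()
        if output_tokens <= lim["max_output"]
        and prompt_tokens + output_tokens <= lim["context"]
    ]


if __name__ == "__main__":
    # A 300K-token prompt rules out both 200K-context Claude entries.
    print(models_that_fit(300_000, 10_000))
```

A check like this is only a pre-filter; actual token counts depend on each provider's tokenizer.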
Intelligent Orchestration
Perf automatically selects the optimal model based on:
1. Task Type Detection
2. Complexity Scoring
3. Cost Constraints
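The three signals listed above could be combined along the following lines. This is a toy sketch for intuition only, not Perf's actual routing logic, which this page does not describe. The keyword heuristic, the length-based complexity score, the cost tiers, and the function names (`detect_task_type`, `complexity_score`, `select_model`) are all assumptions made for illustration.

```python
# Illustrative only. Cost tiers (1 = cheapest) are assumed for this sketch
# and are not published prices.
CANDIDATES = [
    # (model, task types it suits per the "Best for" notes, assumed cost tier)
    ("GPT-5 nano", {"general"}, 1),
    ("GPT-5 mini", {"general"}, 2),
    ("Gemini 3 Flash", {"multimodal"}, 2),
    ("Claude Sonnet 4.5", {"coding", "general"}, 3),
    ("GPT-5.2", {"coding", "general"}, 4),
    ("Gemini 3 Pro", {"multimodal"}, 4),
]


def detect_task_type(prompt, has_image=False):
    """Step 1: crude keyword-based task type detection."""
    if has_image:
        return "multimodal"
    code_markers = ("def ", "class ", "function", "stack trace", "compile")
    if any(marker in prompt.lower() for marker in code_markers):
        return "coding"
    return "general"


def complexity_score(prompt):
    """Step 2: toy complexity score in [0, 1] based on word count."""
    return min(len(prompt.split()) / 200, 1.0)


def select_model(prompt, max_cost_tier, has_image=False):
    """Step 3: pick a model for the task without exceeding the cost tier."""
    task = detect_task_type(prompt, has_image)
    allowed = [c for c in CANDIDATES if task in c[1] and c[2] <= max_cost_tier]
    if not allowed:
        raise ValueError(f"no model for task {task!r} within tier {max_cost_tier}")
    allowed.sort(key=lambda c: c[2])
    # Simple prompts take the cheapest option; complex ones take the
    # most expensive option still inside the budget.
    pick = allowed[-1] if complexity_score(prompt) >= 0.5 else allowed[0]
    return pick[0]


if __name__ == "__main__":
    print(select_model("Fix this function so it compiles", max_cost_tier=3))
```

The point of the sketch is the ordering of decisions: classify the task first, then let complexity decide how far up the allowed cost range to go.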
Multimodal Support
Perf supports multimodal inputs for compatible models:
Vision (Image Understanding)
- Gemini 3 Pro (best quality, all input types)
- GPT-5.2 (excellent vision support)
- Gemini 3 Flash (fastest multimodal)
- GPT-4.1 (legacy vision support)
Audio & Video
- Gemini 3 Pro (video, audio, PDF)
- Gemini 3 Flash (video, audio, PDF)
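The lists above can be read as a capability table. The sketch below transcribes them into a lookup so a caller can check which models accept a given mix of inputs before sending a request. The `MODALITIES` mapping and `models_supporting` helper are hypothetical names for illustration, not a Perf API.

```python
# Input types per model, transcribed from the Vision and Audio & Video lists.
MODALITIES = {
    "Gemini 3 Pro": {"image", "video", "audio", "pdf"},
    "Gemini 3 Flash": {"image", "video", "audio", "pdf"},
    "GPT-5.2": {"image"},
    "GPT-4.1": {"image"},
}


def models_supporting(*required):
    """Return models (sorted by name) that accept every required input type."""
    needed = set(required)
    return sorted(name for name, caps in MODALITIES.items() if needed <= caps)


if __name__ == "__main__":
    # Only the Gemini 3 models are listed for both image and video input.
    print(models_supporting("image", "video"))
```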
Provider Reliability
Historical Uptime (Last 90 Days)
Failover Strategy
Perf automatically handles provider issues.
Best Practices
Choose the Right Model
Optimize for Your Use Case
Context Window Best Practices
Optimal Context Usage
Cost Optimization
Example: Customer Support Chatbot
FAQ
Q: Which is the best model?
A: It depends on your task. For coding: GPT-5.2 or Claude Sonnet 4.5. For multimodal: Gemini 3 Pro. For cost: Gemini 2.5 Flash-Lite or GPT-5 nano. Use Perf auto-orchestration to let us choose.
Q: Can I use only one provider?
A: Yes. Configure this in Settings → Orchestration → Provider Preference.
Q: How often are new models added?
A: We add new models within days of provider release.
Q: Can I bring my own model?
A: Yes (Enterprise). Contact [email protected].
Q: Do you support fine-tuned models?
A: Yes, you can upload and deploy fine-tuned versions of supported models.
Q: What about model deprecations?
A: We handle migrations automatically when providers deprecate models.
Q: Do these models support function calling?
A: Yes, all GPT-5, Claude 4.5, and Gemini models support function/tool calling.
Next Steps
- Try different models in the Playground
- View model performance in Analytics
- Read API documentation
- Learn about cost optimization
Support
- Email: [email protected]
- Model Requests: [email protected]
- Documentation: docs.withperf.pro