Model Catalog
Explore our comprehensive collection of AI models for image, video, audio generation, and LLM.
Featured Models
Top picks from each category - the best models for getting started
FLUX.1 Schnell
Black Forest Labs
Ultra-fast image generation model optimized for speed. Generates high-quality images in just 1-2 seconds, perfect for real-time applications and rapid prototyping.
Kling 1.6 Pro
Kuaishou
State-of-the-art video generation with exceptional motion quality and scene understanding. Supports both text-to-video and image-to-video generation.
ElevenLabs Turbo
ElevenLabs
Industry-leading voice synthesis with the most natural-sounding AI voices and instant voice cloning.
GPT-4o
OpenAI
OpenAI's flagship multimodal model. Industry-leading performance in reasoning, coding, and creative tasks with native vision capabilities and structured output support.
How to Choose the Right Model
Need Speed?
flux-schnell, Gemini Flash
Need Quality?
FLUX Pro, Kling Pro, Claude
Budget-Friendly?
flux-schnell, MiniMax, Gemini
Most Versatile?
GPT-4o, Claude, FLUX Dev
Image Generation Models
Generate stunning images with FLUX, Stable Diffusion, and more
FLUX.1 Schnell
FeaturedBlack Forest Labs
Ultra-fast image generation model optimized for speed. Generates high-quality images in just 1-2 seconds, perfect for real-time applications and rapid prototyping.
FLUX.1 Pro
Black Forest Labs
Professional-grade image generation with superior quality and detail. Best choice for commercial projects requiring the highest visual fidelity.
FLUX.1 Dev
Black Forest Labs
Development-focused model offering a balance between speed and quality. Ideal for testing and development workflows.
Stable Diffusion XL
Stability AI
Industry-standard open-source image generation model. Highly versatile with excellent community support and fine-tuning options.
Seedream 4.0
ByteDance
ByteDance's latest image generation model with exceptional prompt understanding and creative capabilities.
Ideogram V2
Ideogram
Best-in-class text rendering in images. Perfect for logos, posters, and any design requiring accurate text.
Recraft V3
Recraft
Vector-style and design-focused image generation. Excellent for icons, illustrations, and design assets.
Video Generation Models
Create AI videos with Kling, MiniMax, and cutting-edge models
Kling 1.6 Pro
FeaturedKuaishou
State-of-the-art video generation with exceptional motion quality and scene understanding. Supports both text-to-video and image-to-video generation.
MiniMax Video-01
MiniMax
Fast and efficient video generation with good quality-to-speed ratio. Ideal for quick video prototyping.
Runway Gen-3 Alpha Turbo
Runway
Runway's latest video generation model with industry-leading quality and creative control options.
Luma Ray2
Luma AI
Luma's advanced video generation with exceptional 3D understanding and camera control.
Audio & TTS Models
Text-to-speech, voice cloning, and audio generation
Fish Speech
FeaturedFish Audio
High-quality multilingual text-to-speech with natural prosody and voice cloning capabilities.
ElevenLabs Turbo
ElevenLabs
Industry-leading voice synthesis with the most natural-sounding AI voices and instant voice cloning.
LLM Models
GPT-4o, Claude, Gemini - OpenAI-compatible chat API
GPT-4o
FeaturedOpenAI
OpenAI's flagship multimodal model. Industry-leading performance in reasoning, coding, and creative tasks with native vision capabilities and structured output support.
GPT-4o Mini
OpenAI
Cost-effective, fast model with strong performance. Best for high-volume tasks where speed and cost matter more than absolute capability.
OpenAI o1
OpenAI
OpenAI's most advanced reasoning model. Uses extended thinking time to solve complex problems in science, coding, and math with exceptional accuracy.
OpenAI o1-mini
OpenAI
Fast reasoning model optimized for coding and STEM tasks. Provides strong reasoning at lower cost than o1.
Claude 3.5 Sonnet
Anthropic
Anthropic's most intelligent model. Best-in-class for complex reasoning, nuanced understanding, and coding tasks with exceptional instruction following.
Claude 3 Opus
Anthropic
Anthropic's most powerful model for highly complex tasks. Exceptional at research, analysis, and creative projects requiring deep expertise.
Claude 3.5 Haiku
Anthropic
Fast, cost-effective model for everyday tasks. Great balance of speed, intelligence, and cost for high-volume applications.
Gemini 2.0 Flash
Google's fastest and most capable model. Features a massive 1M token context window, native multimodal support, and real-time capabilities.
Gemini 2.0 Flash Thinking
Experimental reasoning model with explicit thinking process. Shows step-by-step reasoning for complex problems.
Gemini 1.5 Pro
Google's production-ready model with excellent balance of capability and cost. Features 2M token context window for massive document processing.
Pricing Comparison
Compare credit costs across all models to find the best fit for your needs
| Category | Model | Credits | Unit | Best For |
|---|---|---|---|---|
| Image | flux-schnell | 4 | per image | Fast generation, real-time apps |
| flux-pro | 25 | per image | Best quality, commercial | |
| ideogram-v2 | 15 | per image | Text rendering, logos | |
| Video | kling-1.6-pro | 100 | per 5s | Best quality video |
| minimax-video-01 | 50 | per video | Cost-effective | |
| Audio | elevenlabs-turbo | 8 | per 1K chars | Best quality TTS |
| fish-speech | 5 | per 1K chars | Multilingual, voice clone | |
| LLM | gpt-4o | 3 | per 1K tokens | General, code, multimodal |
| claude-3.5-sonnet | 4 | per 1K tokens | Analysis, long-form, code | |
| gemini-2.0-flash | 1 | per 1K tokens | Ultra-fast, bulk processing |
Ready to Get Started?
Try our models in the Playground or follow the Quickstart guide