Description
Replicate is a platform for running open-source AI models through an API, billed per second according to the hardware used. It hosts 100+ official models with stable APIs and predictable pricing, plus thousands of community models spanning image generation (FLUX 2 Pro, Stable Diffusion), video (Kling, Veo 3), language (Llama 3, DeepSeek R1), and more. Users can also deploy their own custom models. CPU compute starts at $0.000025/second, and an H100 GPU costs $0.001525/second. A free tier with a limited number of runs is available for exploration, and volume discounts are available for large-scale usage. It is best suited to developers who want access to the latest open-source models without managing infrastructure.
Features
- Model Hub: 100+ official models and thousands of community AI models
- Pay-per-Second: Transparent billing based on actual hardware usage
- Custom Models: Deploy your own models with a simple API
- Auto-scaling: Infrastructure scales automatically with demand
- Instant Access: Run the latest models, such as FLUX 2 and Llama 3, immediately
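As a rough illustration of what API access to a hosted model can look like, the sketch below builds (and only optionally sends) an HTTP request to create a prediction. It assumes Replicate's public HTTP API at `https://api.replicate.com/v1`, a bearer token in the `REPLICATE_API_TOKEN` environment variable, and a hypothetical model slug `some-owner/some-model`. The `prompt` input key is also an assumption, since each model defines its own inputs. Treat it as a starting point and check the model's page for the exact schema.

```python
import json
import os
import urllib.request

# Assumed base URL of Replicate's HTTP API.
API_ROOT = "https://api.replicate.com/v1"


def build_prediction_request(owner: str, name: str, model_input: dict, token: str) -> urllib.request.Request:
    """Build a POST request that asks a hosted model to run on the given input."""
    url = f"{API_ROOT}/models/{owner}/{name}/predictions"
    body = json.dumps({"input": model_input}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    token = os.environ.get("REPLICATE_API_TOKEN")
    if token:  # only touch the network when a token is configured
        # "some-owner/some-model" and the "prompt" key are placeholders.
        req = build_prediction_request("some-owner", "some-model", {"prompt": "a red bicycle"}, token)
        with urllib.request.urlopen(req) as resp:
            print(json.load(resp).get("status"))
```

Keeping request construction separate from sending makes the call easy to inspect or test without spending any billed compute time.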
Pricing
- Limited free runs
- All models accessible
- API access
- CPU from $0.000025/sec
- H100 GPU $0.001525/sec
- 100+ official models
- Volume discounts
- Custom pricing
- Dedicated account
- Priority support
- SLA
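To make the per-second rates easier to reason about, here is a minimal cost-estimation sketch. It uses only the two rates quoted in this listing (CPU and H100). The function name `estimate_cost` and the hardware keys are illustrative, and the calculation ignores free runs, volume discounts, and any per-model pricing differences.

```python
# Per-second rates in USD, taken from the listing above.
RATES_USD_PER_SECOND = {
    "cpu": 0.000025,
    "h100": 0.001525,
}


def estimate_cost(hardware: str, seconds: float, runs: int = 1) -> float:
    """Estimate total cost in USD for `runs` runs of `seconds` each on `hardware`."""
    if seconds < 0 or runs < 0:
        raise ValueError("seconds and runs must be non-negative")
    return RATES_USD_PER_SECOND[hardware] * seconds * runs


if __name__ == "__main__":
    # 1,000 runs of 10 seconds each on an H100.
    print(f"${estimate_cost('h100', 10, runs=1000):.2f}")
    # One hour of continuous CPU time.
    print(f"${estimate_cost('cpu', 3600):.2f}")
```

At these rates, an hour of continuous use works out to about $5.49 on an H100 and about $0.09 on CPU.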