Authorization: Bearer. See the Cerebras API docs for provider-specific parameters.
Supports both managed (Lava’s API keys) and unmanaged (bring your own credentials) mode.
Quick Start
Chat Completions
Target URL:https://api.cerebras.ai/v1/chat/completions
| Content Type | application/json |
| Streaming | Yes (set stream: true in request body) |
| Model | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| qwen-3-235b-instruct | $0.60 | $1.20 |
| gpt-oss-120b | $0.35 | $0.75 |
| llama3.1-8b | Free | Free |
| llama-3.3-70b | Free | Free |
| deepseek-r1-distill-llama-70b | Free | Free |
| llama-4-scout-17b-16e-instruct | Free | Free |
| qwen-3-32b | Free | Free |
Next Steps
All Providers
Browse all supported AI providers
Forward Proxy
Learn how to construct proxy URLs and authenticate requests