Setup
Get your API key from Cerebras Cloud.Config
Use it
Models
| Model | Best for |
|---|---|
gpt-oss-cerebras | Fast general-purpose inference |
| Llama variants | Open model inference at speed |
Features
| Feature | Supported |
|---|---|
| Streaming | Yes |
| Tool use | Yes |
| Vision (images) | No |
Provider Details
| Provider ID | cerebras |
| Env variable | CEREBRAS_API_KEY |
| API type | OpenAI-compatible |
| Auto-infer prefix | gpt-oss- |
Notes
- Cerebras is one of the fastest inference providers available. Time-to-first-token is often under 100ms.
- Good for the same “fast-worker” pattern as Groq — assign high-volume, simpler tasks to Cerebras agents.