Cerebras offers 7 models through Lava’s AI Gateway, supporting Chat Completions. Authentication uses Authorization: Bearer. See the Cerebras API docs for provider-specific parameters.
Supports both managed API keys (from Lava) and BYOK mode.
Quick Start
const response = await fetch('https://api.lavapayments.com/v1/forward?u=https%3A%2F%2Fapi.cerebras.ai%2Fv1%2Fchat%2Fcompletions', {
method: 'POST',
headers: {
'Content-Type': 'application/json',
Authorization: `Bearer ${forwardToken}`,
},
body: JSON.stringify({
model: 'gpt-oss-120b',
messages: [{ role: "user", content: "Hello!" }],
}),
});
Chat Completions
Target URL: https://api.cerebras.ai/v1/chat/completions
| |
|---|
| Content Type | application/json |
| Streaming | Yes (set stream: true in request body) |
| Model | Input / 1M tokens | Output / 1M tokens |
|---|
| qwen-3-235b-instruct | $0.60 | $1.20 |
| gpt-oss-120b | $0.35 | $0.75 |
| llama3.1-8b | Free | Free |
| llama-3.3-70b | Free | Free |
| deepseek-r1-distill-llama-70b | Free | Free |
| llama-4-scout-17b-16e-instruct | Free | Free |
| qwen-3-32b | Free | Free |
Next Steps