Cerebras offers 7 models through Lava’s AI Gateway, all served through the Chat Completions endpoint. Requests authenticate with an Authorization: Bearer header. See the Cerebras API docs for provider-specific parameters.
The integration supports both managed API keys (from Lava) and bring-your-own-key (BYOK) mode.

Quick Start

// forwardToken must already hold your Lava credential; it is sent as the Bearer token.
// The u query parameter is the URL-encoded Cerebras target URL.
const response = await fetch('https://api.lavapayments.com/v1/forward?u=https%3A%2F%2Fapi.cerebras.ai%2Fv1%2Fchat%2Fcompletions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${forwardToken}`,
  },
  body: JSON.stringify({
    model: 'gpt-oss-120b',
    messages: [{ role: 'user', content: 'Hello!' }],
  }),
});
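
The Quick Start request stops at the fetch call. A minimal sketch of reading the reply follows, assuming the response body uses the common OpenAI-style Chat Completions shape (choices[0].message.content). That shape is not confirmed by this page, and extractReply is a hypothetical helper name.

```typescript
// ASSUMPTION: OpenAI-style response shape; check the Cerebras API docs
// for the authoritative schema.
interface ChatCompletionResponse {
  choices?: { message?: { role?: string; content?: string | null } }[];
}

// Hypothetical helper: returns the first choice's text, or null when the
// body does not have the assumed shape.
function extractReply(body: ChatCompletionResponse): string | null {
  return body.choices?.[0]?.message?.content ?? null;
}
```

With the Quick Start request, this would be called as extractReply(await response.json()) after checking response.ok.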

Chat Completions

Target URL: https://api.cerebras.ai/v1/chat/completions
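
The Quick Start URL is the gateway's forward endpoint with the target URL passed, URL-encoded, in the u query parameter. A small sketch of building it programmatically is shown here; buildForwardUrl and LAVA_FORWARD_ENDPOINT are illustrative names, not part of any Lava SDK.

```typescript
// Forward endpoint as it appears in the Quick Start request.
const LAVA_FORWARD_ENDPOINT = "https://api.lavapayments.com/v1/forward";

// Hypothetical helper: percent-encodes the provider URL into the `u` parameter.
function buildForwardUrl(targetUrl: string): string {
  return `${LAVA_FORWARD_ENDPOINT}?u=${encodeURIComponent(targetUrl)}`;
}

const chatCompletionsUrl = buildForwardUrl(
  "https://api.cerebras.ai/v1/chat/completions",
);
// chatCompletionsUrl matches the URL used in the Quick Start request.
```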
Content Type: application/json
Streaming: Yes (set stream: true in request body)
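
Streaming is enabled by adding stream: true to the request body. The sketch below builds such a body and extracts text from a streamed payload. It assumes the stream arrives as OpenAI-style server-sent events ("data: {json}" lines carrying choices[0].delta.content, ending with "data: [DONE]"); this page does not confirm that format, and both function names are hypothetical.

```typescript
// Hypothetical helper: same body as the Quick Start, with streaming enabled.
function buildStreamingBody(model: string, prompt: string) {
  return { model, messages: [{ role: "user", content: prompt }], stream: true };
}

// ASSUMPTION: OpenAI-style server-sent events. Verify the event format
// against the Cerebras API docs before relying on this parser.
function extractDeltas(sseText: string): string[] {
  const deltas: string[] = [];
  for (const line of sseText.split("\n")) {
    const trimmed = line.trim();
    if (!trimmed.startsWith("data:")) continue; // skip blank and non-data lines
    const payload = trimmed.slice("data:".length).trim();
    if (payload === "" || payload === "[DONE]") continue; // end-of-stream marker
    const event = JSON.parse(payload);
    const text = event.choices?.[0]?.delta?.content;
    if (typeof text === "string") deltas.push(text);
  }
  return deltas;
}
```

extractDeltas expects complete lines, so a caller reading response.body chunk by chunk would need to buffer text until a newline arrives before passing it in.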
Model                            Input / 1M tokens   Output / 1M tokens
qwen-3-235b-instruct             $0.60               $1.20
gpt-oss-120b                     $0.35               $0.75
llama3.1-8b                      Free                Free
llama-3.3-70b                    Free                Free
deepseek-r1-distill-llama-70b    Free                Free
llama-4-scout-17b-16e-instruct   Free                Free
qwen-3-32b                       Free                Free
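
As a rough illustration of how the per-million-token prices translate into a per-request cost, the sketch below multiplies token counts by the listed rates. PRICES_PER_MILLION_USD and estimateCostUsd are hypothetical names, free models are entered as 0, and the result reflects only the prices in the table above, not any other charges.

```typescript
// USD per 1M tokens, copied from the pricing table; "Free" is entered as 0.
const PRICES_PER_MILLION_USD: Record<string, { input: number; output: number }> = {
  "qwen-3-235b-instruct": { input: 0.6, output: 1.2 },
  "gpt-oss-120b": { input: 0.35, output: 0.75 },
  "llama3.1-8b": { input: 0, output: 0 },
  "llama-3.3-70b": { input: 0, output: 0 },
  "deepseek-r1-distill-llama-70b": { input: 0, output: 0 },
  "llama-4-scout-17b-16e-instruct": { input: 0, output: 0 },
  "qwen-3-32b": { input: 0, output: 0 },
};

// Hypothetical helper: estimated cost in USD for one request.
function estimateCostUsd(model: string, inputTokens: number, outputTokens: number): number {
  const price = PRICES_PER_MILLION_USD[model];
  if (!price) throw new Error(`Unknown model: ${model}`);
  return (inputTokens * price.input + outputTokens * price.output) / 1_000_000;
}
```

For example, 2,000 input tokens and 500 output tokens on gpt-oss-120b come to (2,000 × 0.35 + 500 × 0.75) / 1,000,000, or about $0.0011.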

Next Steps