Chutes exposes 20 models through Lava’s AI Gateway via the Chat Completions endpoint. Requests authenticate with an Authorization: Bearer header. See the Chutes API docs for provider-specific parameters.
Both managed API keys (issued by Lava) and bring-your-own-key (BYOK) mode are supported.

Quick Start

// forwardToken is assumed to be defined elsewhere; it is sent as the Bearer credential.
const response = await fetch('https://api.lavapayments.com/v1/forward?u=https%3A%2F%2Fllm.chutes.ai%2Fv1%2Fchat%2Fcompletions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${forwardToken}`,
  },
  body: JSON.stringify({
    model: 'deepseek-ai/DeepSeek-V3-0324',
    messages: [{ role: 'user', content: 'Hello!' }],
  }),
});
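
The `u` query parameter in the request above is the URL-encoded target endpoint. A minimal sketch of building that URL programmatically follows. It assumes the forward endpoint needs no other query parameters, and `buildForwardUrl` is a hypothetical helper name, not part of any Lava SDK.

```javascript
// Hypothetical helper: wraps a provider URL in Lava's forward endpoint.
// The base URL and the `u` parameter are taken from the quick-start request.
const LAVA_FORWARD_BASE = 'https://api.lavapayments.com/v1/forward';

function buildForwardUrl(targetUrl) {
  // encodeURIComponent escapes ':' and '/' so the target travels as a single query value.
  return `${LAVA_FORWARD_BASE}?u=${encodeURIComponent(targetUrl)}`;
}

const chutesChatUrl = buildForwardUrl('https://llm.chutes.ai/v1/chat/completions');
```

Passing `chutesChatUrl` to `fetch` in place of the hard-coded string should produce the same request as the quick start.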

Chat Completions

Target URL: https://llm.chutes.ai/v1/chat/completions
Content Type: application/json
Streaming: Yes (set stream: true in request body)
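
With `stream: true` set, the response arrives incrementally rather than as one JSON body. The sketch below shows one way to consume it, assuming the stream uses OpenAI-style server-sent events (`data: <json>` lines ending with `data: [DONE]`). That framing is an assumption, so check it against the Chutes API docs. `parseSseChunk` and `streamChat` are hypothetical names.

```javascript
// Assumption: OpenAI-style SSE framing ("data: <json>" lines, "data: [DONE]" at the end).
// Parses every complete line in `buffer` and returns the leftover partial line in `rest`.
function parseSseChunk(buffer) {
  const lines = buffer.split('\n');
  const rest = lines.pop(); // the last piece may be an incomplete line
  let text = '';
  let done = false;
  for (const line of lines) {
    const trimmed = line.trim();
    if (!trimmed.startsWith('data:')) continue; // skip blank lines and comments
    const payload = trimmed.slice(5).trim();
    if (payload === '[DONE]') {
      done = true;
      break;
    }
    const event = JSON.parse(payload);
    text += event.choices?.[0]?.delta?.content ?? '';
  }
  return { text, rest, done };
}

// Sends a streaming request through the forward URL and calls onText for each text delta.
async function streamChat(forwardToken, messages, onText) {
  const response = await fetch('https://api.lavapayments.com/v1/forward?u=https%3A%2F%2Fllm.chutes.ai%2Fv1%2Fchat%2Fcompletions', {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      Authorization: `Bearer ${forwardToken}`,
    },
    body: JSON.stringify({ model: 'deepseek-ai/DeepSeek-V3-0324', messages, stream: true }),
  });
  if (!response.ok) throw new Error(`Request failed with status ${response.status}`);

  const reader = response.body.getReader();
  const decoder = new TextDecoder();
  let buffer = '';
  while (true) {
    const { value, done } = await reader.read();
    if (done) break;
    buffer += decoder.decode(value, { stream: true });
    const parsed = parseSseChunk(buffer);
    buffer = parsed.rest; // keep the partial line for the next read
    if (parsed.text) onText(parsed.text);
    if (parsed.done) break;
  }
}
```

Keeping the parser separate from the network loop means it can be tested against recorded stream text without making a request.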
| Model | Input / 1M tokens | Output / 1M tokens |
| --- | --- | --- |
| tngtech/DeepSeek-TNG-R1T2-Chimera | $0.30 | $0.30 |
| deepseek-ai/DeepSeek-V3-0324 | $0.27 | $0.27 |
| deepseek-ai/DeepSeek-R1-0528 | $0.27 | $0.27 |
| deepseek-ai/DeepSeek-V3 | $0.27 | $0.27 |
| unsloth/gemma-3-27b-it | $0.16 | $0.16 |
| Qwen/Qwen2.5-72B-Instruct | $0.10 | $0.10 |
| Qwen/Qwen2.5-VL-72B-Instruct | $0.10 | $0.10 |
| NousResearch/DeepHermes-3-Mistral-24B-Preview | $0.07 | $0.07 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.05 | $0.05 |
| Qwen/Qwen3-30B-A3B | $0.03 | $0.03 |
| chutesai/Mistral-Small-3.2-24B-Instruct-2506 | $0.03 | $0.03 |
| Qwen/Qwen3-14B | $0.03 | $0.03 |
| Qwen/Qwen2.5-VL-32B-Instruct | $0.03 | $0.03 |
| Qwen/Qwen2.5-Coder-32B-Instruct | $0.03 | $0.03 |
| unsloth/gemma-3-12b-it | $0.03 | $0.03 |
| unsloth/Mistral-Small-24B-Instruct-2501 | $0.03 | $0.03 |
| Qwen/Qwen3-32B | $0.03 | $0.03 |
| chutesai/Mistral-Small-3.1-24B-Instruct-2503 | $0.03 | $0.03 |
| unsloth/Mistral-Nemo-Instruct-2407 | $0.02 | $0.02 |
| OpenGVLab/InternVL3-78B | $0.02 | $0.02 |
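
Since the rates above are quoted per 1M tokens, the cost of a request is each token count times its rate, divided by one million. A rough sketch of that arithmetic follows. `estimateCostUsd` and `RATES_PER_MILLION` are hypothetical names, only a few models are copied in, and any gateway fees or rounding that Lava applies are not modeled.

```javascript
// Listed per-1M-token rates in USD, copied from the pricing table (subset only).
const RATES_PER_MILLION = {
  'tngtech/DeepSeek-TNG-R1T2-Chimera': { input: 0.30, output: 0.30 },
  'deepseek-ai/DeepSeek-V3-0324': { input: 0.27, output: 0.27 },
  'Qwen/Qwen3-32B': { input: 0.03, output: 0.03 },
};

// Hypothetical helper: estimates request cost from token counts and the listed rates.
function estimateCostUsd(model, inputTokens, outputTokens) {
  const rate = RATES_PER_MILLION[model];
  if (!rate) throw new Error(`No listed rate for model: ${model}`);
  return (inputTokens * rate.input + outputTokens * rate.output) / 1_000_000;
}

// 1,000,000 input tokens plus 500,000 output tokens at $0.27 per 1M each.
const estimate = estimateCostUsd('deepseek-ai/DeepSeek-V3-0324', 1_000_000, 500_000);
console.log(estimate.toFixed(3));
```

For that example the arithmetic is 0.27 + 0.135, or about $0.405.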

Next Steps