Chutes offers 20 models through Lava’s AI Gateway, supporting Chat Completions. Authentication uses Authorization: Bearer. See the Chutes API docs for provider-specific parameters.
Supports both managed API keys (from Lava) and BYOK mode.
Quick Start
const response = await fetch('https://api.lavapayments.com/v1/forward?u=https%3A%2F%2Fllm.chutes.ai%2Fv1%2Fchat%2Fcompletions', {
method: 'POST',
headers: {
'Content-Type': 'application/json',
Authorization: `Bearer ${forwardToken}`,
},
body: JSON.stringify({
model: 'deepseek-ai/DeepSeek-V3-0324',
messages: [{ role: "user", content: "Hello!" }],
}),
});
Chat Completions
Target URL: https://llm.chutes.ai/v1/chat/completions
| |
|---|
| Content Type | application/json |
| Streaming | Yes (set stream: true in request body) |
| Model | Input / 1M tokens | Output / 1M tokens |
|---|
| tngtech/DeepSeek-TNG-R1T2-Chimera | $0.30 | $0.30 |
| deepseek-ai/DeepSeek-V3-0324 | $0.27 | $0.27 |
| deepseek-ai/DeepSeek-R1-0528 | $0.27 | $0.27 |
| deepseek-ai/DeepSeek-V3 | $0.27 | $0.27 |
| unsloth/gemma-3-27b-it | $0.16 | $0.16 |
| Qwen/Qwen2.5-72B-Instruct | $0.10 | $0.10 |
| Qwen/Qwen2.5-VL-72B-Instruct | $0.10 | $0.10 |
| NousResearch/DeepHermes-3-Mistral-24B-Preview | $0.07 | $0.07 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.05 | $0.05 |
| Qwen/Qwen3-30B-A3B | $0.03 | $0.03 |
| chutesai/Mistral-Small-3.2-24B-Instruct-2506 | $0.03 | $0.03 |
| Qwen/Qwen3-14B | $0.03 | $0.03 |
| Qwen/Qwen2.5-VL-32B-Instruct | $0.03 | $0.03 |
| Qwen/Qwen2.5-Coder-32B-Instruct | $0.03 | $0.03 |
| unsloth/gemma-3-12b-it | $0.03 | $0.03 |
| unsloth/Mistral-Small-24B-Instruct-2501 | $0.03 | $0.03 |
| Qwen/Qwen3-32B | $0.03 | $0.03 |
| chutesai/Mistral-Small-3.1-24B-Instruct-2503 | $0.03 | $0.03 |
| unsloth/Mistral-Nemo-Instruct-2407 | $0.02 | $0.02 |
| OpenGVLab/InternVL3-78B | $0.02 | $0.02 |
Next Steps