Chutes

Supports both managed (Lava’s API keys) and unmanaged (bring your own credentials) mode.

Quick Start

const response = await fetch('https://api.lava.so/v1/forward?u=https%3A%2F%2Fllm.chutes.ai%2Fv1%2Fchat%2Fcompletions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${forwardToken}`,
  },
  body: JSON.stringify({
    model: 'deepseek-ai/DeepSeek-V3-0324',
    messages: [{ role: "user", content: "Hello!" }],
  }),
});

Chat Completions

Target URL: https://llm.chutes.ai/v1/chat/completions


Content Type	`application/json`
Streaming	Yes (set `stream: true` in request body)

Model	Input / 1M tokens	Output / 1M tokens
tngtech/DeepSeek-TNG-R1T2-Chimera	$0.3024	$0.3024
deepseek-ai/DeepSeek-V3-0324	$0.27216	$0.27216
deepseek-ai/DeepSeek-R1-0528	$0.27216	$0.27216
deepseek-ai/DeepSeek-V3	$0.27216	$0.27216
unsloth/gemma-3-27b-it	$0.1568	$0.1568
Qwen/Qwen2.5-72B-Instruct	$0.1008	$0.1008
NousResearch/DeepHermes-3-Mistral-24B-Preview	$0.06552	$0.06552
deepseek-ai/DeepSeek-R1-Distill-Llama-70B	$0.0504	$0.0504
Qwen/Qwen3-30B-A3B	$0.03024	$0.03024
chutesai/Mistral-Small-3.2-24B-Instruct-2506	$0.03024	$0.03024
Qwen/Qwen3-14B	$0.03024	$0.03024
Qwen/Qwen2.5-VL-32B-Instruct	$0.03024	$0.03024
Qwen/Qwen2.5-Coder-32B-Instruct	$0.03024	$0.03024
unsloth/gemma-3-12b-it	$0.03024	$0.03024
unsloth/Mistral-Small-24B-Instruct-2501	$0.03024	$0.03024
Qwen/Qwen3-32B	$0.027216	$0.027216
chutesai/Mistral-Small-3.1-24B-Instruct-2503	$0.027216	$0.027216
unsloth/Mistral-Nemo-Instruct-2407	$0.01512	$0.01512

Quick Start

Chat Completions

Next Steps

All Providers

Forward Proxy

​Quick Start

​Chat Completions

​Next Steps

All Providers

Forward Proxy

Quick Start

Chat Completions

Next Steps