Skip to main content
together.ai offers 73 models through Lava’s AI Gateway, supporting Chat Completions, Embeddings. Authentication uses Authorization: Bearer. See the together.ai API docs for provider-specific parameters.
Supports both managed API keys (from Lava) and BYOK mode.

Quick Start

const response = await fetch('https://api.lavapayments.com/v1/forward?u=https%3A%2F%2Fapi.together.xyz%2Fv1%2Fchat%2Fcompletions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${forwardToken}`,
  },
  body: JSON.stringify({
    model: 'moonshotai/Kimi-K2.5',
    messages: [{ role: "user", content: "Hello!" }],
  }),
});

Chat Completions

Target URL: https://api.together.xyz/v1/chat/completions
Content Typeapplication/json
StreamingYes (set stream: true in request body)
ModelInput / 1M tokensOutput / 1M tokens
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo$3.50$3.50
deepseek-ai/DeepSeek-R1$3.00$7.00
perplexity-ai/r1-1776$3.00$7.00
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8$2.00$2.00
deepseek-ai/DeepSeek-R1-Distill-Llama-70B$2.00$2.00
Qwen/Qwen2.5-VL-72B-Instruct$1.95$8.00
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B$1.60$1.60
deepseek-ai/DeepSeek-V3$1.25$1.25
moonshotai/Kimi-K2-Thinking$1.20$4.00
Qwen/QwQ-32B$1.20$1.20
Qwen/Qwen2-VL-72B-Instruct$1.20$1.20
meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo$1.20$1.20
Qwen/Qwen2.5-72B-Instruct-Turbo$1.20$1.20
moonshotai/Kimi-K2-Instruct-0905$1.00$3.00
zai-org/GLM-5$1.00$3.20
Qwen/Qwen2-72B-Instruct$0.90$0.90
arcee-ai/maestro-reasoning$0.90$3.30
meta-llama/Llama-3.3-70B-Instruct-Turbo$0.88$0.88
scb10x/scb10x-llama3-1-typhoon2-70b-instruct$0.88$0.88
meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo$0.88$0.88
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF$0.88$0.88
meta-llama/Meta-Llama-3-70B-Instruct-Turbo$0.88$0.88
meta-llama/Llama-3-70b-chat-hf$0.88$0.88
mistralai/Mistral-Small-24B-Instruct-2501$0.80$0.80
Qwen/Qwen2.5-Coder-32B-Instruct$0.80$0.80
google/gemma-2-27b-it$0.80$0.80
arcee-ai/virtuoso-large$0.75$1.20
Qwen/Qwen3-235B-A22B-Thinking-2507$0.65$3.00
deepseek-ai/DeepSeek-V3.1$0.60$1.70
Qwen/Qwen3.5-397B-A17B$0.60$3.60
mistralai/Mixtral-8x7B-Instruct-v0.1$0.60$0.60
togethercomputer/Refuel-Llm-V2$0.60$0.60
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO$0.60$0.60
deepseek-ai/DeepSeek-R1-0528-tput$0.55$2.19
moonshotai/Kimi-K2.5$0.50$2.80
Qwen/Qwen3-Coder-Next-FP8$0.50$1.20
arcee-ai/coder-large$0.50$0.80
zai-org/GLM-4.7$0.45$2.00
Qwen/Qwen2.5-7B-Instruct-Turbo$0.30$0.30
upstage/SOLAR-10.7B-Instruct-v1.0$0.30$0.30
MiniMaxAI/MiniMax-M2.5$0.30$1.20
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8$0.27$0.85
zai-org/GLM-4.5-Air-FP8$0.20$1.10
Qwen/Qwen3-235B-A22B-fp8-tput$0.20$0.60
meta-llama/Llama-3-8b-chat-hf$0.20$0.20
mistralai/Mistral-7B-Instruct-v0.1$0.20$0.20
togethercomputer/Refuel-Llm-V2-Small$0.20$0.20
mistralai/Mistral-7B-Instruct-v0.2$0.20$0.20
mistralai/Mistral-7B-Instruct-v0.3$0.20$0.20
scb10x/scb10x-typhoon-2-1-gemma3-12b$0.20$0.20
meta-llama/Llama-4-Scout-17B-16E-Instruct$0.18$0.59
marin-community/marin-8b-instruct$0.18$0.18
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B$0.18$0.18
arcee_ai/arcee-spotlight$0.18$0.18
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo$0.18$0.18
meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo$0.18$0.18
openai/gpt-oss-120b$0.15$0.60
Qwen/Qwen3-Next-80B-A3B-Instruct$0.15$1.50
Qwen/Qwen3-Next-80B-A3B-Thinking$0.15$1.50
meta-llama/Meta-Llama-3-8B-Instruct-Lite$0.10$0.10
meta-llama/Llama-3.2-3B-Instruct-Turbo$0.06$0.06
openai/gpt-oss-20b$0.05$0.20
google/gemma-3n-E4B-it$0.02$0.04
meta-llama/Llama-3.3-70B-Instruct-Turbo-FreeFreeFree
lgai/exaone-3-5-32b-instructFreeFree
deepseek-ai/DeepSeek-R1-Distill-Llama-70B-freeFreeFree
meta-llama/Llama-Vision-FreeFreeFree

Embeddings

Target URL: https://api.together.xyz/v1/embeddings
Content Typeapplication/json
StreamingNo
ModelInput / 1M tokensOutput / 1M tokens
Alibaba-NLP/gte-modernbert-base$0.08Free
intfloat/multilingual-e5-large-instruct$0.02Free
BAAI/bge-large-en-v1.5$0.02Free
togethercomputer/m2-bert-80M-32k-retrieval$0.01Free
BAAI/bge-base-en-v1.5-vllm$0.01Free
BAAI/bge-base-en-v1.5$0.01Free

Next Steps