> ## Documentation Index
> Fetch the complete documentation index at: https://lava.so/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# Together

> Together AI offers the widest open-source model selection in the market, providing access to Llama 4, DeepSeek R1, Mixtral, and 100+ other models through a single OpenAI-compatible API.

Together offers 81 models through Lava's AI Gateway, supporting Chat Completions, Embeddings. Authentication uses `Authorization: Bearer`. See the [Together API docs](https://docs.together.ai/reference) for provider-specific parameters.

<Info>Supports both **managed** (Lava's API keys) and **unmanaged** (bring your own credentials) mode.</Info>

## Quick Start

```typescript theme={null}
const response = await fetch('https://api.lava.so/v1/forward?u=https%3A%2F%2Fapi.together.xyz%2Fv1%2Fchat%2Fcompletions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${forwardToken}`,
  },
  body: JSON.stringify({
    model: 'moonshotai/Kimi-K2.5',
    messages: [{ role: "user", content: "Hello!" }],
  }),
});
```

## Chat Completions

**Target URL:** `https://api.together.xyz/v1/chat/completions`

|                  |                                          |
| ---------------- | ---------------------------------------- |
| **Content Type** | `application/json`                       |
| **Streaming**    | Yes (set `stream: true` in request body) |

| Model                                             | Input / 1M tokens | Output / 1M tokens |
| ------------------------------------------------- | ----------------- | ------------------ |
| meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo     | \$3.50            | \$3.50             |
| deepseek-ai/DeepSeek-R1                           | \$3.00            | \$7.00             |
| perplexity-ai/r1-1776                             | \$3.00            | \$7.00             |
| Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8           | \$2.00            | \$2.00             |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B         | \$2.00            | \$2.00             |
| Qwen/Qwen2.5-VL-72B-Instruct                      | \$1.95            | \$8.00             |
| deepseek-ai/DeepSeek-V4-Pro                       | \$1.74            | \$3.48             |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B          | \$1.60            | \$1.60             |
| zai-org/GLM-5.1                                   | \$1.40            | \$4.40             |
| Qwen/Qwen3.7-Max                                  | \$1.25            | \$3.75             |
| deepseek-ai/DeepSeek-V3                           | \$1.25            | \$1.25             |
| moonshotai/Kimi-K2-Thinking                       | \$1.20            | \$4.00             |
| moonshotai/Kimi-K2.6                              | \$1.20            | \$4.50             |
| Qwen/QwQ-32B                                      | \$1.20            | \$1.20             |
| Qwen/Qwen2-VL-72B-Instruct                        | \$1.20            | \$1.20             |
| meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo    | \$1.20            | \$1.20             |
| Qwen/Qwen2.5-72B-Instruct-Turbo                   | \$1.20            | \$1.20             |
| meta-llama/Llama-3.3-70B-Instruct-Turbo           | \$1.04            | \$1.04             |
| moonshotai/Kimi-K2-Instruct-0905                  | \$1.00            | \$3.00             |
| zai-org/GLM-5                                     | \$1.00            | \$3.20             |
| Qwen/Qwen2-72B-Instruct                           | \$0.90            | \$0.90             |
| arcee-ai/maestro-reasoning                        | \$0.90            | \$3.30             |
| scb10x/scb10x-llama3-1-typhoon2-70b-instruct      | \$0.88            | \$0.88             |
| meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo      | \$0.88            | \$0.88             |
| nvidia/Llama-3.1-Nemotron-70B-Instruct-HF         | \$0.88            | \$0.88             |
| meta-llama/Meta-Llama-3-70B-Instruct-Turbo        | \$0.88            | \$0.88             |
| meta-llama/Llama-3-70b-chat-hf                    | \$0.88            | \$0.88             |
| mistralai/Mistral-Small-24B-Instruct-2501         | \$0.80            | \$0.80             |
| Qwen/Qwen2.5-Coder-32B-Instruct                   | \$0.80            | \$0.80             |
| google/gemma-2-27b-it                             | \$0.80            | \$0.80             |
| arcee-ai/virtuoso-large                           | \$0.75            | \$1.20             |
| Qwen/Qwen3-235B-A22B-Thinking-2507                | \$0.65            | \$3.00             |
| zai-org/GLM-4.6                                   | \$0.60            | \$2.20             |
| deepseek-ai/DeepSeek-V3.1                         | \$0.60            | \$1.70             |
| Qwen/Qwen3.5-397B-A17B                            | \$0.60            | \$3.60             |
| mistralai/Mixtral-8x7B-Instruct-v0.1              | \$0.60            | \$0.60             |
| togethercomputer/Refuel-Llm-V2                    | \$0.60            | \$0.60             |
| NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO       | \$0.60            | \$0.60             |
| deepseek-ai/DeepSeek-R1-0528-tput                 | \$0.55            | \$2.19             |
| moonshotai/Kimi-K2.5                              | \$0.50            | \$2.80             |
| Qwen/Qwen3.6-Plus                                 | \$0.50            | \$3.00             |
| Qwen/Qwen3-Coder-Next-FP8                         | \$0.50            | \$1.20             |
| arcee-ai/coder-large                              | \$0.50            | \$0.80             |
| zai-org/GLM-4.7                                   | \$0.45            | \$2.00             |
| MiniMaxAI/MiniMax-M2.7                            | \$0.30            | \$1.20             |
| Qwen/Qwen2.5-7B-Instruct-Turbo                    | \$0.30            | \$0.30             |
| upstage/SOLAR-10.7B-Instruct-v1.0                 | \$0.30            | \$0.30             |
| MiniMaxAI/MiniMax-M2.5                            | \$0.30            | \$1.20             |
| meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | \$0.27            | \$0.85             |
| zai-org/GLM-4.5-Air-FP8                           | \$0.20            | \$1.10             |
| Qwen/Qwen3-235B-A22B-fp8-tput                     | \$0.20            | \$0.60             |
| meta-llama/Llama-3-8b-chat-hf                     | \$0.20            | \$0.20             |
| mistralai/Mistral-7B-Instruct-v0.1                | \$0.20            | \$0.20             |
| togethercomputer/Refuel-Llm-V2-Small              | \$0.20            | \$0.20             |
| mistralai/Mistral-7B-Instruct-v0.2                | \$0.20            | \$0.20             |
| mistralai/Mistral-7B-Instruct-v0.3                | \$0.20            | \$0.20             |
| scb10x/scb10x-typhoon-2-1-gemma3-12b              | \$0.20            | \$0.20             |
| meta-llama/Llama-4-Scout-17B-16E-Instruct         | \$0.18            | \$0.59             |
| marin-community/marin-8b-instruct                 | \$0.18            | \$0.18             |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B         | \$0.18            | \$0.18             |
| arcee\_ai/arcee-spotlight                         | \$0.18            | \$0.18             |
| meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo       | \$0.18            | \$0.18             |
| meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo    | \$0.18            | \$0.18             |
| Qwen/Qwen3.5-9B                                   | \$0.17            | \$0.25             |
| openai/gpt-oss-120b                               | \$0.15            | \$0.60             |
| Qwen/Qwen3-Next-80B-A3B-Instruct                  | \$0.15            | \$1.50             |
| Qwen/Qwen3-Next-80B-A3B-Thinking                  | \$0.15            | \$1.50             |
| meta-llama/Meta-Llama-3-8B-Instruct-Lite          | \$0.14            | \$0.14             |
| google/gemma-3n-E4B-it                            | \$0.06            | \$0.12             |
| meta-llama/Llama-3.2-3B-Instruct-Turbo            | \$0.06            | \$0.06             |
| openai/gpt-oss-20b                                | \$0.05            | \$0.20             |
| meta-llama/Llama-3.3-70B-Instruct-Turbo-Free      | Free              | Free               |
| lgai/exaone-3-5-32b-instruct                      | Free              | Free               |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B-free    | Free              | Free               |
| meta-llama/Llama-Vision-Free                      | Free              | Free               |

## Embeddings

**Target URL:** `https://api.together.xyz/v1/embeddings`

|                  |                    |
| ---------------- | ------------------ |
| **Content Type** | `application/json` |
| **Streaming**    | No                 |

| Model                                      | Input / 1M tokens | Output / 1M tokens |
| ------------------------------------------ | ----------------- | ------------------ |
| Alibaba-NLP/gte-modernbert-base            | \$0.08            | Free               |
| intfloat/multilingual-e5-large-instruct    | \$0.02            | Free               |
| BAAI/bge-large-en-v1.5                     | \$0.02            | Free               |
| togethercomputer/m2-bert-80M-32k-retrieval | \$0.01            | Free               |
| BAAI/bge-base-en-v1.5-vllm                 | \$0.01            | Free               |
| BAAI/bge-base-en-v1.5                      | \$0.01            | Free               |

## Next Steps

<CardGroup cols={2}>
  <Card title="All Providers" icon="grid" href="/gateway/supported-providers">
    Browse all supported AI providers
  </Card>

  <Card title="Forward Proxy" icon="route" href="/gateway/forward-proxy">
    Learn how to construct proxy URLs and authenticate requests
  </Card>
</CardGroup>