> ## Documentation Index
> Fetch the complete documentation index at: https://lava.so/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# Groq

> Ultra-fast LLM inference using custom LPU hardware.

Groq offers 11 models through Lava's AI Gateway, supporting Chat Completions, Transcription. Authentication uses `Authorization: Bearer`. See the [Groq API docs](https://console.groq.com/docs/api-reference) for provider-specific parameters.

<Info>Supports both **managed** (Lava's API keys) and **unmanaged** (bring your own credentials) mode.</Info>

## Quick Start

```typescript theme={null}
const response = await fetch('https://api.lava.so/v1/forward?u=https%3A%2F%2Fapi.groq.com%2Fopenai%2Fv1%2Fchat%2Fcompletions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${forwardToken}`,
  },
  body: JSON.stringify({
    model: 'openai/gpt-oss-safeguard-20b',
    messages: [{ role: "user", content: "Hello!" }],
  }),
});
```

## Chat Completions

**Target URL:** `https://api.groq.com/openai/v1/chat/completions`

|                  |                                          |
| ---------------- | ---------------------------------------- |
| **Content Type** | `application/json`                       |
| **Streaming**    | Yes (set `stream: true` in request body) |

| Model                                     | Input / 1M tokens | Output / 1M tokens |
| ----------------------------------------- | ----------------- | ------------------ |
| moonshotai/kimi-k2-instruct-0905          | \$1.00            | \$3.00             |
| llama-3.3-70b-versatile                   | \$0.59            | \$0.79             |
| qwen/qwen3-32b                            | \$0.29            | \$0.59             |
| openai/gpt-oss-120b                       | \$0.15            | \$0.60             |
| meta-llama/llama-4-scout-17b-16e-instruct | \$0.11            | \$0.34             |
| openai/gpt-oss-safeguard-20b              | \$0.075           | \$0.30             |
| openai/gpt-oss-20b                        | \$0.075           | \$0.30             |
| llama-3.1-8b-instant                      | \$0.05            | \$0.08             |

## Transcription

**Target URL:** `https://api.groq.com/openai/v1/audio/transcriptions`

|                  |                       |
| ---------------- | --------------------- |
| **Content Type** | `multipart/form-data` |
| **Streaming**    | No                    |

| Model                  | Input / minute | Output / minute |
| ---------------------- | -------------- | --------------- |
| whisper-large-v3       | \$0.00185      | Free            |
| whisper-large-v3-turbo | \$0.001        | Free            |
| whisper-large-v3-en    | \$0.0003       | Free            |

## Next Steps

<CardGroup cols={2}>
  <Card title="All Providers" icon="grid" href="/gateway/supported-providers">
    Browse all supported AI providers
  </Card>

  <Card title="Forward Proxy" icon="route" href="/gateway/forward-proxy">
    Learn how to construct proxy URLs and authenticate requests
  </Card>
</CardGroup>
