Novita AI offers 38 models through Lava’s AI Gateway, supporting Chat Completions and Embeddings. Requests authenticate with an Authorization: Bearer header. See the Novita AI API docs for provider-specific parameters.
Novita AI works with both managed API keys (from Lava) and BYOK (bring your own key) mode.

Quick Start

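// `forwardToken` must hold your Lava forward token.
// The `u` query parameter is the URL-encoded Novita AI target URL.
const response = await fetch('https://api.lavapayments.com/v1/forward?u=https%3A%2F%2Fapi.novita.ai%2Fv3%2Fopenai%2Fchat%2Fcompletions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${forwardToken}`,
  },
  body: JSON.stringify({
    model: 'qwen/qwen3-4b-fp8',
    messages: [{ role: 'user', content: 'Hello!' }],
  }),
});

// Parse the JSON response body.
const data = await response.json();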
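
The forward URL in the request above is the Lava forward endpoint with the provider's target URL percent-encoded into the `u` query parameter. A minimal sketch of building it programmatically follows; `buildForwardUrl` and `LAVA_FORWARD_URL` are hypothetical names introduced here for illustration, not part of any Lava SDK.

```javascript
// Base forward endpoint, taken from the Quick Start request.
const LAVA_FORWARD_URL = 'https://api.lavapayments.com/v1/forward';

// Hypothetical helper: wraps a provider URL in the forward URL by
// percent-encoding it into the `u` query parameter.
function buildForwardUrl(targetUrl) {
  return `${LAVA_FORWARD_URL}?u=${encodeURIComponent(targetUrl)}`;
}

const chatUrl = buildForwardUrl('https://api.novita.ai/v3/openai/chat/completions');
```

Using `encodeURIComponent` rather than hand-writing the escaped string avoids mistakes when the target URL changes, for example when switching to the Embeddings endpoint.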

Chat Completions

Target URL: https://api.novita.ai/v3/openai/chat/completions
Content Type: application/json
Streaming: Yes (set stream: true in request body)
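
As a rough illustration of the streaming option, the sketch below sends `stream: true` and reads the response incrementally. It assumes the endpoint emits OpenAI-style server-sent events (`data: {json}` lines ending with `data: [DONE]`), which the `/openai/` path suggests but this page does not confirm. `extractDeltas` and `streamChat` are hypothetical helper names.

```javascript
// Extracts text deltas from a block of complete SSE lines.
// Assumption: OpenAI-style events, i.e. `data: {json}` lines and a final `data: [DONE]`.
function extractDeltas(sseText) {
  const deltas = [];
  for (const line of sseText.split('\n')) {
    const trimmed = line.trim();
    if (!trimmed.startsWith('data:')) continue; // skip blank lines and comments
    const payload = trimmed.slice('data:'.length).trim();
    if (payload === '[DONE]') break;
    const content = JSON.parse(payload).choices?.[0]?.delta?.content;
    if (content) deltas.push(content);
  }
  return deltas;
}

// Hypothetical helper: streams a chat completion and calls onText for each text delta.
async function streamChat(forwardToken, prompt, onText) {
  const target = 'https://api.novita.ai/v3/openai/chat/completions';
  const response = await fetch(
    `https://api.lavapayments.com/v1/forward?u=${encodeURIComponent(target)}`,
    {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${forwardToken}`,
      },
      body: JSON.stringify({
        model: 'qwen/qwen3-4b-fp8',
        messages: [{ role: 'user', content: prompt }],
        stream: true,
      }),
    },
  );
  if (!response.ok) throw new Error(`Request failed: ${response.status}`);

  const reader = response.body.getReader();
  const decoder = new TextDecoder();
  let buffer = '';
  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    buffer += decoder.decode(value, { stream: true });
    const lastNewline = buffer.lastIndexOf('\n');
    if (lastNewline === -1) continue;
    // Parse only complete lines; keep the partial tail for the next chunk.
    for (const text of extractDeltas(buffer.slice(0, lastNewline))) onText(text);
    buffer = buffer.slice(lastNewline + 1);
  }
  // Flush a final line that arrived without a trailing newline.
  if (buffer.trim()) {
    for (const text of extractDeltas(buffer)) onText(text);
  }
}
```

A call such as `await streamChat(forwardToken, 'Hello!', (text) => process.stdout.write(text))` would print tokens as they arrive, provided the event format matches the assumption above.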
| Model | Input / 1M tokens | Output / 1M tokens |
| --- | --- | --- |
| deepseek/deepseek-r1 | $4.00 | $4.00 |
| sao10k/l3-70b-euryale-v2.1 | $1.48 | $1.48 |
| sao10k/l31-70b-euryale-v2.2 | $1.48 | $1.48 |
| deepseek/deepseek_v3 | $0.89 | $0.89 |
| qwen/qwen2.5-vl-72b-instruct | $0.80 | $0.80 |
| deepseek/deepseek-r1-distill-llama-70b | $0.80 | $0.80 |
| deepseek/deepseek-prover-v2-671b | $0.70 | $2.50 |
| deepseek/deepseek-r1-0528 | $0.70 | $2.50 |
| deepseek/deepseek-r1-turbo | $0.70 | $2.50 |
| microsoft/wizardlm-2-8x22b | $0.62 | $0.62 |
| meta-llama/llama-3-70b-instruct | $0.51 | $0.74 |
| deepseek/deepseek-v3-turbo | $0.40 | $1.30 |
| qwen/qwen-2.5-72b-instruct | $0.38 | $0.40 |
| deepseek/deepseek-v3-0324 | $0.33 | $1.30 |
| deepseek/deepseek-r1-distill-qwen-32b | $0.30 | $0.30 |
| thudm/glm-4-32b-0414 | $0.24 | $0.24 |
| qwen/qwen3-235b-a22b-fp8 | $0.20 | $0.80 |
| meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | $0.17 | $0.85 |
| deepseek/deepseek-r1-distill-qwen-14b | $0.15 | $0.15 |
| nousresearch/hermes-2-pro-llama-3-8b | $0.14 | $0.14 |
| meta-llama/llama-3.3-70b-instruct | $0.13 | $0.39 |
| google/gemma-3-27b-it | $0.12 | $0.20 |
| qwen/qwen3-30b-a3b-fp8 | $0.10 | $0.45 |
| qwen/qwen3-32b-fp8 | $0.10 | $0.45 |
| meta-llama/llama-4-scout-17b-16e-instruct | $0.10 | $0.50 |
| gryphe/mythomax-l2-13b | $0.09 | $0.09 |
| meta-llama/llama-3.1-8b-instruct-bf16 | $0.06 | $0.06 |
| Sao10K/L3-8B-Stheno-v3.2 | $0.05 | $0.05 |
| sao10k/l3-8b-lunaris | $0.05 | $0.05 |
| mistralai/mistral-nemo | $0.04 | $0.17 |
| meta-llama/llama-3-8b-instruct | $0.04 | $0.04 |
| qwen/qwen3-8b-fp8 | $0.04 | $0.14 |
| meta-llama/llama-3.2-3b-instruct | $0.03 | $0.05 |
| meta-llama/llama-3.1-8b-instruct | $0.02 | $0.05 |
| qwen/qwen3-4b-fp8 | Free | Free |
| qwen/qwen2.5-7b-instruct | Free | Free |
| meta-llama/llama-3.2-1b-instruct | Free | Free |
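
Because prices are quoted per 1M tokens, the cost of a single request is `(inputTokens × inputPrice + outputTokens × outputPrice) / 1,000,000`. The sketch below applies that arithmetic to a few of the listed models. `PRICES` and `estimateCostUsd` are hypothetical names, and the result covers only the listed per-token prices.

```javascript
// USD per 1M tokens, copied from the listed chat models (a small subset).
const PRICES = {
  'deepseek/deepseek-v3-0324': { input: 0.33, output: 1.30 },
  'meta-llama/llama-3.3-70b-instruct': { input: 0.13, output: 0.39 },
  'qwen/qwen3-4b-fp8': { input: 0, output: 0 }, // listed as Free
};

// Hypothetical helper: estimates the cost of one request in USD.
function estimateCostUsd(model, inputTokens, outputTokens) {
  const price = PRICES[model];
  if (!price) throw new Error(`No price listed for ${model}`);
  return (inputTokens * price.input + outputTokens * price.output) / 1_000_000;
}

// 2,000 input tokens and 500 output tokens on deepseek/deepseek-v3-0324:
// (2000 * 0.33 + 500 * 1.30) / 1,000,000 = 0.00131 USD (approximately, in floating point).
const exampleCost = estimateCostUsd('deepseek/deepseek-v3-0324', 2000, 500);
```

For models with asymmetric pricing, such as deepseek/deepseek-v3-0324, output tokens dominate the cost of long generations, so the two counts are worth tracking separately.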

Embeddings

Target URL: https://api.novita.ai/v3/openai/embeddings
Content Type: application/json
Streaming: No

| Model | Input | Output |
| --- | --- | --- |
| baai/bge-m3 | Free | Free |
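
An Embeddings request follows the same forwarding pattern as Chat Completions, with the Embeddings target URL encoded into the `u` parameter. The sketch below assumes an OpenAI-compatible request body (`model` and `input`) and response shape (`data[].embedding`), which the `/openai/` path suggests but this page does not spell out; consult the Novita AI API docs for the exact parameters. `buildEmbeddingsRequest` and `embed` are hypothetical helper names.

```javascript
// Hypothetical helper: builds the URL and fetch options for an embeddings call.
function buildEmbeddingsRequest(forwardToken, input) {
  const target = 'https://api.novita.ai/v3/openai/embeddings';
  return {
    url: `https://api.lavapayments.com/v1/forward?u=${encodeURIComponent(target)}`,
    options: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${forwardToken}`,
      },
      // Assumption: OpenAI-style body with `model` and `input`.
      body: JSON.stringify({ model: 'baai/bge-m3', input }),
    },
  };
}

// Hypothetical helper: sends the request and returns one vector per input.
async function embed(forwardToken, input) {
  const { url, options } = buildEmbeddingsRequest(forwardToken, input);
  const response = await fetch(url, options);
  if (!response.ok) throw new Error(`Embeddings request failed: ${response.status}`);
  const json = await response.json();
  // Assumption: OpenAI-style response, { data: [{ embedding: [...] }, ...] }.
  return json.data.map((item) => item.embedding);
}
```

Keeping request construction separate from the network call makes the URL and body easy to inspect or log before anything is sent.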

Next Steps