Novita offers 37 models through Lava’s AI Gateway, supporting Chat Completions, Embeddings. Authentication uses Authorization: Bearer. See the Novita API docs for provider-specific parameters.
Supports both managed (Lava’s API keys) and unmanaged (bring your own credentials) mode.
Quick Start
const response = await fetch ( 'https://api.lava.so/v1/forward?u=https%3A%2F%2Fapi.novita.ai%2Fv3%2Fopenai%2Fchat%2Fcompletions' , {
method: 'POST' ,
headers: {
'Content-Type' : 'application/json' ,
Authorization: `Bearer ${ forwardToken } ` ,
},
body: JSON . stringify ({
model: 'qwen/qwen3-4b-fp8' ,
messages: [{ role: "user" , content: "Hello!" }],
}),
});
Chat Completions
Target URL: https://api.novita.ai/v3/openai/chat/completions
Content Type application/jsonStreaming Yes (set stream: true in request body)
Model Input / 1M tokens Output / 1M tokens deepseek/deepseek-r1 $4.00 $4.00 sao10k/l3-70b-euryale-v2.1 $1.48 $1.48 sao10k/l31-70b-euryale-v2.2 $1.48 $1.48 deepseek/deepseek_v3 $0.89 $0.89 qwen/qwen2.5-vl-72b-instruct $0.80 $0.80 deepseek/deepseek-r1-distill-llama-70b $0.80 $0.80 deepseek/deepseek-prover-v2-671b $0.70 $2.50 deepseek/deepseek-r1-0528 $0.70 $2.50 deepseek/deepseek-r1-turbo $0.70 $2.50 microsoft/wizardlm-2-8x22b $0.62 $0.62 meta-llama/llama-3-70b-instruct $0.51 $0.74 deepseek/deepseek-v3-turbo $0.40 $1.30 qwen/qwen-2.5-72b-instruct $0.38 $0.40 deepseek/deepseek-v3-0324 $0.33 $1.30 deepseek/deepseek-r1-distill-qwen-32b $0.30 $0.30 thudm/glm-4-32b-0414 $0.24 $0.24 qwen/qwen3-235b-a22b-fp8 $0.20 $0.80 meta-llama/llama-4-maverick-17b-128e-instruct-fp8 $0.17 $0.85 deepseek/deepseek-r1-distill-qwen-14b $0.15 $0.15 nousresearch/hermes-2-pro-llama-3-8b $0.14 $0.14 meta-llama/llama-3.3-70b-instruct $0.13 $0.39 google/gemma-3-27b-it $0.119 $0.20 qwen/qwen3-30b-a3b-fp8 $0.10 $0.45 qwen/qwen3-32b-fp8 $0.10 $0.45 meta-llama/llama-4-scout-17b-16e-instruct $0.10 $0.50 gryphe/mythomax-l2-13b $0.09 $0.09 qwen/qwen2.5-7b-instruct $0.07 $0.07 Sao10K/L3-8B-Stheno-v3.2 $0.05 $0.05 sao10k/l3-8b-lunaris $0.05 $0.05 mistralai/mistral-nemo $0.04 $0.17 meta-llama/llama-3-8b-instruct $0.04 $0.04 qwen/qwen3-8b-fp8 $0.035 $0.138 qwen/qwen3-4b-fp8 $0.03 $0.03 meta-llama/llama-3.2-3b-instruct $0.03 $0.05 meta-llama/llama-3.2-1b-instruct $0.02 $0.05 meta-llama/llama-3.1-8b-instruct $0.02 $0.05
Embeddings
Target URL: https://api.novita.ai/v3/openai/embeddings
Content Type application/jsonStreaming No
Model Input Output baai/bge-m3 Free Free
Next Steps
All Providers Browse all supported AI providers
Forward Proxy Learn how to construct proxy URLs and authenticate requests