Skip to main content
Chutes offers 18 models through Lava’s AI Gateway, supporting Chat Completions. Authentication uses Authorization: Bearer. See the Chutes API docs for provider-specific parameters.
Supports both managed (Lava’s API keys) and unmanaged (bring your own credentials) mode.

Quick Start

const response = await fetch('https://api.lava.so/v1/forward?u=https%3A%2F%2Fllm.chutes.ai%2Fv1%2Fchat%2Fcompletions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${forwardToken}`,
  },
  body: JSON.stringify({
    model: 'deepseek-ai/DeepSeek-V3-0324',
    messages: [{ role: "user", content: "Hello!" }],
  }),
});

Chat Completions

Target URL: https://llm.chutes.ai/v1/chat/completions
Content Typeapplication/json
StreamingYes (set stream: true in request body)
ModelInput / 1M tokensOutput / 1M tokens
tngtech/DeepSeek-TNG-R1T2-Chimera$0.3024$0.3024
deepseek-ai/DeepSeek-V3-0324$0.27216$0.27216
deepseek-ai/DeepSeek-R1-0528$0.27216$0.27216
deepseek-ai/DeepSeek-V3$0.27216$0.27216
unsloth/gemma-3-27b-it$0.1568$0.1568
Qwen/Qwen2.5-72B-Instruct$0.1008$0.1008
NousResearch/DeepHermes-3-Mistral-24B-Preview$0.06552$0.06552
deepseek-ai/DeepSeek-R1-Distill-Llama-70B$0.0504$0.0504
Qwen/Qwen3-30B-A3B$0.03024$0.03024
chutesai/Mistral-Small-3.2-24B-Instruct-2506$0.03024$0.03024
Qwen/Qwen3-14B$0.03024$0.03024
Qwen/Qwen2.5-VL-32B-Instruct$0.03024$0.03024
Qwen/Qwen2.5-Coder-32B-Instruct$0.03024$0.03024
unsloth/gemma-3-12b-it$0.03024$0.03024
unsloth/Mistral-Small-24B-Instruct-2501$0.03024$0.03024
Qwen/Qwen3-32B$0.027216$0.027216
chutesai/Mistral-Small-3.1-24B-Instruct-2503$0.027216$0.027216
unsloth/Mistral-Nemo-Instruct-2407$0.01512$0.01512

Next Steps

All Providers

Browse all supported AI providers

Forward Proxy

Learn how to construct proxy URLs and authenticate requests