Docsone key · every model · OpenAI-compatible

Integrate every model in minutes

Keep the OpenAI SDK you already use. Point it at NexusAI, swap the key, and call GPT, Claude, Gemini, DeepSeek, Qwen, GLM, Kimi or Doubao by changing one string.

One key, every model

Drop-in OpenAI-compatible API. Keep your SDK, change the base URL and key, then call any model — Western or Chinese — by its id.

Quickstart

Authenticate once.
Reach every provider.

No per-vendor accounts, no Chinese phone verification, no separate billing. One NexusAI key authenticates GPT, Claude, Gemini, Grok and DeepSeek, Qwen, GLM, Kimi, Doubao behind a single global endpoint.

Base URL

https://api.nexusai.com/v1

Auth header

Authorization: Bearer sk-nexus-•••••••••

SDK

openai · @anthropic-ai/sdk · any OpenAI client
Model IDs
gpt-5.1OpenAI · Flagship multimodal
claude-opus-4.5Anthropic · Long-context agentic
gemini-3-proGoogle · 1M-token context
grok-4xAI · Real-time knowledge
deepseek-v3.2DeepSeek · 深度求索 · Cost-efficient MoE
qwen3-maxAlibaba · 通义千问 · Multilingual, 100+ langs
glm-4.6Zhipu · 智谱 · Strong Chinese reasoning
kimi-k2Moonshot · 月之暗面 · Very long context
doubao-proByteDance · 字节豆包 · Low-latency, high-volume
deepseek-r1DeepSeek · 推理 · Open reasoning
curl https://api.nexusai.com/v1/chat/completions \
  -H "Authorization: Bearer $NEXUSAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v3.2",
    "messages": [
      { "role": "user", "content": "Explain MoE in one sentence." }
    ]
  }'

# Switch provider by changing ONE string — same key, same endpoint:
#   "model": "gpt-5.1"          -> OpenAI
#   "model": "claude-opus-4.5"  -> Anthropic
#   "model": "kimi-k2"          -> Moonshot 月之暗面
#   "model": "doubao-pro"       -> ByteDance 字节豆包
#   "model": "qwen3-max"        -> Alibaba 通义千问

Command every model from one place

Real-time token consumption, spend and latency per model — Western and Chinese — on a single dashboard. One key, total visibility, no surprises.

$866.80

Spend this month

−18% vs last month

4.92M

Requests (30d)

across 6 models

412 ms

Avg. latency

p50, routed

1,043

Failovers handled

0 dropped requests

ModelTokensSpendShare of spend
deepseek-v3.2

DeepSeek · 深度求索

184.2M$312.4
34%
gpt-5.1

OpenAI

92.7M$268.9
29%
claude-opus-4.5

Anthropic

61.0M$173.5
19%
qwen3-max

Alibaba · 通义千问

74.8M$58.2
9%
kimi-k2

Moonshot · 月之暗面

40.3M$31.7
5%
doubao-pro

ByteDance · 字节豆包

120.5M$22.1
4%

Set per-model budgets and spend alerts. Throttle, route around, or hard-cap any provider — without touching your code.

Every model, one endpoint

Switch between GPT, Claude, Gemini, Grok and the leading Chinese models — DeepSeek, Qwen, GLM, Kimi, Doubao — by changing a single string. Same OpenAI-compatible API.

G

GPT-5.1

Flagship

OpenAI's flagship multimodal model. Best-in-class general reasoning, coding, and tool use.

Learn more
C

Claude Opus 4.5

Flagship

Anthropic’s top model. Long context, strong agentic coding, careful instruction following.

Learn more
G

Gemini 3 Pro

Flagship

Google’s flagship. Native multimodal, 1M-token context, fast at scale.

Learn more
G

Grok 4

Flagship

xAI’s frontier model with real-time knowledge and strong STEM reasoning.

Learn more
O

OpenAI o4

Reasoning

Deliberate step-by-step reasoning for math, science, and complex planning.

Learn more
C

Claude Opus 4.5 (Thinking)

Reasoning

Extended thinking mode for hard, multi-step problems and deep analysis.

Learn more
D

DeepSeek R1

Reasoning

Open reasoning model rivaling closed frontier models at a fraction of the cost.

Learn more
G

Gemini 3 Deep Think

Reasoning

Google’s parallel-reasoning mode for the hardest benchmarks.

Learn more
D

DeepSeek V3.2

Chinese Models

深度求索 · Top-tier coding and reasoning, extremely cost-efficient MoE architecture.

Learn more
Q

Qwen3 Max

Chinese Models

阿里通义千问 · Strong multilingual and agentic performance, 100+ languages.

Learn more
G

GLM-4.6

Chinese Models

智谱 Zhipu · Balanced reasoning and tool use, excellent Chinese comprehension.

Learn more
K

Kimi K2

Chinese Models

月之暗面 Moonshot · Agentic model with very long context and strong coding.

Learn more
D

Doubao Pro

Chinese Models

字节豆包 · Fast, low-latency model optimized for high-volume production.

Learn more
H

Hunyuan

Chinese Models

腾讯混元 · Strong Chinese generation and multimodal understanding.

Learn more
E

ERNIE 4.5

Chinese Models

百度文心 · Knowledge-enhanced model with deep Chinese-language grounding.

Learn more
M

MiniMax M2

Chinese Models

MiniMax · Efficient agentic model tuned for coding and tool workflows.

Learn more
L

Llama 4

Open Source

Meta’s open-weights flagship. Self-hostable, permissive licensing.

Learn more
M

Mistral Large

Open Source

European open model with strong reasoning and function calling.

Learn more
Q

Qwen3 (Open Weights)

Open Source

Full open-weights family from 0.6B to 235B, Apache-2.0 licensed.

Learn more
D

DeepSeek V3 (Open Weights)

Open Source

Open MoE weights you can self-host or fine-tune freely.

Learn more

Ship with any model in minutes

One OpenAI-compatible key for GPT, Claude, Gemini, DeepSeek, Qwen and more. Start free with a 1:1 deposit match, or talk to our team about Enterprise.