Integrate every model in minutes
Keep the OpenAI SDK you already use. Point it at NexusAI, swap the key, and call GPT, Claude, Gemini, DeepSeek, Qwen, GLM, Kimi or Doubao by changing one string.
One key, every model
Drop-in OpenAI-compatible API. Keep your SDK, change the base URL and key, then call any model — Western or Chinese — by its id.
Authenticate once. Reach every provider.
No per-vendor accounts, no Chinese phone verification, no separate billing. One NexusAI key authenticates GPT, Claude, Gemini and Grok alongside DeepSeek, Qwen, GLM, Kimi and Doubao, all behind a single global endpoint.
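As a rough illustration of what "one key, one endpoint" means on the wire, the sketch below builds (without sending) a chat completions request using only the Python standard library. The base URL, header format, and `NEXUSAI_API_KEY` variable come from this page; the helper name `build_chat_request` is hypothetical, and the response shape is not assumed beyond being JSON.

```python
import json
import os
import urllib.request

# Base URL as shown on this page.
BASE_URL = "https://api.nexusai.com/v1"


def build_chat_request(model: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completions request.

    Hypothetical helper for illustration; not part of any SDK.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Only attempts a network call when a key is present in the environment.
    key = os.environ.get("NEXUSAI_API_KEY")
    if key:
        req = build_chat_request("deepseek-v3.2", "Explain MoE in one sentence.", key)
        with urllib.request.urlopen(req) as resp:
            print(json.load(resp))
```

With an OpenAI-compatible SDK the same three inputs (base URL, key, model id) would typically be passed to the client constructor and the request call instead of being assembled by hand.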
Base URL
https://api.nexusai.com/v1
Auth header
Authorization: Bearer sk-nexus-•••••••••
SDK
openai · @anthropic-ai/sdk · any OpenAI client
gpt-5.1 · OpenAI · Flagship multimodal
claude-opus-4.5 · Anthropic · Long-context agentic
gemini-3-pro · Google · 1M-token context
grok-4 · xAI · Real-time knowledge
deepseek-v3.2 · DeepSeek · 深度求索 · Cost-efficient MoE
qwen3-max · Alibaba · 通义千问 · Multilingual, 100+ languages
glm-4.6 · Zhipu · 智谱 · Strong Chinese reasoning
kimi-k2 · Moonshot · 月之暗面 · Very long context
doubao-pro · ByteDance · 字节豆包 · Low-latency, high-volume
deepseek-r1 · DeepSeek · Open reasoning
curl https://api.nexusai.com/v1/chat/completions \
  -H "Authorization: Bearer $NEXUSAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v3.2",
    "messages": [
      { "role": "user", "content": "Explain MoE in one sentence." }
    ]
  }'
# Switch provider by changing ONE string — same key, same endpoint:
# "model": "gpt-5.1" -> OpenAI
# "model": "claude-opus-4.5" -> Anthropic
# "model": "kimi-k2" -> Moonshot 月之暗面
# "model": "doubao-pro" -> ByteDance 字节豆包
# "model": "qwen3-max" -> Alibaba 通义千问
Command every model from one place
Real-time token consumption, spend and latency per model — Western and Chinese — on a single dashboard. One key, total visibility, no surprises.
Spend this month: $866.80 (−18% vs last month)
Requests (30d): 4.92M across 6 models
Avg. latency: 412 ms (p50, routed)
Failovers handled: 1,043 (0 dropped requests)
deepseek-v3.2 · DeepSeek · 深度求索
gpt-5.1 · OpenAI
claude-opus-4.5 · Anthropic
qwen3-max · Alibaba · 通义千问
kimi-k2 · Moonshot · 月之暗面
doubao-pro · ByteDance · 字节豆包
Set per-model budgets and spend alerts. Throttle, route around, or hard-cap any provider — without touching your code.
Every model, one endpoint
Switch between GPT, Claude, Gemini, Grok and the leading Chinese models — DeepSeek, Qwen, GLM, Kimi, Doubao — by changing a single string. Same OpenAI-compatible API.
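To make the "single string" claim concrete, here is a minimal sketch that takes one request body and retargets it at other model ids listed on this page. The `MODELS` table and the `with_model` helper are hypothetical names for illustration; only the `model` field changes, while the messages stay identical.

```python
import json

# Model ids listed on this page; provider labels are for display only.
MODELS = {
    "gpt-5.1": "OpenAI",
    "claude-opus-4.5": "Anthropic",
    "gemini-3-pro": "Google",
    "grok-4": "xAI",
    "deepseek-v3.2": "DeepSeek",
    "qwen3-max": "Alibaba",
    "glm-4.6": "Zhipu",
    "kimi-k2": "Moonshot",
    "doubao-pro": "ByteDance",
}


def with_model(payload: dict, model: str) -> dict:
    """Return a copy of the payload that targets a different model id."""
    if model not in MODELS:
        raise ValueError(f"unknown model id: {model}")
    return {**payload, "model": model}


base = {
    "model": "deepseek-v3.2",
    "messages": [{"role": "user", "content": "Explain MoE in one sentence."}],
}

for model_id, provider in MODELS.items():
    body = with_model(base, model_id)
    print(f"{provider}: {json.dumps(body, ensure_ascii=False)}")
```

Checking ids against a local table is a client-side convenience in this sketch, not something the page says the API requires.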
GPT-5.1
OpenAI's flagship multimodal model. Best-in-class general reasoning, coding, and tool use.
Claude Opus 4.5
Anthropic’s top model. Long context, strong agentic coding, careful instruction following.
Gemini 3 Pro
Google’s flagship. Native multimodal, 1M-token context, fast at scale.
OpenAI o4
Deliberate step-by-step reasoning for math, science, and complex planning.
Claude Opus 4.5 (Thinking)
Extended thinking mode for hard, multi-step problems and deep analysis.
DeepSeek R1
Open reasoning model rivaling closed frontier models at a fraction of the cost.
DeepSeek V3.2
深度求索 · Top-tier coding and reasoning, extremely cost-efficient MoE architecture.
Qwen3 Max
阿里通义千问 · Strong multilingual and agentic performance, 100+ languages.
GLM-4.6
智谱 Zhipu · Balanced reasoning and tool use, excellent Chinese comprehension.
Kimi K2
月之暗面 Moonshot · Agentic model with very long context and strong coding.
Doubao Pro
字节豆包 · Fast, low-latency model optimized for high-volume production.
ERNIE 4.5
百度文心 · Knowledge-enhanced model with deep Chinese-language grounding.
MiniMax M2
MiniMax · Efficient agentic model tuned for coding and tool workflows.
Qwen3 (Open Weights)
Full open-weights family from 0.6B to 235B, Apache-2.0 licensed.
DeepSeek V3 (Open Weights)
Open MoE weights you can self-host or fine-tune freely.
Ship with any model in minutes
One OpenAI-compatible key for GPT, Claude, Gemini, DeepSeek, Qwen and more. Start free with a 1:1 deposit match, or talk to our team about Enterprise.