One API for every LLM
Western models and Chinese models, unified
NexusAI is a single OpenAI-compatible gateway to GPT, Claude, Gemini and Grok — plus the leading Chinese models DeepSeek, Qwen, GLM, Kimi and Doubao. Smart routing, transparent per-token pricing, pay as you go.
Why NexusAI
One endpoint, every model. Smart routing, automatic failover, transparent per-token pricing — Western and Chinese models, unified.
Secure & Transparent
Ensures that all workflows are secure with encryption and gives you full transparency into the tasks your AI performs.
Smart Model Routing
Automates repetitive tasks seamlessly across multiple apps, saving you valuable time and effort every single day.
Cross-Provider Failover
Keeps your tools and apps perfectly synced, ensuring consistency and reliability across your entire workflow.
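The idea behind cross-provider failover can be sketched on the client side as a simple fallback chain. This is an illustration only, not NexusAI's internal implementation: `complete_with_failover`, `AllProvidersFailed` and `fake_send` are hypothetical names, and the transport is a stand-in that makes no network calls.

```python
from typing import Callable, Sequence


class AllProvidersFailed(Exception):
    """Raised when every model in the fallback chain has failed."""


def complete_with_failover(
    send: Callable[[str, str], str],
    prompt: str,
    models: Sequence[str],
) -> tuple[str, str]:
    """Try each model id in order; return (model_id, reply) from the first success."""
    errors: list[str] = []
    for model in models:
        try:
            return model, send(model, prompt)
        except Exception as exc:  # a real client would catch narrower error types
            errors.append(f"{model}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Stand-in transport for demonstration: the first provider is "down".
def fake_send(model: str, prompt: str) -> str:
    if model == "gpt-5.1":
        raise TimeoutError("upstream timed out")
    return f"[{model}] reply to: {prompt}"


used, reply = complete_with_failover(
    fake_send, "Explain MoE in one sentence.", ["gpt-5.1", "deepseek-v3.2"]
)
print(used)  # deepseek-v3.2
```

With a gateway that fails over automatically, this loop would live behind the endpoint rather than in application code; the sketch only shows the ordering logic.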
Multi-AI Integration
Integrates with over 100 popular tools and apps to streamline your workflow.
Gathering info from calendar, docs, and last week's notes...
Extracted key insights: 3 wins, 2 blockers, 4 metrics.
Drafting email (intro, highlights, next steps)...
Refining tone for concise + friendly style.
Real-Time Usage & Alerts
Get instant updates on the status of your workflows, tasks, and deadlines.
Every model, one endpoint
Switch between GPT, Claude, Gemini, Grok and the leading Chinese models — DeepSeek, Qwen, GLM, Kimi, Doubao — by changing a single string. Same OpenAI-compatible API.
GPT-5.1
OpenAI's flagship multimodal model. Best-in-class general reasoning, coding, and tool use.
Claude Opus 4.5
Anthropic’s top model. Long context, strong agentic coding, careful instruction following.
Gemini 3 Pro
Google’s flagship. Native multimodal, 1M-token context, fast at scale.
OpenAI o4
Deliberate step-by-step reasoning for math, science, and complex planning.
Claude Opus 4.5 (Thinking)
Extended thinking mode for hard, multi-step problems and deep analysis.
DeepSeek R1
Open reasoning model rivaling closed frontier models at a fraction of the cost.
DeepSeek V3.2
深度求索 · Top-tier coding and reasoning, extremely cost-efficient MoE architecture.
Qwen3 Max
阿里通义千问 · Strong multilingual and agentic performance, 100+ languages.
GLM-4.6
智谱 Zhipu · Balanced reasoning and tool use, excellent Chinese comprehension.
Kimi K2
月之暗面 Moonshot · Agentic model with very long context and strong coding.
Doubao Pro
字节豆包 · Fast, low-latency model optimized for high-volume production.
ERNIE 4.5
百度文心 · Knowledge-enhanced model with deep Chinese-language grounding.
MiniMax M2
MiniMax · Efficient agentic model tuned for coding and tool workflows.
Qwen3 (Open Weights)
Full open-weights family from 0.6B to 235B, Apache-2.0 licensed.
DeepSeek V3 (Open Weights)
Open MoE weights you can self-host or fine-tune freely.
Command every model from one place
Real-time token consumption, spend and latency per model — Western and Chinese — on a single dashboard. One key, total visibility, no surprises.
Spend this month: $866.80 (−18% vs last month)
Requests (30d): 4.92M, across 6 models
Avg. latency: 412 ms (p50, routed)
Failovers handled: 1,043 (0 dropped requests)
deepseek-v3.2 · DeepSeek · 深度求索
gpt-5.1 · OpenAI
claude-opus-4.5 · Anthropic
qwen3-max · Alibaba · 通义千问
kimi-k2 · Moonshot · 月之暗面
doubao-pro · ByteDance · 字节豆包
Set per-model budgets and spend alerts. Throttle, route around, or hard-cap any provider — without touching your code.
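As a rough illustration of how a spend alert and a hard cap differ, the sketch below classifies month-to-date spend per model. It is not NexusAI's configuration format: the `Budget` class, the `budget_action` function and every dollar figure are hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass
class Budget:
    alert_at: float  # spend (USD) at which to raise an alert
    hard_cap: float  # spend (USD) at which to block further requests


def budget_action(spend: float, budget: Budget) -> str:
    """Return 'block', 'alert', or 'ok' for the current month-to-date spend."""
    if spend >= budget.hard_cap:
        return "block"
    if spend >= budget.alert_at:
        return "alert"
    return "ok"


# Illustrative numbers only, not real usage data.
budgets = {
    "gpt-5.1": Budget(alert_at=80.0, hard_cap=100.0),
    "deepseek-v3.2": Budget(alert_at=40.0, hard_cap=50.0),
}
spend = {"gpt-5.1": 85.0, "deepseek-v3.2": 12.5}

actions = {m: budget_action(spend[m], b) for m, b in budgets.items()}
print(actions)  # {'gpt-5.1': 'alert', 'deepseek-v3.2': 'ok'}
```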
One key, every model
Drop-in OpenAI-compatible API. Keep your SDK, change the base URL and key, then call any model — Western or Chinese — by its id.
Authenticate once.
Reach every provider.
No per-vendor accounts, no Chinese phone verification, no separate billing. One NexusAI key authenticates GPT, Claude, Gemini, Grok and DeepSeek, Qwen, GLM, Kimi, Doubao behind a single global endpoint.
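A minimal sketch of what "one key, one endpoint" looks like from Python, using only the standard library. It assumes the gateway accepts the standard OpenAI-style `/chat/completions` request body; `build_chat_request` is a hypothetical helper and the fallback key is a placeholder. The requests are built but not sent.

```python
import json
import os
import urllib.request

BASE_URL = "https://api.nexusai.com/v1"  # base URL as listed on this page


def build_chat_request(model: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completion request."""
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# The same key and URL are reused; only the model id changes.
api_key = os.environ.get("NEXUSAI_API_KEY", "sk-nexus-placeholder")
requests_by_model = {
    model: build_chat_request(model, "Explain MoE in one sentence.", api_key)
    for model in ("deepseek-v3.2", "gpt-5.1", "claude-opus-4.5")
}
# To send one: urllib.request.urlopen(requests_by_model["gpt-5.1"])
```

Building the request separately from sending it keeps the example runnable without a key or network access.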
Base URL
https://api.nexusai.com/v1
Auth header
Authorization: Bearer sk-nexus-•••••••••
SDK
openai · @anthropic-ai/sdk · any OpenAI client
gpt-5.1 · OpenAI · Flagship multimodal
claude-opus-4.5 · Anthropic · Long-context agentic
gemini-3-pro · Google · 1M-token context
grok-4 · xAI · Real-time knowledge
deepseek-v3.2 · DeepSeek · 深度求索 · Cost-efficient MoE
qwen3-max · Alibaba · 通义千问 · Multilingual, 100+ langs
glm-4.6 · Zhipu · 智谱 · Strong Chinese reasoning
kimi-k2 · Moonshot · 月之暗面 · Very long context
doubao-pro · ByteDance · 字节豆包 · Low-latency, high-volume
deepseek-r1 · DeepSeek · 推理 · Open reasoning
curl https://api.nexusai.com/v1/chat/completions \
  -H "Authorization: Bearer $NEXUSAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v3.2",
    "messages": [
      { "role": "user", "content": "Explain MoE in one sentence." }
    ]
  }'
# Switch provider by changing ONE string — same key, same endpoint:
# "model": "gpt-5.1" -> OpenAI
# "model": "claude-opus-4.5" -> Anthropic
# "model": "kimi-k2" -> Moonshot 月之暗面
# "model": "doubao-pro" -> ByteDance 字节豆包
# "model": "qwen3-max" -> Alibaba 通义千问
How it works
From signup to production in three steps — no migration, no lock-in.
Built for every workload
From chatbots to agents to batch pipelines — one gateway across every model.
See how we drive success for every team.
Sales leaders use AI agents to automate and optimize sales workflows
This cuts down manual tasks such as lead qualification and follow-ups. With automated workflows, sales teams can close deals faster, prioritize high-value prospects, and raise conversion rates consistently.
Trusted by builders
Teams shipping with NexusAI — from indie devs to AI-native startups.
This has completely transformed how we manage our daily tasks. What used to take hours now happens automatically, and we've never been more productive.
Pricing
Pay only for the tokens you use. No markup, no minimums, no lock-in.
Pricing that scales with your usage
Find the perfect balance of features and scalability for your workflow,
from first prototype to millions of requests.
Pay As You Go
- Access to all 30+ models
- OpenAI-compatible API
- Per-token pricing at provider cost + 0%
- 1:1 deposit match up to $100
- Chat playground
- 3 requests/sec rate limit
- Community support
Pro
- Everything in Pay As You Go
- Smart routing + automatic failover
- Volume discounts on tokens
- 60 requests/sec rate limit
- Usage analytics & spend alerts
- Prompt caching enabled
- Priority email support
Enterprise
- Everything in Pro
- 99.9% uptime SLA
- Dedicated capacity & custom rate limits
- SSO / SAML, audit logs, RBAC
- Self-hosted / private deployment
- China-region data residency option
- Dedicated solutions engineer
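The arithmetic behind "1:1 deposit match up to $100" and "provider cost + 0%" can be sketched as follows. This assumes the $100 cap applies to the matched amount, which the page does not spell out; `starting_balance` and `request_cost` are hypothetical helpers, and any per-token price passed in is a placeholder rather than a real provider rate.

```python
from decimal import Decimal

MATCH_CAP = Decimal("100")  # "1:1 deposit match up to $100" (cap assumed to apply to the match)


def starting_balance(deposit: Decimal) -> Decimal:
    """Deposit plus a 1:1 match, with the match capped at MATCH_CAP."""
    return deposit + min(deposit, MATCH_CAP)


def request_cost(
    tokens: int,
    price_per_million: Decimal,
    markup: Decimal = Decimal("0"),  # "provider cost + 0%"
) -> Decimal:
    """Cost of `tokens` at a provider price quoted per million tokens."""
    return Decimal(tokens) / Decimal(1_000_000) * price_per_million * (1 + markup)


print(starting_balance(Decimal("40")))   # 80
print(starting_balance(Decimal("250")))  # 350
```

Using `Decimal` rather than floats avoids rounding drift when many small per-token charges are summed.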
FAQ
Everything about models, routing, pricing and data privacy.
Need Help? We've Got Answers
Explore our most commonly asked questions and find the information you need.
Product & Features
Plans & Usage
Ship with any model in minutes
One OpenAI-compatible key for GPT, Claude, Gemini, DeepSeek, Qwen and more. Start free with a 1:1 deposit match, or talk to our team about Enterprise.
Frontier and open models — Western and Chinese — behind one endpoint.
Tokens routed every week with automatic failover and zero lock-in.
Gateway uptime, with sub-second routing overhead across all providers.