New1:1 deposit match — up to $100 free credits

One API for every LLM
Western models and Chinese models, unified

NexusAI is a single OpenAI-compatible gateway to GPT, Claude, Gemini and Grok — plus the leading Chinese models DeepSeek, Qwen, GLM, Kimi and Doubao. Smart routing, transparent per-token pricing, pay as you go.

Get API Key Browse Models

30+ Models

99.9% Uptime

input

New Lead Inquiry

Automatically qualify new inbound leads from email or forms.

Fetching detail...

Company details

LinkedIn profiles

0.0 sec

action

Scoring & Categorization

Using AI logic, the lead is scored based on size, industry, and engagement.

1.8 sec

GPT-4-1 Mini

output

Response

The agent have prepared a tailored response.

Answer

Lead status, notes, and follow-up are logged in the CRM, ready for the sales team.

File updated

0.0 sec

Why NexusAI

One endpoint, every model. Smart routing, automatic failover, transparent per-token pricing — Western and Chinese models, unified.

Secure & Transparent

Ensures that all workflows are secure with encryption and gives you full transparency into the tasks your AI performs.

Searching the web...

Smart Model Routing

Automates repetitive tasks seamlessly across multiple apps, saving you valuable time and effort every single day.

Cross-Provider Failover

Keeps your tools and apps perfectly synced, ensuring consistency and reliability across your entire workflow.

Multi-AI Integration

Seamlessly integrates with over 100 popular tools and apps, significantly enhancing your workflow and productivity effortlessly every day.

Gathering info from calendar, docs, and last weeks notes...

Extracted key insights: 3 wins, 2 blockers, 4 metrics.

Drafting email (intro, highlights, next steps)...

Refining tone for concise + friendly style.

Gathering info from calendar, docs, and last weeks notes...

Extracted key insights: 3 wins, 2 blockers, 4 metrics.

Drafting email (intro, highlights, next steps)...

Refining tone for concise + friendly style.

Real-Time Usage & Alerts

Get instant updates on the status of your workflows, tasks, and deadlines.

Every model, one endpoint

Switch between GPT, Claude, Gemini, Grok and the leading Chinese models — DeepSeek, Qwen, GLM, Kimi, Doubao — by changing a single string. Same OpenAI-compatible API.

GPT-5.1

Flagship

OpenAI's flagship multimodal model. Best-in-class general reasoning, coding, and tool use.

Learn more

Claude Opus 4.5

Flagship

Anthropic’s top model. Long context, strong agentic coding, careful instruction following.

Learn more

Gemini 3 Pro

Flagship

Google’s flagship. Native multimodal, 1M-token context, fast at scale.

Learn more

Grok 4

Flagship

xAI’s frontier model with real-time knowledge and strong STEM reasoning.

Learn more

OpenAI o4

Reasoning

Deliberate step-by-step reasoning for math, science, and complex planning.

Learn more

Claude Opus 4.5 (Thinking)

Reasoning

Extended thinking mode for hard, multi-step problems and deep analysis.

Learn more

DeepSeek R1

Reasoning

Open reasoning model rivaling closed frontier models at a fraction of the cost.

Learn more

Gemini 3 Deep Think

Reasoning

Google’s parallel-reasoning mode for the hardest benchmarks.

Learn more

DeepSeek V3.2

Chinese Models

深度求索 · Top-tier coding and reasoning, extremely cost-efficient MoE architecture.

Learn more

Qwen3 Max

Chinese Models

阿里通义千问 · Strong multilingual and agentic performance, 100+ languages.

Learn more

GLM-4.6

Chinese Models

智谱 Zhipu · Balanced reasoning and tool use, excellent Chinese comprehension.

Learn more

Kimi K2

Chinese Models

月之暗面 Moonshot · Agentic model with very long context and strong coding.

Learn more

Doubao Pro

Chinese Models

字节豆包 · Fast, low-latency model optimized for high-volume production.

Learn more

Hunyuan

Chinese Models

腾讯混元 · Strong Chinese generation and multimodal understanding.

Learn more

ERNIE 4.5

Chinese Models

百度文心 · Knowledge-enhanced model with deep Chinese-language grounding.

Learn more

MiniMax M2

Chinese Models

MiniMax · Efficient agentic model tuned for coding and tool workflows.

Learn more

Llama 4

Open Source

Meta’s open-weights flagship. Self-hostable, permissive licensing.

Learn more

Mistral Large

Open Source

European open model with strong reasoning and function calling.

Learn more

Qwen3 (Open Weights)

Open Source

Full open-weights family from 0.6B to 235B, Apache-2.0 licensed.

Learn more

DeepSeek V3 (Open Weights)

Open Source

Open MoE weights you can self-host or fine-tune freely.

Learn more

Command every model from one place

Real-time token consumption, spend and latency per model — Western and Chinese — on a single dashboard. One key, total visibility, no surprises.

$866.80

Spend this month

−18% vs last month

4.92M

Requests (30d)

across 6 models

412 ms

Avg. latency

p50, routed

1,043

Failovers handled

0 dropped requests

ModelTokensSpendShare of spend

deepseek-v3.2

DeepSeek · 深度求索

184.2M$312.4

34%

gpt-5.1

OpenAI

92.7M$268.9

29%

claude-opus-4.5

Anthropic

61.0M$173.5

19%

qwen3-max

Alibaba · 通义千问

74.8M$58.2

kimi-k2

Moonshot · 月之暗面

40.3M$31.7

doubao-pro

ByteDance · 字节豆包

120.5M$22.1

Set per-model budgets and spend alerts. Throttle, route around, or hard-cap any provider — without touching your code.

One key, every model

Drop-in OpenAI-compatible API. Keep your SDK, change the base URL and key, then call any model — Western or Chinese — by its id.

Quickstart

Authenticate once.
Reach every provider.

No per-vendor accounts, no Chinese phone verification, no separate billing. One NexusAI key authenticates GPT, Claude, Gemini, Grok and DeepSeek, Qwen, GLM, Kimi, Doubao behind a single global endpoint.

Base URL

https://api.nexusai.com/v1

Auth header

Authorization: Bearer sk-nexus-•••••••••

SDK

openai · @anthropic-ai/sdk · any OpenAI client

Model IDs

gpt-5.1OpenAI · Flagship multimodal

claude-opus-4.5Anthropic · Long-context agentic

gemini-3-proGoogle · 1M-token context

grok-4xAI · Real-time knowledge

deepseek-v3.2DeepSeek · 深度求索 · Cost-efficient MoE

qwen3-maxAlibaba · 通义千问 · Multilingual, 100+ langs

glm-4.6Zhipu · 智谱 · Strong Chinese reasoning

kimi-k2Moonshot · 月之暗面 · Very long context

doubao-proByteDance · 字节豆包 · Low-latency, high-volume

deepseek-r1DeepSeek · 推理 · Open reasoning

curl https://api.nexusai.com/v1/chat/completions \
  -H "Authorization: Bearer $NEXUSAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v3.2",
    "messages": [
      { "role": "user", "content": "Explain MoE in one sentence." }
    ]
  }'

# Switch provider by changing ONE string — same key, same endpoint:
#   "model": "gpt-5.1"          -> OpenAI
#   "model": "claude-opus-4.5"  -> Anthropic
#   "model": "kimi-k2"          -> Moonshot 月之暗面
#   "model": "doubao-pro"       -> ByteDance 字节豆包
#   "model": "qwen3-max"        -> Alibaba 通义千问

How it works

From signup to production in three steps — no migration, no lock-in.

Built for every workload

From chatbots to agents to batch pipelines — one gateway across every model.

See how we drives success for every team.

Sales Leaders use ai agent to automate and optimise sales workflows

This helps sales teams reduce tasks like lead qualification and follow-ups. With automated workflows, sales teams can focus on closing deals faster, prioritizing high-value prospects, and increasing conversion rates consistently.

Learn more

"By automating lead qualification and follow-up tasks, we've focused on high-impact sales activities."

"AI has helped us streamline our sales processes, making the team more productive."

"Our sales cycle has been cut down by 30%, thanks to the efficiency and precision provided by AI automation."

Trusted by builders

Teams shipping with NexusAI — from indie devs to AI-native startups.

This is completely transformed how we manage our daily tasks. What used to take hours now happens automatically and we've never been more productive.

JLJamie Lee, Operations Manager

pricing

Pay only for the tokens you use. No markup, no minimums, no lock-in.

Pricing that scales with your usage

Find the perfect balance of features and scalability for your workflow,
from first prototype to millions of requests.

Pay As You Go

For developers and prototypes — only pay for tokens you use

$0/month

Get API key

Access to all 30+ models
OpenAI-compatible API
Per-token pricing at provider cost + 0%
1:1 deposit match up to $100
Chat playground
3 requests/sec rate limit
Community support

Pro

For production apps that need scale and reliability

$0/month

Start Pro

Everything in Pay As You Go
Smart routing + automatic failover
Volume discounts on tokens
60 requests/sec rate limit
Usage analytics & spend alerts
Prompt caching enabled
Priority email support

Enterprise

For teams with compliance, SLA and self-host needs

$0/custom

Contact sales

Everything in Pro
99.9% uptime SLA
Dedicated capacity & custom rate limits
SSO / SAML, audit logs, RBAC
Self-hosted / private deployment
China-region data residency option
Dedicated solutions engineer

FAQ

Everything about models, routing, pricing and data privacy.

Need Help? We've Got Answers

Explore Our Most Commonly Asked Questions and Find the Information You Need.

Docs Contact Us

Product & Features

NexusAI is a unified LLM gateway. One OpenAI-compatible API key lets you call GPT, Claude, Gemini and Grok alongside the leading Chinese models — DeepSeek, Qwen, GLM, Kimi, Doubao, Hunyuan and more — without managing separate accounts, SDKs or billing.

Plans & Usage

Pay As You Go includes a baseline rate limit; Pro raises it to 60 req/s with burst headroom; Enterprise offers dedicated capacity and a 99.9% uptime SLA. Automatic failover keeps requests flowing even when an upstream provider degrades.

Ship with any model in minutes

One OpenAI-compatible key for GPT, Claude, Gemini, DeepSeek, Qwen and more. Start free with a 1:1 deposit match, or talk to our team about Enterprise.

Get API key View pricing

Frontier and open models — Western and Chinese — behind one endpoint.

0.0Billion+

Tokens routed every week with automatic failover and zero lock-in.

0.0%

Gateway uptime, with sub-second routing overhead across all providers.

One API for every LLMWestern models and Chinese models, unified

Why NexusAI

Secure & Transparent

Smart Model Routing

Cross-Provider Failover

Multi-AI Integration

Real-Time Usage & Alerts

Every model, one endpoint

GPT-5.1

Claude Opus 4.5

Gemini 3 Pro

Grok 4

OpenAI o4

Claude Opus 4.5 (Thinking)

DeepSeek R1

Gemini 3 Deep Think

DeepSeek V3.2

Qwen3 Max

GLM-4.6

Kimi K2

Doubao Pro

Hunyuan

ERNIE 4.5

MiniMax M2

Llama 4

Mistral Large

Qwen3 (Open Weights)

DeepSeek V3 (Open Weights)

Command every model from one place

One key, every model

Authenticate once.Reach every provider.

How it works

Get one API key

Pick a model or auto-route

Monitor and scale

Built for every workload

See how we drives success for every team.

Sales Leaders use ai agent to automate and optimise sales workflows

Trusted by builders

This is completely transformed how we manage our daily tasks. What used to take hours now happens automatically and we've never been more productive.

pricing

Pricing that scales with your usage

FAQ

Need Help? We've Got Answers

1. What is NexusAI?

2. Do I have to rewrite my code?

3. How does smart routing work?

4. How is pricing calculated?

1. Can I access Chinese models from outside China?

2. Is my data used for training?

3. What about rate limits and uptime?

4. Can I self-host or run privately?

Ship with any model in minutes

One API for every LLM
Western models and Chinese models, unified

Authenticate once.
Reach every provider.