ERNIE 4.5 21B A3B
ERNIE 4.5 21B A3B is a chat model from Baidu, released August 12, 2025. A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an...
by Baidu · 120K token context window
Best for
- low-latency chat and routing
- request routing and triage
- text classification
Ways to use ERNIE 4.5 21B A3B in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your Baidu API key. osFoundry discovers ERNIE 4.5 21B A3B automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
Run ERNIE 4.5 21B A3B yourself
ERNIE 4.5 21B A3B is also available as open weights — self-host it for full data control and no per-token cost. See that page for GPU requirements and a cost comparison against API pricing.
ERNIE 4.5 21B A3B vs similar models
| Model | Org | Params | Context | Input price | Self-host |
|---|
| ERNIE 4.5 21B A3B | Baidu | — | 120K | $ 0.070 /1M | API only |
| DeepSeek V3.1 | DeepSeek | — | 164K | $ 0.210 /1M | API only |
| Jamba Large 1.7 | AI21 | — | 256K | $ 2.00 /1M | API only |
| Hermes 4 405B | Nous | — | 131K | $ 1.00 /1M | API only |
Licence
Hosted — usage subject to provider terms — Hosted-only model — usage governed by the provider's API terms. Bring your own provider key.
No weights distributed; usage subject to provider terms.
Frequently asked about ERNIE 4.5 21B A3B
How much does ERNIE 4.5 21B A3B cost?
ERNIE 4.5 21B A3B is metered at $ 0.070 /1M for input, and $ 0.280 /1M for output. Bring your own Baidu API key — osFoundry passes through provider pricing without markup.
Can I use ERNIE 4.5 21B A3B commercially?
Commercial use is allowed with conditions. Hosted-only model — usage governed by the provider's API terms. Bring your own provider key. No weights distributed; usage subject to provider terms.
What is the context window of ERNIE 4.5 21B A3B?
ERNIE 4.5 21B A3B supports a 120K token context window.
Can I run ERNIE 4.5 21B A3B locally?
No — ERNIE 4.5 21B A3B is hosted only and accessed via the Baidu API. An open-weights equivalent is available to self-host — see the cross-link above.
What is ERNIE 4.5 21B A3B best at?
ERNIE 4.5 21B A3B is well-suited to low-latency chat and routing, request routing and triage, text classification.
How do I use ERNIE 4.5 21B A3B in osFoundry?
Paste your Baidu API key in the key dialog (or deploy the open weights for self-hostable models), assign ERNIE 4.5 21B A3B to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by Baidu on August 12, 2025. Source: https://openrouter.ai/baidu/ernie-4.5-21b-a3b