GLM 5 Turbo
GLM 5 Turbo (Z.ai, 2026) is an chat model. GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...
by Z.ai · 203K token context window
Best for
- low-latency chat and routing
- request routing and triage
- text classification
Ways to use GLM 5 Turbo in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your Z.ai API key. osFoundry discovers GLM 5 Turbo automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
GLM 5 Turbo vs similar models
Licence
Hosted — usage subject to provider terms — Hosted-only model — usage governed by the provider's API terms. Bring your own provider key.
No weights distributed; usage subject to provider terms.
Frequently asked about GLM 5 Turbo
How much does GLM 5 Turbo cost?
GLM 5 Turbo is metered at $ 1.20 /1M for input, and $ 4.00 /1M for output. Bring your own Z.ai API key — osFoundry passes through provider pricing without markup.
Can I use GLM 5 Turbo commercially?
Commercial use is allowed with conditions. Hosted-only model — usage governed by the provider's API terms. Bring your own provider key. No weights distributed; usage subject to provider terms.
What is the context window of GLM 5 Turbo?
GLM 5 Turbo supports a 203K token context window.
Can I run GLM 5 Turbo locally?
No — GLM 5 Turbo is hosted only and accessed via the Z.ai API.
What is GLM 5 Turbo best at?
GLM 5 Turbo is well-suited to low-latency chat and routing, request routing and triage, text classification.
How do I use GLM 5 Turbo in osFoundry?
Paste your Z.ai API key in the key dialog (or deploy the open weights for self-hostable models), assign GLM 5 Turbo to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by Z.ai on March 15, 2026. Source: https://openrouter.ai/z-ai/glm-5-turbo