Ling-2.6-flash
inclusionAI's Ling-2.6-flash is a chat model. Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
by inclusionAI · 262K token context window
Best for
- low-latency chat and routing
- request routing and triage
- text classification
Ways to use Ling-2.6-flash in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your inclusionAI API key. osFoundry discovers Ling-2.6-flash automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
Ling-2.6-flash vs similar models
| Model | Org | Params | Context | Input price | Self-host |
|---|
| Ling-2.6-flash | inclusionAI | — | 262K | $ 0.080 /1M | API only |
| MiMo-V2.5-Pro | Xiaomi | — | 1049K | $ 1.00 /1M | API only |
| Pareto Code Router | openrouter | — | 2000K | $ -1000000.000 /1M | API only |
| Hy3 preview | Tencent | — | 262K | $ 0.066 /1M | API only |
Licence
Hosted — usage subject to provider terms — Hosted-only model — usage governed by the provider's API terms. Bring your own provider key.
No weights distributed; usage subject to provider terms.
Frequently asked about Ling-2.6-flash
How much does Ling-2.6-flash cost?
Ling-2.6-flash is metered at $ 0.080 /1M for input, and $ 0.240 /1M for output. Bring your own inclusionAI API key — osFoundry passes through provider pricing without markup.
Can I use Ling-2.6-flash commercially?
Commercial use is allowed with conditions. Hosted-only model — usage governed by the provider's API terms. Bring your own provider key. No weights distributed; usage subject to provider terms.
What is the context window of Ling-2.6-flash?
Ling-2.6-flash supports a 262K token context window.
Can I run Ling-2.6-flash locally?
No — Ling-2.6-flash is hosted only and accessed via the inclusionAI API.
What is Ling-2.6-flash best at?
Ling-2.6-flash is well-suited to low-latency chat and routing, request routing and triage, text classification.
How do I use Ling-2.6-flash in osFoundry?
Paste your inclusionAI API key in the key dialog (or deploy the open weights for self-hostable models), assign Ling-2.6-flash to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by inclusionAI on April 21, 2026. Source: https://openrouter.ai/inclusionai/ling-2.6-flash