Trinity Large Thinking (free)
Arcee AI's Trinity Large Thinking (free) is a chat model. Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7...
by Arcee AI · 262K token context window
Best for
- low-latency chat and routing
- request routing and triage
- text classification
Ways to use Trinity Large Thinking (free) in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your Arcee AI API key. osFoundry discovers Trinity Large Thinking (free) automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
Run Trinity Large Thinking (free) yourself
Trinity Large Thinking (free) is also available as open weights — self-host it for full data control and no per-token cost. See that page for GPU requirements and a cost comparison against API pricing.
Trinity Large Thinking (free) vs similar models
| Model | Org | Params | Context | Input price | Self-host |
|---|
| Trinity Large Thinking (free) | Arcee AI | — | 262K | Free | API only |
| GLM 5.1 | Z.ai | — | 203K | $ 1.05 /1M | API only |
| Pareto Code Router | openrouter | — | 2000K | $ -1000000.000 /1M | API only |
| KAT-Coder-Pro V2 | Kwaipilot | — | 256K | $ 0.300 /1M | API only |
Licence
Hosted — usage subject to provider terms — Hosted-only model — usage governed by the provider's API terms. Bring your own provider key.
No weights distributed; usage subject to provider terms.
Frequently asked about Trinity Large Thinking (free)
How much does Trinity Large Thinking (free) cost?
Trinity Large Thinking (free) is metered at Free for input, and Free for output. Bring your own Arcee AI API key — osFoundry passes through provider pricing without markup.
Can I use Trinity Large Thinking (free) commercially?
Commercial use is allowed with conditions. Hosted-only model — usage governed by the provider's API terms. Bring your own provider key. No weights distributed; usage subject to provider terms.
What is the context window of Trinity Large Thinking (free)?
Trinity Large Thinking (free) supports a 262K token context window.
Can I run Trinity Large Thinking (free) locally?
No — Trinity Large Thinking (free) is hosted only and accessed via the Arcee AI API. An open-weights equivalent is available to self-host — see the cross-link above.
What is Trinity Large Thinking (free) best at?
Trinity Large Thinking (free) is well-suited to low-latency chat and routing, request routing and triage, text classification.
How do I use Trinity Large Thinking (free) in osFoundry?
Paste your Arcee AI API key in the key dialog (or deploy the open weights for self-hostable models), assign Trinity Large Thinking (free) to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by Arcee AI on April 1, 2026. Source: https://openrouter.ai/arcee-ai/trinity-large-thinking:free