Llama 3.1 70B Hanami x1
Llama 3.1 70B Hanami x1 (Sao10K, 2025) is an chat model. This is [Sao10K](/sao10k)'s experiment over [Euryale v2.2](/sao10k/l3.1-euryale-70b).
by Sao10K · 16K token context window
Best for
- low-latency chat and routing
- request routing and triage
- text classification
Ways to use Llama 3.1 70B Hanami x1 in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your Sao10K API key. osFoundry discovers Llama 3.1 70B Hanami x1 automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
Run Llama 3.1 70B Hanami x1 yourself
Llama 3.1 70B Hanami x1 is also available as open weights — self-host it for full data control and no per-token cost. See that page for GPU requirements and a cost comparison against API pricing.
Llama 3.1 70B Hanami x1 vs similar models
| Model | Org | Params | Context | Input price | Self-host |
|---|
| Llama 3.1 70B Hanami x1 | Sao10K | — | 16K | $ 3.00 /1M | API only |
| Phi 4 | Microsoft | — | 16K | $ 0.065 /1M | API only |
| DeepSeek V3 | DeepSeek | — | 164K | $ 0.320 /1M | API only |
| Command R7B (12-2024) | Cohere | — | 128K | $ 0.037 /1M | API only |
Licence
Hosted — usage subject to provider terms — Hosted-only model — usage governed by the provider's API terms. Bring your own provider key.
No weights distributed; usage subject to provider terms.
Frequently asked about Llama 3.1 70B Hanami x1
How much does Llama 3.1 70B Hanami x1 cost?
Llama 3.1 70B Hanami x1 is metered at $ 3.00 /1M for input, and $ 3.00 /1M for output. Bring your own Sao10K API key — osFoundry passes through provider pricing without markup.
Can I use Llama 3.1 70B Hanami x1 commercially?
Commercial use is allowed with conditions. Hosted-only model — usage governed by the provider's API terms. Bring your own provider key. No weights distributed; usage subject to provider terms.
What is the context window of Llama 3.1 70B Hanami x1?
Llama 3.1 70B Hanami x1 supports a 16K token context window.
Can I run Llama 3.1 70B Hanami x1 locally?
No — Llama 3.1 70B Hanami x1 is hosted only and accessed via the Sao10K API. An open-weights equivalent is available to self-host — see the cross-link above.
What is Llama 3.1 70B Hanami x1 best at?
Llama 3.1 70B Hanami x1 is well-suited to low-latency chat and routing, request routing and triage, text classification.
How do I use Llama 3.1 70B Hanami x1 in osFoundry?
Paste your Sao10K API key in the key dialog (or deploy the open weights for self-hostable models), assign Llama 3.1 70B Hanami x1 to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by Sao10K on January 8, 2025. Source: https://openrouter.ai/sao10k/l3.1-70b-hanami-x1