Llama 4 Maverick
Llama 4 Maverick is a image-generation model from Meta, released April 5, 2025. Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...
by Meta · 1049K token context window
Best for
- image generation from text
- creative design and ideation
Ways to use Llama 4 Maverick in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your Meta API key. osFoundry discovers Llama 4 Maverick automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
Run Llama 4 Maverick yourself
Llama 4 Maverick is also available as open weights — self-host it for full data control and no per-token cost. See that page for GPU requirements and a cost comparison against API pricing.
Llama 4 Maverick vs similar models
| Model | Org | Params | Context | Input price | Self-host |
|---|
| Llama 4 Maverick | Meta | — | 1049K | $ 0.150 /1M | API only |
| GPT-4.1 Nano | OpenAI | — | 1048K | $ 0.100 /1M | API only |
| Mistral Small 3.1 24B | Mistral | — | 128K | $ 0.350 /1M | API only |
| Gemma 3 4B | Google | — | 131K | $ 0.040 /1M | API only |
Licence
Hosted — usage subject to provider terms — Hosted-only model — usage governed by the provider's API terms. Bring your own provider key.
No weights distributed; usage subject to provider terms.
Frequently asked about Llama 4 Maverick
How much does Llama 4 Maverick cost?
Llama 4 Maverick is metered at $ 0.150 /1M for input, and $ 0.600 /1M for output. Bring your own Meta API key — osFoundry passes through provider pricing without markup.
Can I use Llama 4 Maverick commercially?
Commercial use is allowed with conditions. Hosted-only model — usage governed by the provider's API terms. Bring your own provider key. No weights distributed; usage subject to provider terms.
What is the context window of Llama 4 Maverick?
Llama 4 Maverick supports a 1049K token context window.
Can I run Llama 4 Maverick locally?
No — Llama 4 Maverick is hosted only and accessed via the Meta API. An open-weights equivalent is available to self-host — see the cross-link above.
What is Llama 4 Maverick best at?
Llama 4 Maverick is well-suited to image generation from text, creative design and ideation.
How do I use Llama 4 Maverick in osFoundry?
Paste your Meta API key in the key dialog (or deploy the open weights for self-hostable models), assign Llama 4 Maverick to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by Meta on April 5, 2025. Source: https://openrouter.ai/meta-llama/llama-4-maverick