Llama 3.2 11B Vision Instruct
Llama 3.2 11B Vision Instruct is a multimodal vision-language model from Meta, released September 25, 2024. It has 11 billion parameters, takes images and text as input, and produces text output; it does not generate images. It is designed to handle tasks combining visual and textual data, and excels in tasks such as image captioning and...
by Meta · 131K token context window
Best for
- image captioning
- tasks combining visual and textual input (image and text in, text out)
Ways to use Llama 3.2 11B Vision Instruct in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your Meta API key. osFoundry discovers Llama 3.2 11B Vision Instruct automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
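The page does not document the request format of the hosted API, so the following is only a sketch under assumptions: it supposes the endpoint accepts an OpenAI-style chat-completions payload in which an image is passed as an `image_url` content part. The `endpoint` and `apiKey` parameters and both helper names are hypothetical; the model slug is taken from the source URL at the bottom of this page.

```typescript
// One message part: either plain text or a reference to an image.
type ContentPart =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string } };

interface ChatRequest {
  model: string;
  messages: { role: "user"; content: ContentPart[] }[];
}

// Slug as it appears in the source URL; your workspace may expose a different id.
const MODEL_ID = "meta-llama/llama-3.2-11b-vision-instruct";

// Hypothetical helper: builds a single-turn request asking a question about one image.
function buildVisionRequest(question: string, imageUrl: string): ChatRequest {
  return {
    model: MODEL_ID,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: question },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      },
    ],
  };
}

// Hypothetical helper: POSTs the request to an endpoint you supply.
// Assumes bearer-token auth and a JSON response; neither is confirmed by this page.
async function askAboutImage(
  endpoint: string,
  apiKey: string,
  question: string,
  imageUrl: string,
): Promise<unknown> {
  const response = await fetch(endpoint, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(buildVisionRequest(question, imageUrl)),
  });
  if (!response.ok) {
    throw new Error(`request failed with status ${response.status}`);
  }
  return response.json();
}
```

Keeping the payload builder separate from the network call makes it easy to inspect or unit-test the request in CI without a live key.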
Run Llama 3.2 11B Vision Instruct yourself
Llama 3.2 11B Vision Instruct is also available as open weights: self-host it for full data control and no per-token cost. See the open-weights page for GPU requirements and a cost comparison against API pricing.
Licence
Hosted: usage is governed by the provider's API terms. Bring your own provider key.
No weights are distributed through this hosted listing. The separately available open weights are released under Meta's Llama 3.2 Community License.
Frequently asked about Llama 3.2 11B Vision Instruct
How much does Llama 3.2 11B Vision Instruct cost?
Llama 3.2 11B Vision Instruct is metered at $0.245 per 1M tokens for input and $0.245 per 1M tokens for output. Bring your own Meta API key; osFoundry passes through provider pricing without markup.
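As a rough illustration of what those rates mean per request, here is a minimal sketch that turns token counts into an estimated cost. The helper name `estimateCostUsd` is hypothetical, and the page does not say how images are converted into input tokens, so the token counts are simply whatever your provider reports.

```typescript
// Rates quoted on this page, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.245;
const OUTPUT_USD_PER_MILLION = 0.245;

// Hypothetical helper: estimates the cost of one request in USD.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  return (
    (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION +
    (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION
  );
}

// Example: a request with 12,000 input tokens and 800 output tokens.
const exampleCostUsd = estimateCostUsd(12_000, 800);
console.log(`Estimated cost: $${exampleCostUsd.toFixed(6)}`);
```

At these rates a single request of that size costs a fraction of a cent; the estimate scales linearly with token volume.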
Can I use Llama 3.2 11B Vision Instruct commercially?
Commercial use is allowed with conditions. For this hosted listing, usage is governed by the provider's API terms and you bring your own provider key; no weights are distributed through it.
What is the context window of Llama 3.2 11B Vision Instruct?
Llama 3.2 11B Vision Instruct supports a 131K token context window.
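A quick way to sanity-check a request against that limit is sketched below. It assumes "131K" means 131,072 tokens (128 × 1024), which the page does not state, and that prompt and generated tokens share the same window, as is typical for decoder-only models. The helper name `fitsContext` is hypothetical.

```typescript
// Assumption: "131K" is taken to mean 131,072 tokens; the page only says 131K.
const CONTEXT_WINDOW_TOKENS = 131_072;

// Hypothetical helper: true when the prompt plus the requested output budget
// fits inside the context window.
function fitsContext(promptTokens: number, maxOutputTokens: number): boolean {
  return promptTokens + maxOutputTokens <= CONTEXT_WINDOW_TOKENS;
}

console.log(fitsContext(130_000, 1_000)); // 131,000 tokens requested
console.log(fitsContext(131_000, 1_000)); // 132,000 tokens requested
```

Checking the budget before sending lets a caller trim the prompt or lower the output limit instead of handling a provider-side rejection.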
Can I run Llama 3.2 11B Vision Instruct locally?
Not through this listing, which is hosted only and accessed via the Meta API. The model is also available as open weights that you can self-host; see "Run Llama 3.2 11B Vision Instruct yourself" above.
What is Llama 3.2 11B Vision Instruct best at?
Llama 3.2 11B Vision Instruct is well-suited to image captioning and other tasks that combine visual and textual input. It produces text output and does not generate images.
How do I use Llama 3.2 11B Vision Instruct in osFoundry?
Paste your Meta API key in the key dialog (or deploy the open weights for self-hostable models), assign Llama 3.2 11B Vision Instruct to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by Meta on September 25, 2024. Source: https://openrouter.ai/meta-llama/llama-3.2-11b-vision-instruct