Hermes 2 Pro - Llama-3 8B
Built by NousResearch, Hermes 2 Pro - Llama-3 8B is a chat model with an 8K token context window. Hermes 2 Pro is an upgraded, retrained version of Nous Hermes 2, consisting of an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced...
Best for
- low-latency chat and routing
- request routing and triage
- text classification
Ways to use Hermes 2 Pro - Llama-3 8B in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your NousResearch API key. osFoundry discovers Hermes 2 Pro - Llama-3 8B automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
Run Hermes 2 Pro - Llama-3 8B yourself
Hermes 2 Pro - Llama-3 8B is also available as open weights. Self-host it for full data control and no per-token cost. The open-weights page lists GPU requirements and a cost comparison against API pricing.
Licence
Hosted-only model: usage is governed by the provider's API terms, and no weights are distributed. Bring your own provider key.
Frequently asked about Hermes 2 Pro - Llama-3 8B
How much does Hermes 2 Pro - Llama-3 8B cost?
Hermes 2 Pro - Llama-3 8B is metered at $0.140 per 1M input tokens and $0.140 per 1M output tokens. Bring your own NousResearch API key; osFoundry passes through provider pricing without markup.
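To make the metering concrete, here is a minimal sketch of a per-request cost estimate. It assumes the quoted rates are per one million tokens, and the helper name `estimateCostUsd` is hypothetical, not part of any osFoundry or NousResearch SDK.

```typescript
// Rates quoted on this page, in USD per 1M tokens (assumed unit: tokens).
const INPUT_USD_PER_MILLION = 0.14;
const OUTPUT_USD_PER_MILLION = 0.14;

// Hypothetical helper: estimates the cost of a single request in USD.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  return (
    (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION +
    (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION
  );
}

// Example: a request with 6,000 input tokens and 500 output tokens.
console.log(estimateCostUsd(6_000, 500).toFixed(6));
```

Because input and output are priced the same here, the estimate depends only on the total token count; the two rates are kept separate so the sketch still works if they diverge.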
Can I use Hermes 2 Pro - Llama-3 8B commercially?
Commercial use is allowed with conditions. This is a hosted-only model: usage is governed by the provider's API terms, and no weights are distributed. Bring your own provider key.
What is the context window of Hermes 2 Pro - Llama-3 8B?
Hermes 2 Pro - Llama-3 8B supports an 8K token context window.
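As a rough illustration of budgeting against that window, the sketch below checks whether a prompt plus a reserved output allowance is likely to fit. It rests on two assumptions not stated on this page: that "8K" means 8,192 tokens, and that English text averages about 4 characters per token. Both helper names are hypothetical, and the character heuristic is not the model's actual tokenizer.

```typescript
// Assumption: "8K" is taken to mean 8,192 tokens.
const CONTEXT_WINDOW_TOKENS = 8192;
// Assumption: crude average of 4 characters per token for English text.
const APPROX_CHARS_PER_TOKEN = 4;

// Hypothetical helper: rough token estimate from character length.
function approxTokenCount(text: string): number {
  return Math.ceil(text.length / APPROX_CHARS_PER_TOKEN);
}

// Hypothetical helper: true if the prompt plus the tokens reserved for
// the model's reply stay within the assumed context window.
function fitsContextWindow(prompt: string, reservedOutputTokens: number): boolean {
  return approxTokenCount(prompt) + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS;
}

// Example: a 4,000-character prompt with 512 tokens reserved for output.
console.log(fitsContextWindow("a".repeat(4000), 512));
```

For anything close to the limit, count tokens with the model's real tokenizer instead of this estimate.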
Can I run Hermes 2 Pro - Llama-3 8B locally?
No. Hermes 2 Pro - Llama-3 8B is hosted only and accessed via the NousResearch API. An open-weights equivalent is available to self-host; see "Run Hermes 2 Pro - Llama-3 8B yourself" above.
What is Hermes 2 Pro - Llama-3 8B best at?
Hermes 2 Pro - Llama-3 8B is well-suited to low-latency chat and routing, request routing and triage, and text classification.
How do I use Hermes 2 Pro - Llama-3 8B in osFoundry?
Paste your NousResearch API key in the key dialog (or deploy the open weights for self-hostable models), assign Hermes 2 Pro - Llama-3 8B to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by NousResearch on May 27, 2024. Source: https://openrouter.ai/nousresearch/hermes-2-pro-llama-3-8b