Hermes 4 70B
VerifiedOpen-weight 70B LLM built for long-context text tasks.
About Hermes 4 70B
Hermes 4 70B follows an open-weight design that grants full parameter access for local deployment and further adaptation. Its 131072-token context enables coherent handling of extended documents and multi-step interactions without truncation.
Key strengths center on reliable text generation across varied prompts while maintaining consistency over long sequences. The fully open release encourages community inspection, modification, and integration into custom pipelines.
Typical usage includes document summarization, code assistance, and analytical reasoning where large context retention matters. Developers often run it on-premise or via fine-tuning frameworks to match specific text workloads.
Capabilities
How Hermes 4 70B compares
Hermes 4 70B (striped bar) vs other language models on intelligence, speed and price.
Price
USD per 1M output tokens · Lower is better · Hermes 4 70B ranks #22 of 78
Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).
Best for
Long-document analysis
The model handles extended inputs effectively for summarizing research papers, legal contracts, or technical documentation while maintaining coherence across the full context.
Complex software development
It supports multi-step code generation and debugging for large projects, following detailed instructions to produce functional implementations in various languages.
Interactive narrative creation
Strong creative writing and role-play abilities enable sustained, character-consistent storytelling or scenario simulation over many turns.
Strengths & limitations
Strengths
- +Strong instruction adherence
- +Coherent handling of 128k context
- +Versatile across reasoning and creative tasks
- +Helpful and detailed responses
Limitations
- –Text-only modality
- –No built-in vision or tool-use scaffolding
- –Compute-heavy at 70B scale
Cost calculator
Estimate what Hermes 4 70B would cost for your usage.
Based on Hermes 4 70B's $0.13/1M input · $0.40/1M output. Estimate only — actual cost varies by provider and caching.
Quick start
OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://openrouter.ai/api/v1",
apiKey: process.env.OPENROUTER_API_KEY,
});
const completion = await client.chat.completions.create({
model: "nousresearch/hermes-4-70b",
messages: [{ role: "user", content: "Hello!" }],
});
console.log(completion.choices[0].message.content);Model slug: nousresearch/hermes-4-70b
Editor's verdict
Hermes 4 70B is Nous Research's open-weight language models with a 131K-token context window.
At $0.40 per 1M output tokens, it is very cost-efficient for its class.
As an open-weight model you can self-host it or call it through a hosted API.
Best suited to strong instruction adherence and coherent handling of 128k context.
Frequently asked questions
The model provides a context window of 131072 tokens.
User reviews
Real, verified reviews from the community shape this model's rating.
Loading reviews…
Other Hermes models
Sibling versions in the Hermes family from Nous Research.