Who developed Hermes 4 70B?

Hermes 4 70B was created by Nous Research.

How can users access Hermes 4 70B?

It is available through Nous Research's model distribution channels and compatible inference platforms.

What is the pricing for Hermes 4 70B?

Current pricing details are listed on the Nous Research website and associated API documentation.

Hermes 4 70B by Nous Research — Specs, Pricing, Benchmarks (2026)

Q: What types of tasks does Hermes 4 70B handle well?

It performs strongly on long-context reasoning, instruction following, code generation, creative writing, multi-step problem solving, and conversational role-play.

About Hermes 4 70B

Hermes 4 70B follows an open-weight design that grants full parameter access for local deployment and further adaptation. Its 131072-token context enables coherent handling of extended documents and multi-step interactions without truncation.

Key strengths center on reliable text generation across varied prompts while maintaining consistency over long sequences. The fully open release encourages community inspection, modification, and integration into custom pipelines.

Typical usage includes document summarization, code assistance, and analytical reasoning where large context retention matters. Developers often run it on-premise or via fine-tuning frameworks to match specific text workloads.

Capabilities

Long-context reasoning

Instruction following

Code generation

Creative writing

Multi-step problem solving

Conversational role-play

How Hermes 4 70B compares

Hermes 4 70B (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Hermes 4 70B ranks #22 of 78

$0.30

Step 3.5 Flash

$0.30

MiMo-V2-Flash

$0.30

gpt-oss-safeguard-20b

$0.34

DeepSeek V3.2

$0.35

Phi 4 Mini Instruct

$0.40

GLM 4.7 Flash

$0.40

Hermes 4 70B

$0.40

Llama 3.3 Nemotron Super 49B V1.5

$0.40

Qwen3 30B A3B Thinking 2507

$0.41

DeepSeek V3.2 Exp

$0.45

Nemotron 3 Super

$0.50

Cydonia 24B V4.1

$0.50

Olmo 3 32B Think

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-document analysis

The model handles extended inputs effectively for summarizing research papers, legal contracts, or technical documentation while maintaining coherence across the full context.

Complex software development

It supports multi-step code generation and debugging for large projects, following detailed instructions to produce functional implementations in various languages.

Interactive narrative creation

Strong creative writing and role-play abilities enable sustained, character-consistent storytelling or scenario simulation over many turns.

Strengths & limitations

Strengths

+Strong instruction adherence
+Coherent handling of 128k context
+Versatile across reasoning and creative tasks
+Helpful and detailed responses

Limitations

–Text-only modality
–No built-in vision or tool-use scaffolding
–Compute-heavy at 70B scale

Cost calculator

Estimate what Hermes 4 70B would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00033

per request

$3.3

estimated / month

Based on Hermes 4 70B's $0.13/1M input · $0.40/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "nousresearch/hermes-4-70b",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: nousresearch/hermes-4-70b

Editor's verdict

Our take on Hermes 4 70B

Hermes 4 70B is Nous Research's open-weight language models with a 131K-token context window.

At $0.40 per 1M output tokens, it is very cost-efficient for its class.

As an open-weight model you can self-host it or call it through a hosted API.

Best suited to strong instruction adherence and coherent handling of 128k context.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 131072 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Hermes models

Sibling versions in the Hermes family from Nous Research.

Hermes 4 405B

Nous Research · Language Models

Verified

Open-weight LLM with 131k context for complex text tasks.

Open131K ctx$3.00/1M out

Similar models

Other language models worth comparing.

DeepSeek V4 Pro

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text contexts.

OpenII 51.51049K ctx$0.87/1M out

DeepSeek V4 Flash

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text context handling.

OpenII 46.51049K ctx$0.18/1M out

MiMo-V2.5-Pro

Xiaomi · Language Models

Verified

MiMo-V2.5-Pro manages million-token text contexts for complex tasks.

ClosedII 35.61049K ctx$0.87/1M out

Hermes 4 70B

About Hermes 4 70B

Capabilities

How Hermes 4 70B compares

Price

Best for

Long-document analysis

Complex software development

Interactive narrative creation

Strengths & limitations

Strengths

Limitations

Cost calculator

Quick start

Editor's verdict

Frequently asked questions

What is the context length of Hermes 4 70B?

Who developed Hermes 4 70B?

How can users access Hermes 4 70B?

What is the pricing for Hermes 4 70B?

What types of tasks does Hermes 4 70B handle well?

User reviews

Other Hermes models

Hermes 4 405B

Similar models

DeepSeek V4 Pro

DeepSeek V4 Flash

MiMo-V2.5-Pro

Promote Hermes 4 70B