Is Llama 3.3 Euryale 70B free to use?

Pricing information is not specified in the model details and depends on the platform hosting it.

How can users access Llama 3.3 Euryale 70B?

Access occurs through platforms that host the fine-tune created by Sao10k, typically via API or local inference.

What types of tasks is this model best suited for?

It performs well on long-context text reasoning, creative storytelling, role-playing, narrative generation, and instruction following.

Does the model support uncensored responses?

Yes, it is capable of uncensored conversational responses as part of its listed capabilities.

Llama 3.3 Euryale 70B by Sao10k — Specs, Pricing, Benchmarks (2026)

About Llama 3.3 Euryale 70B

Built on the Llama 3.3 architecture, this 70B-parameter model processes up to 131k tokens of text input and output. Its design emphasizes coherent handling of lengthy documents and multi-turn conversations without requiring local hardware.

Because the weights remain closed, users access the model through hosted inference endpoints. This setup suits applications that need reliable, high-volume text processing while avoiding the overhead of self-hosting large models.

Typical usage includes summarization, drafting, and analysis of long documents where maintaining context across many tokens is essential. The model delivers consistent performance for professional and creative writing workflows that rely on extended context.

Capabilities

Long-context text reasoning

Creative storytelling

Role-playing and character simulation

Narrative generation

Uncensored conversational responses

Instruction following

How Llama 3.3 Euryale 70B compares

Llama 3.3 Euryale 70B (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Llama 3.3 Euryale 70B ranks #63 of 141

$0.63

Ring-2.6-1T

$0.63

Ling-2.6-1T

$0.65

Gemma 2 27B

$0.65

ReMM SLERP 13B

$0.70

Hermes 3 70B Instruct

$0.74

Llama 3 70B Instruct

$0.75

Llama 3.3 Euryale 70B

$0.75

Mercury 2

$0.77

DeepSeek V3 0324

$0.78

Qwen-Plus

$0.78

Qwen Plus 0728

$0.78

Qwen Plus 0728

$0.78

Qwen3 Next 80B A3B Thinking

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-Form Narrative Projects

The 131072-token context supports sustained reasoning across extended documents, enabling coherent development of multi-chapter stories without losing track of earlier plot points.

Character-Driven Role-Play Sessions

Strong performance in role-playing and character simulation combined with uncensored responses allows users to maintain consistent personas over long interactive exchanges.

Creative Storytelling Assistance

The model follows detailed instructions while generating original narratives, making it effective for authors seeking help with plot outlines, dialogue, and world-building.

Strengths & limitations

Strengths

+High-quality creative writing and roleplay
+Strong coherence across long contexts
+Flexible and expressive output style
+Good at maintaining character consistency

Limitations

–Text-only modality
–May favor creative flair over strict factual accuracy
–Fine-tune can produce less predictable behavior on non-RP tasks

Cost calculator

Estimate what Llama 3.3 Euryale 70B would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00103

per request

$10.25

estimated / month

Based on Llama 3.3 Euryale 70B's $0.65/1M input · $0.75/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "sao10k/l3.3-euryale-70b",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: sao10k/l3.3-euryale-70b

Editor's verdict

Our take on Llama 3.3 Euryale 70B

Llama 3.3 Euryale 70B is Sao10k's proprietary language models with a 131K-token context window.

At $0.75 per 1M output tokens, it is very cost-efficient for its class.

It is available through Sao10k's API and aggregators like OpenRouter.

Best suited to high-quality creative writing and roleplay and strong coherence across long contexts.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 131072 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Llama models

Sibling versions in the Llama family from Sao10k.

Llama 3.1 Euryale 70B v2.2

Sao10k · Language Models

Verified

A 70B-scale LLM built for extended text generation and context retention.

Closed131K ctx$0.85/1M out

Llama 3.1 70B Hanami x1

Sao10k · Language Models

Verified

A closed-weight Llama 3.1 70B variant by Sao10k for text generation.

Closed16K ctx$3.00/1M out

Llama 3 8B Lunaris

Sao10k · Language Models

Verified

Sao10k's Llama 3 8B Lunaris handles extended text tasks with precision.

Closed8K ctx$0.05/1M out

Similar models

Other language models worth comparing.

DeepSeek V4 Pro

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text contexts.

OpenII 51.51049K ctx$0.87/1M out

DeepSeek V4 Flash

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text context handling.

OpenII 46.51049K ctx$0.18/1M out

Qwen3 Coder Plus

Alibaba Qwen · Language Models

Verified

Open-weight coder built for million-token codebases and complex tasks.

Open1000K ctx$3.25/1M out

Llama 3.3 Euryale 70B

About Llama 3.3 Euryale 70B

Capabilities

How Llama 3.3 Euryale 70B compares

Price

Best for

Long-Form Narrative Projects

Character-Driven Role-Play Sessions

Creative Storytelling Assistance

Strengths & limitations

Strengths

Limitations

Cost calculator

Quick start

Editor's verdict

Frequently asked questions

What is the context length of Llama 3.3 Euryale 70B?

Is Llama 3.3 Euryale 70B free to use?

How can users access Llama 3.3 Euryale 70B?

What types of tasks is this model best suited for?

Does the model support uncensored responses?

User reviews

Other Llama models

Llama 3.1 Euryale 70B v2.2

Llama 3.1 70B Hanami x1

Llama 3 8B Lunaris

Similar models

DeepSeek V4 Pro

DeepSeek V4 Flash

Qwen3 Coder Plus

Promote Llama 3.3 Euryale 70B