Skip to content

Reka Flash 3

Verified

Reka Flash 3 handles long text contexts up to 64k tokens efficiently.

RekaaiLanguage ModelsClosed
Model page
Updated 2026-06-15

About Reka Flash 3

Reka Flash 3 follows a closed-source design that keeps its architecture and weights proprietary. This structure supports a 65536-token context window while restricting direct access to model internals. The text-only modality focuses processing power on language tasks without image or audio support.

Its primary strengths lie in sustained coherence across lengthy documents and conversations. Organizations use it for content creation, summarization, and technical writing where large context retention matters. Typical deployments involve API access for enterprise applications requiring consistent text output.

Capabilities

Long-context reasoning
Text generation
Instruction following
Document summarization
Conversational responses
Question answering

How Reka Flash 3 compares

Reka Flash 3 (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Reka Flash 3 ranks #15 of 98

$0.15
Trinity Mini
$0.15
Rnj 1 Instruct
$0.18
DeepSeek V4 Flash
$0.18
gpt-oss-120b
$0.19
Qwen3 30B A3B Instruct 2507
$0.20
Nemotron 3 Nano 30B A3B
$0.20
Reka Flash 3
$0.21
Hy3 preview
$0.24
Qwen3 14B
$0.27
Qwen3 Coder 30B A3B Instruct
$0.28
Qwen3 32B
$0.30
Step 3.5 Flash
$0.30
MiMo-V2-Flash

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long Document Summarization

With its 65536 token context, Reka Flash 3 can process and summarize entire books or extensive reports in a single pass.

Extended Code Analysis

It excels at reviewing large code repositories while keeping track of dependencies across multiple files.

Multi-turn Dialogue Systems

The model maintains context over very long conversations, making it suitable for customer support bots handling complex queries.

Strengths & limitations

Strengths

  • +Strong handling of extended 64k-token contexts
  • +Focused text-only processing
  • +Efficient for language tasks
  • +Clear instruction adherence

Limitations

  • No vision or multimodal support
  • Context strictly capped at 65536 tokens
  • Lacks specialized domain fine-tuning details

Cost calculator

Estimate what Reka Flash 3 would cost for your usage.

$0.00020
per request
$2
estimated / month

Based on Reka Flash 3's $0.10/1M input · $0.20/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "rekaai/reka-flash-3",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: rekaai/reka-flash-3

Editor's verdict

Our take on Reka Flash 3

Reka Flash 3 is Rekaai's proprietary language models with a 66K-token context window.

At $0.20 per 1M output tokens, it is very cost-efficient for its class.

It is available through Rekaai's API and aggregators like OpenRouter.

Best suited to strong handling of extended 64k-token contexts and focused text-only processing.

Did you find this helpful?

Frequently asked questions

Reka Flash 3 has a context length of 65536 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Reka models

Sibling versions in the Reka family from Rekaai.

Promote Reka Flash 3

Add this badge to your website, or share the tool.

DFeatured on DhanasviReka Flash 3 0