Skip to content
Step 3.5 Flash logo

Step 3.5 Flash

Verified

Fast LLM built for 256k-token text contexts.

StepfunLanguage ModelsClosed
Model page
Updated 2026-06-14

About Step 3.5 Flash

Step 3.5 Flash is engineered as a proprietary LLM with an emphasis on speed and scale. Its 262144-token context window enables processing of lengthy documents or extended dialogues without truncation. The closed weights allow Stepfun to maintain performance optimizations and controlled deployment.

Typical usage includes document summarization, long-context chat systems, and content generation tasks. Developers access the model through APIs where rapid responses on large inputs are required. Its text-only modality keeps the focus on language understanding and generation at extended lengths.

Capabilities

Long-context reasoning
Text generation and summarization
Instruction following
Question answering
Code generation
Document analysis

How Step 3.5 Flash compares

Step 3.5 Flash (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Step 3.5 Flash ranks #15 of 72

$0.18
gpt-oss-120b
$0.19
Qwen3 30B A3B Instruct 2507
$0.20
Nemotron 3 Nano 30B A3B
$0.21
Hy3 preview
$0.27
Qwen3 Coder 30B A3B Instruct
$0.28
Qwen3 32B
$0.30
Step 3.5 Flash
$0.30
MiMo-V2-Flash
$0.30
gpt-oss-safeguard-20b
$0.34
DeepSeek V3.2
$0.35
Phi 4 Mini Instruct
$0.40
GLM 4.7 Flash
$0.40
Llama 3.3 Nemotron Super 49B V1.5

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long Document Analysis

Step 3.5 Flash excels at processing and extracting insights from extensive documents due to its 262144-token context window and document analysis capabilities.

Code Generation Projects

The model supports efficient code generation and instruction following, making it suitable for developers building or refactoring software components.

Content Summarization Tasks

It performs well on text generation and summarization of lengthy inputs, combining long-context reasoning with question answering for accurate overviews.

Strengths & limitations

Strengths

  • +Very large context window for extended inputs
  • +Efficient inference as a Flash variant
  • +Strong performance on text-only tasks
  • +Suitable for large-scale document processing

Limitations

  • Text modality only with no vision or audio support
  • Lacks specialized multimodal capabilities
  • Performance may vary on highly specialized domains

Cost calculator

Estimate what Step 3.5 Flash would cost for your usage.

$0.00024
per request
$2.4
estimated / month

Based on Step 3.5 Flash's $0.09/1M input · $0.30/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "stepfun/step-3.5-flash",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: stepfun/step-3.5-flash

Editor's verdict

Our take on Step 3.5 Flash

Step 3.5 Flash is Stepfun's proprietary language models with a 262K-token context window.

At $0.30 per 1M output tokens, it is very cost-efficient for its class.

It is available through Stepfun's API and aggregators like OpenRouter.

Best suited to very large context window for extended inputs and efficient inference as a flash variant.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 262144 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Step models

Sibling versions in the Step family from Stepfun.

Promote Step 3.5 Flash

Add this badge to your website, or share the tool.

DFeatured on DhanasviStep 3.5 Flash 1