What is the pricing for Step 3.5 Flash?

Pricing details are not specified in the available model information.

How can users access Step 3.5 Flash?

Access methods are not detailed in the provided specifications.

What tasks is Step 3.5 Flash best suited for?

It supports long-context reasoning, text generation, summarization, instruction following, question answering, code generation, and document analysis.

Does Step 3.5 Flash handle code-related queries?

Yes, code generation is one of its listed capabilities.

Step 3.5 Flash by Stepfun — Specs, Pricing, Benchmarks (2026)

About Step 3.5 Flash

Step 3.5 Flash is engineered as a proprietary LLM with an emphasis on speed and scale. Its 262144-token context window enables processing of lengthy documents or extended dialogues without truncation. The closed weights allow Stepfun to maintain performance optimizations and controlled deployment.

Typical usage includes document summarization, long-context chat systems, and content generation tasks. Developers access the model through APIs where rapid responses on large inputs are required. Its text-only modality keeps the focus on language understanding and generation at extended lengths.

Capabilities

Long-context reasoning

Text generation and summarization

Instruction following

Question answering

Code generation

Document analysis

How Step 3.5 Flash compares

Step 3.5 Flash (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Step 3.5 Flash ranks #15 of 72

$0.18

gpt-oss-120b

$0.19

Qwen3 30B A3B Instruct 2507

$0.20

Nemotron 3 Nano 30B A3B

$0.21

Hy3 preview

$0.27

Qwen3 Coder 30B A3B Instruct

$0.28

Qwen3 32B

$0.30

Step 3.5 Flash

$0.30

MiMo-V2-Flash

$0.30

gpt-oss-safeguard-20b

$0.34

DeepSeek V3.2

$0.35

Phi 4 Mini Instruct

$0.40

GLM 4.7 Flash

$0.40

Llama 3.3 Nemotron Super 49B V1.5

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long Document Analysis

Step 3.5 Flash excels at processing and extracting insights from extensive documents due to its 262144-token context window and document analysis capabilities.

Code Generation Projects

The model supports efficient code generation and instruction following, making it suitable for developers building or refactoring software components.

Content Summarization Tasks

It performs well on text generation and summarization of lengthy inputs, combining long-context reasoning with question answering for accurate overviews.

Strengths & limitations

Strengths

+Very large context window for extended inputs
+Efficient inference as a Flash variant
+Strong performance on text-only tasks
+Suitable for large-scale document processing

Limitations

–Text modality only with no vision or audio support
–Lacks specialized multimodal capabilities
–Performance may vary on highly specialized domains

Cost calculator

Estimate what Step 3.5 Flash would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00024

per request

$2.4

estimated / month

Based on Step 3.5 Flash's $0.09/1M input · $0.30/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "stepfun/step-3.5-flash",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: stepfun/step-3.5-flash

Editor's verdict

Our take on Step 3.5 Flash

Step 3.5 Flash is Stepfun's proprietary language models with a 262K-token context window.

At $0.30 per 1M output tokens, it is very cost-efficient for its class.

It is available through Stepfun's API and aggregators like OpenRouter.

Best suited to very large context window for extended inputs and efficient inference as a flash variant.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 262144 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Step models

Sibling versions in the Step family from Stepfun.

Step 3.7 Flash

Stepfun · Multimodal

Verified

Multimodal model for long-context text, image, and video tasks.

ClosedII 42.6256K ctx$1.15/1M out

Similar models

Other language models worth comparing.

DeepSeek V4 Pro

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text contexts.

OpenII 51.51049K ctx$0.87/1M out

DeepSeek V4 Flash

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text context handling.

OpenII 46.51049K ctx$0.18/1M out

Owl Alpha

Openrouter · Language Models

Verified

Processes over a million tokens for long-form text tasks.

Closed1049K ctxFree

Step 3.5 Flash

About Step 3.5 Flash

Capabilities

How Step 3.5 Flash compares

Price

Best for

Long Document Analysis

Code Generation Projects

Content Summarization Tasks

Strengths & limitations

Strengths

Limitations

Cost calculator

Quick start

Editor's verdict

Frequently asked questions

What is the context length supported by Step 3.5 Flash?

What is the pricing for Step 3.5 Flash?

How can users access Step 3.5 Flash?

What tasks is Step 3.5 Flash best suited for?

Does Step 3.5 Flash handle code-related queries?

User reviews

Other Step models

Step 3.7 Flash

Similar models

DeepSeek V4 Pro

DeepSeek V4 Flash

Owl Alpha

Promote Step 3.5 Flash