Skip to content

Seed 1.6

Verified

Seed 1.6 processes image, text, and video with a 262k-token context.

Bytedance-seedMultimodalClosed
Vision
Model page
Updated 2026-06-14

About Seed 1.6

Seed 1.6 integrates vision and language capabilities into a single architecture. Its 262144-token context window supports extended sequences across multiple modalities. The design emphasizes unified processing of visual and textual data streams.

The model excels at tasks requiring simultaneous analysis of images, video clips, and lengthy text. Closed weights limit direct modification while enabling consistent API-based deployment. Typical applications include video summarization and cross-modal retrieval.

Users apply Seed 1.6 to workflows that combine visual content with detailed textual context. Its multimodal scope reduces the need for separate specialized models. Integration occurs through Bytedance-seed's hosted inference services.

Capabilities

Long-context reasoning
Multimodal understanding
Video analysis
Image-text integration
Cross-modal generation
Extended document processing

How Seed 1.6 compares

Seed 1.6 (striped bar) vs other multimodal on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Seed 1.6 ranks #26 of 63

$1.8
Qwen3.5 Plus 2026-04-20
$1.8
GLM 4.5V
$1.9
Qwen3.6 Plus
$2.0
GPT-5 Mini
$2.0
Devstral 2 2512
$2.0
Grok Build 0.1
$2.0
Seed 1.6
$2.0
Qwen3.6 27B
$2.0
Kimi K2.5
$2.5
Grok 4.20
$2.5
Grok 4.3
$2.5
Nova 2 Lite
$2.6
Qwen3 VL 235B A22B Thinking

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Extended document analysis

The model processes up to 262144 tokens to perform long-context reasoning over large reports, research papers, or legal texts while maintaining coherence across sections.

Video content evaluation

It performs video analysis by integrating visual frames with accompanying text or audio transcripts for tasks such as event detection and summarization.

Cross-modal content creation

Seed 1.6 supports image-text integration and cross-modal generation to produce unified outputs that combine visual and textual elements from diverse inputs.

Strengths & limitations

Strengths

  • +Very large context window
  • +Native support for video inputs
  • +Unified image and text handling

Limitations

  • No audio modality
  • High compute needs at full context length
  • Performance depends on input quality

Cost calculator

Estimate what Seed 1.6 would cost for your usage.

$0.00125
per request
$12.5
estimated / month

Based on Seed 1.6's $0.25/1M input · $2.00/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "bytedance-seed/seed-1.6",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: bytedance-seed/seed-1.6

Editor's verdict

Our take on Seed 1.6

Seed 1.6 is Bytedance-seed's proprietary multimodal with a 262K-token context window.

At $2.00 per 1M output tokens, it is mid-priced for its class.

It is available through Bytedance-seed's API and aggregators like OpenRouter.

Best suited to very large context window and native support for video inputs.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 262144 tokens, enabling extended document processing and long-context reasoning.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Seed models

Sibling versions in the Seed family from Bytedance-seed.

Promote Seed 1.6

Add this badge to your website, or share the tool.

DFeatured on DhanasviSeed 1.6 1