Where can I access Relace Search?

Access details are not specified in the model listing.

What is the pricing for Relace Search?

Pricing information is not included in the available specifications.

What type of model is Relace Search?

It is an LLM developed by Relace and optimized for search-related tasks.

How is Relace Search typically used?

It is suited for retrieval and question-answering workloads that benefit from very long context.

Relace Search by Relace — Specs, Pricing, Benchmarks (2026)

About Relace Search

Relace Search is a proprietary large language model whose weights and parameter count are not publicly disclosed. Its defining technical feature is the 256000-token context window, which enables processing of extended text sequences without early truncation. This architecture is optimized for tasks that depend on maintaining context across lengthy inputs.

Strengths center on sustained coherence during search and retrieval operations involving large documents or corpora. Because the model is closed-weight and text-only, it delivers focused performance on semantic matching and information extraction without multimodal overhead.

Common applications include enterprise document search, long-form content analysis, and retrieval-augmented systems that must reference earlier sections of extensive texts. Integration typically occurs through hosted APIs where the large context can be leveraged for accurate, context-aware results.

Capabilities

Long-context text processing

Semantic search and retrieval

Document analysis and synthesis

Query-based reasoning

Information extraction

How Relace Search compares

Relace Search (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Relace Search ranks #29 of 34

$1.2

KAT-Coder-Pro V2

$1.2

MiniMax M2-her

$1.3

Relace Apply 3

$1.8

GLM 4.7

$1.9

GLM 5

$2.2

GLM 4.5

$2.5

Nemotron 3 Ultra

$3.0

Relace Search

$3.3

Qwen3 Coder Plus

$3.8

Qwen3.7 Max

$3.9

Qwen3 Max

$4.0

GLM 5 Turbo

$6.2

Qwen3.6 Max Preview

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-context document retrieval

Relace Search handles queries over extensive inputs thanks to its 256000-token context, enabling direct search across full reports or repositories without chunking.

Semantic search over knowledge bases

The model supports intent-aware lookup within large structured or unstructured collections while preserving surrounding context for accurate results.

Multi-turn research sessions

Users can maintain extended conversations that reference prior information spread across thousands of tokens for iterative exploration.

Strengths & limitations

Strengths

+Handles very large text inputs
+Search-oriented design
+Pure text focus for efficiency

Limitations

–Text modality only
–No built-in vision or code execution
–Standard LLM hallucination risks

Cost calculator

Estimate what Relace Search would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00250

per request

$25

estimated / month

Based on Relace Search's $1.00/1M input · $3.00/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "relace/relace-search",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: relace/relace-search

Editor's verdict

Our take on Relace Search

Relace Search is Relace's proprietary language models with a 256K-token context window.

At $3.00 per 1M output tokens, it is mid-priced for its class.

It is available through Relace's API and aggregators like OpenRouter.

Best suited to handles very large text inputs and search-oriented design.

Did you find this helpful?

Frequently asked questions

The model provides a context window of 256000 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Sign in to review

Other Relace models

Sibling versions in the Relace family from Relace.

Relace Apply 3

Relace · Language Models

Verified

Proprietary LLM built for 256k-token text contexts.

Closed256K ctx$1.25/1M out

Similar models

Other language models worth comparing.

DeepSeek V4 Flash

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text context handling.

OpenII 46.51049K ctx$0.18/1M out

DeepSeek V4 Pro

DeepSeek · Language Models

Verified

Open-weight LLM built for million-token text contexts.

OpenII 51.51049K ctx$0.87/1M out

Qwen3 Coder Plus

Alibaba Qwen · Language Models

Verified

Open-weight coder built for million-token codebases and complex tasks.

Open1000K ctx$3.25/1M out

Relace Search

About Relace Search

Capabilities

How Relace Search compares

Price

Best for

Long-context document retrieval

Semantic search over knowledge bases

Multi-turn research sessions

Strengths & limitations

Strengths

Limitations

Cost calculator

Quick start

Editor's verdict

Frequently asked questions

What context length does Relace Search support?

Where can I access Relace Search?

What is the pricing for Relace Search?

What type of model is Relace Search?

How is Relace Search typically used?

User reviews

Other Relace models

Relace Apply 3

Similar models

DeepSeek V4 Flash

DeepSeek V4 Pro

Qwen3 Coder Plus

Promote Relace Search