Skip to content
SWE-bench_Multilingual logo

SWE-bench_Multilingual

Verified

Multilingual benchmark for AI models resolving GitHub issues in code repositories.

DatasetText & NLP620K/moFree
Open dataset
Updated 2026-06-15

What is SWE-bench_Multilingual?

SWE-bench_Multilingual provides a collection of GitHub issues and associated code repositories for evaluating models on multilingual software engineering tasks.

It supports benchmark evaluations in NLP and code generation for researchers focused on multilingual capabilities in software issue resolution.

What you can build with SWE-bench_Multilingual

Benchmarking multilingual code agents

Measure how well LLMs resolve GitHub issues across non-English repositories and programming languages.

Training cross-lingual repair models

Fine-tune models on issue-to-patch pairs from multiple natural languages to improve generalization.

Comparing language-specific performance

Run controlled experiments to quantify accuracy gaps between English and other language codebases.

Load SWE-bench_Multilingual

Python
from datasets import load_dataset

ds = load_dataset("SWE-bench/SWE-bench_Multilingual")
  1. 1pip install datasets
  2. 2from datasets import load_dataset
  3. 3ds = load_dataset('SWE-bench/SWE-bench_Multilingual')
  4. 4print(ds['test'][0])
  5. 5Use the 'instance_id', 'problem_statement' and 'patch' fields for evaluation

SWE-bench_Multilingual: pros & cons

Pros

  • +Extends SWE-bench to non-English languages
  • +Real GitHub issues and patches
  • +Directly loadable via Hugging Face
  • +Supports standardized model comparisons

Cons

  • Evaluation requires full repository setup
  • Limited documentation on language coverage
  • High compute cost for full runs
Did you find this helpful?

Frequently asked questions

A multilingual version of the SWE-bench dataset containing real software engineering tasks from GitHub issues in multiple languages.

User reviews

Verified reviews from the community shape this listing's rating.

Loading reviews…

Sign in to review

Promote SWE-bench_Multilingual

Add this badge to your website, or share the tool.

DFeatured on DhanasviSWE-bench_Multilingual 0