Yes, it is publicly available through the Hugging Face datasets library.

How do I access the dataset?

Use the Hugging Face datasets library and load it with the identifier 'osv5m/osv5m'.

License details are provided on the dataset's Hugging Face repository page.

osv5m — Free Dataset Docs, Examples & Alternatives (2026)

What is osv5m?

OpenStreetView-5M consists of street view imagery collected for training and evaluating models on worldwide visual geolocation.

It supports machine learning research in computer vision, particularly tasks that require inferring geographic position from image content.

What you can build with osv5m

Train Visual Geolocation Models

Develop deep learning models that predict latitude and longitude from street-level photos using the 5M image set.

Benchmark Image-Based Localization

Test and compare retrieval or regression algorithms on a globally distributed street-view collection.

Create Location Inference Tools

Fine-tune models for apps that estimate geographic position from user photos at worldwide scale.

Load osv5m

Python

from datasets import load_dataset

ds = load_dataset("osv5m/osv5m")

1Install the Hugging Face datasets library with pip
2Import load_dataset from the datasets package
3Load the dataset with load_dataset('osv5m/osv5m')
4Select the train split containing images and coordinates
5Preprocess images and labels for your geolocation pipeline

osv5m: pros & cons

Pros

+5 million street-level images
+Global geographic coverage
+Purpose-built for visual geolocation
+Directly loadable via Hugging Face

Cons

–Large storage and compute requirements
–No additional annotations beyond location
–Regional image distribution may be uneven

Did you find this helpful?

Frequently asked questions

A dataset of 5 million street-level images created for visual geolocation tasks by researchers at Imagine, LIGM, Ecole des Ponts.

User reviews

Verified reviews from the community shape this listing's rating.

Loading reviews…

Sign in to review

Similar datasets

Other ai & machine learning options worth comparing.

FineNews

AI & Machine Learning · ksolovev

Verified

News dataset for AI and machine learning workflows.

Dataset↓ 1.5MFree

hd_tmp

AI & Machine Learning · ayuo

Verified

Temporary AI/ML dataset for Hugging Face prototyping.

Dataset↓ 1.5MFree

results

AI & Machine Learning · mteb

Verified

MTEB benchmark results for text embedding model evaluations.

Dataset↓ 1.3MFree

osv5m

What is osv5m?

What you can build with osv5m

Train Visual Geolocation Models

Benchmark Image-Based Localization

Create Location Inference Tools

Load osv5m

osv5m: pros & cons

Pros

Cons

Frequently asked questions

What is OpenStreetView-5M?

Is the dataset free?

How do I access the dataset?

What is the license?

User reviews

Similar datasets

FineNews

hd_tmp

results

Promote osv5m