The debug dataset is a small collection of fewer than 1K records, created by rtrm for the ai-ml domain.
It is intended for basic testing scenarios that need only a small volume of data.
Run data loading and preprocessing scripts against this tiny dataset to verify that the code works before scaling to larger data.
Feed the records into training loops to isolate bugs in model architecture or loss functions without long wait times.
Include the dataset in automated tests to confirm ML workflows execute correctly on every code commit.
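The use cases above can be sketched as a single smoke test that a CI job might run on each commit. This is a minimal illustration under stated assumptions, not the dataset's actual schema or an official workflow: the `x`/`y` fields, the in-memory `RECORDS` stand-in, and the helper names `preprocess`, `train`, and `smoke_test` are all hypothetical. In practice the records would come from the debug dataset itself.

```python
# Hypothetical stand-in records; the real dataset's fields are not documented here.
RECORDS = [{"x": float(i), "y": 2.0 * i + 1.0} for i in range(20)]


def preprocess(record):
    """Scale the feature to [0, 1]; assumes x ranges over 0..19 as in RECORDS."""
    return {"x": record["x"] / 19.0, "y": record["y"]}


def train(records, steps=200, lr=0.1):
    """Fit y = w*x + b with plain gradient descent; return the MSE at each step."""
    w, b = 0.0, 0.0
    n = len(records)
    losses = []
    for _ in range(steps):
        grad_w = grad_b = loss = 0.0
        for r in records:
            err = w * r["x"] + b - r["y"]
            loss += err * err
            grad_w += 2 * err * r["x"]
            grad_b += 2 * err
        losses.append(loss / n)
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return losses


def smoke_test(records, max_records=1000):
    """Check dataset size, run preprocessing, and confirm the loss goes down."""
    assert 0 < len(records) < max_records, "debug data should stay under 1K records"
    processed = [preprocess(r) for r in records]
    assert all(0.0 <= r["x"] <= 1.0 for r in processed), "preprocessing out of range"
    losses = train(processed)
    assert losses[-1] < losses[0], "loss did not decrease; check model or loss code"
    return losses


if __name__ == "__main__":
    history = smoke_test(RECORDS)
    print(f"initial loss {history[0]:.3f}, final loss {history[-1]:.3f}")
```

Because the data is tiny, the whole check finishes in well under a second, so a failing assertion points at the pipeline or model code rather than at data volume or training time.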
# Load the debug dataset by its repository ID.
from datasets import load_dataset
ds = load_dataset("rtrm/debug")