Dataset Inspector is a desktop UI for inspecting local Lightning-AI/litData shards, MosaicML Streaming (MDS) shards, and WebDataset tar shards, with support for previewing Hugging Face and Zenodo datasets directly online without downloading.
- Inspect local LitData shards (
index.json+.bin/.zstchunks). - Inspect local MosaicML Streaming (MDS) shards (
index.json+.mds/.mds.zst). - Inspect local WebDataset shards (
.tar,.tar.gz,.tar.zst). - Inspect Hugging Face datasets via streaming API (no full local download).
- Inspect Zenodo datasets via HTTP Range request (no full local download).
- Preview json/audio/image, copy values, and open extracted fields with your default app.
Local LitData shards |
Local WebDataset tar shards |
Hugging Face dataset preview |
Zenodo record preview |
- Download Dataset Inspector installers from Releases.
- Browse local LitData/MosaicML/WebDataset folders, or HF URLs, or Zenodo URLs, then press Load.
- LitData / MosaicML shards: pick a shard → item → field, then preview fields.
- WebDataset shards: pick a shard → sample → field, then preview/open files.
- Hugging Face datasets: pick a split → row → field to preview values.
- Report issues/ feature requests: https://github.com/binbinsh/dataset-inspector/issues



