dataset-discovery

Official

Discover ML datasets from multiple sources.

Author: OpenLAIR
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Multi-source dataset discovery and ranking for ML tasks, saving researchers time locating relevant data.

Core Features & Use Cases

  • Cross-source search across HuggingFace, OpenML, GitHub, and papers.
  • Deduplication and relevance ranking to surface high-quality datasets.
  • Quick evaluation: preview metadata and pull representative samples for inspection.
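The deduplication and ranking step above can be sketched roughly as follows. This is a minimal illustration, not the Skill's actual implementation: the `DatasetRecord` type, the name-normalization rule, and the term-overlap score are all assumptions made for this example, and the records would in practice come from the sources listed above.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetRecord:
    """Hypothetical record shape for one search hit (not the Skill's own type)."""
    name: str
    source: str
    description: str
    downloads: int = 0


def normalize_name(name: str) -> str:
    # Assumed rule: drop an owner prefix such as "org/", lowercase,
    # and remove every non-alphanumeric character.
    base = name.split("/")[-1].lower()
    return re.sub(r"[^a-z0-9]+", "", base)


def deduplicate(records):
    # Keep one record per normalized name, preferring the most downloaded copy.
    best = {}
    for rec in records:
        key = normalize_name(rec.name)
        if key not in best or rec.downloads > best[key].downloads:
            best[key] = rec
    return list(best.values())


def relevance(record, query: str) -> float:
    # Fraction of query terms that appear in the record's name or description.
    terms = set(re.findall(r"[a-z0-9]+", query.lower()))
    if not terms:
        return 0.0
    text = set(re.findall(r"[a-z0-9]+", f"{record.name} {record.description}".lower()))
    return len(terms & text) / len(terms)


def rank(records, query: str):
    # Sort by relevance first, then by downloads as a tie-breaker.
    return sorted(
        deduplicate(records),
        key=lambda r: (relevance(r, query), r.downloads),
        reverse=True,
    )
```

Calling `rank(records, "sentiment classification")` on a mixed list of hits returns one entry per dataset, with the closest term matches first. A real ranking would likely weigh more signals (license, size, recency); term overlap is used here only to keep the sketch short.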

Quick Start

Find and rank datasets for a given ML task across multiple sources.

Dependency Matrix

Required Modules

requests

Components

scripts

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: dataset-discovery
Download link: https://github.com/OpenLAIR/dr-claw/archive/main.zip#dataset-discovery

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
