public-dataset-exploration

Name: public-dataset-exploration
Author: lightning-rod-labs

Official

Find raw public data and turn it into seeds.

Category: Data & Analytics
Tags: github, forecasting, data conversion, huggingface, dataset discovery, kaggle, seed generation

Version: 1.0.0

Installs: 0

System Documentation

What problem does it solve?

This Skill helps you find and shortlist raw public datasets in a new domain when you don’t yet have training-ready data.

Core Features & Use Cases

Dataset discovery across platforms: Search Kaggle, Hugging Face, and GitHub for domain-relevant raw or semi-structured datasets suitable for conversion.
Training-readiness filtering: Identify sources that are relevant but not training-ready, meaning the data can yield forecasting questions or document-style Q&A and is not already instruction-tuned or synthetic.
Seed creation workflow planning: Convert downloaded files into samples with the SDK’s conversion utilities, then assemble them into an input dataset for downstream pipelines.
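
The filtering step can be pictured with a small sketch. This is not the Skill's actual logic: the `Candidate` fields, the keyword list, and the function names are all hypothetical, chosen only to illustrate "relevant but not training-ready" and the 1–3 candidate shortlist.

```python
from dataclasses import dataclass

# Hypothetical markers suggesting a dataset is already training-ready.
TRAINING_READY_MARKERS = ("instruct", "synthetic", "sft", "alpaca")


@dataclass
class Candidate:
    name: str
    source: str           # "kaggle", "huggingface", or "github"
    description: str
    has_timestamps: bool  # dated events can become forecasting questions
    has_documents: bool   # free text can become document-style Q&A


def is_training_ready(c: Candidate) -> bool:
    """Flag datasets whose name or description hints at tuned/synthetic data."""
    text = f"{c.name} {c.description}".lower()
    return any(marker in text for marker in TRAINING_READY_MARKERS)


def can_yield_questions(c: Candidate) -> bool:
    """Raw data is useful if it can produce forecasting or document Q&A."""
    return c.has_timestamps or c.has_documents


def shortlist(candidates: list[Candidate], limit: int = 3) -> list[Candidate]:
    """Keep candidates that are usable but not training-ready, up to `limit`."""
    keep = [c for c in candidates
            if can_yield_questions(c) and not is_training_ready(c)]
    return keep[:limit]


# Invented example entries, for illustration only.
candidates = [
    Candidate("match-event-logs", "kaggle",
              "Raw event logs for football seasons", True, False),
    Candidate("sports-qa-instruct", "huggingface",
              "Synthetic instruction-tuned sports QA", False, True),
    Candidate("stadium-photos", "github",
              "Image files only", False, False),
]
picked = shortlist(candidates)
print([c.name for c in picked])
```

A keyword check like this is deliberately crude; in practice the judgment would come from reading each dataset's card or README.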

Use case: You’re starting a sports forecasting project with a clear domain focus but no documents or labels. This Skill guides you to find a usable raw dataset (e.g., event logs or match stats), convert it into samples, and package them as seeds for further labeling and training.
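
As a rough illustration of the conversion step in this use case, the sketch below turns rows of a match-stats CSV into seed-like records. It uses only the Python standard library as a stand-in and does not show the SDK's conversion utilities; the `rows_to_seeds` helper, the record fields (`id`, `source`, `text`), and the sample rows are all invented for illustration.

```python
import csv
import io


def rows_to_seeds(csv_text: str, source: str) -> list[dict]:
    """Turn each CSV row into one seed record with a flat text field."""
    reader = csv.DictReader(io.StringIO(csv_text))
    seeds = []
    for i, row in enumerate(reader):
        # Serialize the row as "column=value" pairs so it reads as one sample.
        text = "; ".join(f"{key}={value}" for key, value in row.items())
        seeds.append({"id": f"{source}-{i}", "source": source, "text": text})
    return seeds


# Toy rows, made up for this example (not from any real dataset).
sample = (
    "date,home,away,home_goals,away_goals\n"
    "2024-01-06,Lions,Hawks,2,1\n"
    "2024-01-13,Hawks,Bears,0,0\n"
)
seeds = rows_to_seeds(sample, "match-stats")
for seed in seeds:
    print(seed["id"], "|", seed["text"])
```

Keeping the date column in each sample matters for forecasting work, since a question can only be posed about an outcome if the sample records when it happened.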

Quick Start

Ask the Skill to explore public datasets for your domain and recommend 1–3 candidates that are relevant but not already training-ready.
