dataset-supply-chain-security

Community

Guard dataset provenance and integrity end-to-end.

Authormaruakshay
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Datasets can be poisoned, tampered, or misrepresented across public hubs and RAG pipelines, risking degraded model performance and security breaches.

Core Features & Use Cases

  • Pinning and hashing: enforce commit/content hashes for all external datasets and verify on download.
  • Vetting and provenance: formal checks for publishers, licenses, and provenance before ingestion.
  • Ingestion governance: logging, internal mirroring, and sandboxing of new datasets.
  • Use Case: A model team audits a HuggingFace repo to ensure the dataset version matches the audited snapshot and logs access through a central registry.

Quick Start

Pin all external datasets to immutable commit or content hashes and verify them at download time before ingestion.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: dataset-supply-chain-security
Download link: https://github.com/maruakshay/mii-ai-security/archive/main.zip#dataset-supply-chain-security

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.