dataset-supply-chain-security
Community
Guard dataset provenance and integrity end-to-end.
Author: maruakshay
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Datasets can be poisoned, tampered with, or misrepresented on public hubs and in RAG pipelines, which can degrade model performance and lead to security breaches.
Core Features & Use Cases
- Pinning and hashing: enforce commit/content hashes for all external datasets and verify on download.
- Vetting and provenance: formal checks for publishers, licenses, and provenance before ingestion.
- Ingestion governance: logging, internal mirroring, and sandboxing of new datasets.
- Use Case: A model team audits a Hugging Face repo to ensure the dataset version matches the audited snapshot, and logs access through a central registry.
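The pinning feature above can be sketched as a small pre-ingestion check that rejects mutable references such as branch names. This is only an illustration, not the skill's own implementation: `DatasetPin`, `is_immutable_pin`, and `unpinned` are hypothetical names, and the check assumes that an immutable revision is a full 40-character hexadecimal Git commit SHA.

```python
import re
from dataclasses import dataclass
from typing import Iterable, List

# Assumption: an immutable revision is a full 40-character hex Git commit SHA.
# Branch names ("main") and tags can be moved, so they do not count as pins.
_COMMIT_SHA = re.compile(r"[0-9a-f]{40}")


@dataclass(frozen=True)
class DatasetPin:
    """Hypothetical record of one external dataset reference."""
    repo_id: str
    revision: str


def is_immutable_pin(pin: DatasetPin) -> bool:
    """Return True only if the revision looks like a full commit SHA."""
    return _COMMIT_SHA.fullmatch(pin.revision) is not None


def unpinned(pins: Iterable[DatasetPin]) -> List[str]:
    """List the repo ids whose revision is not an immutable commit SHA."""
    return [p.repo_id for p in pins if not is_immutable_pin(p)]
```

A review step could fail the build whenever `unpinned(...)` returns a non-empty list. When loading from the Hugging Face Hub, the `datasets` library's `load_dataset` accepts a `revision` argument, which is one place such a pinned SHA could be passed.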
Quick Start
Pin all external datasets to immutable commit or content hashes and verify them at download time before ingestion.
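As a minimal sketch of the verify-at-download step, the following computes a SHA-256 content hash of a downloaded file and compares it with the pinned value before anything is ingested. The function names and the choice of SHA-256 are assumptions made for illustration; the skill itself may use a different algorithm or workflow.

```python
import hashlib
import hmac
from pathlib import Path
from typing import Union


def sha256_of_file(path: Union[str, Path], chunk_size: int = 1 << 20) -> str:
    """Stream the file in chunks so large datasets are not loaded into memory."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path: Union[str, Path], expected_sha256: str) -> None:
    """Raise ValueError if the file's hash differs from the pinned hash."""
    actual = sha256_of_file(path)
    # compare_digest avoids leaking where the two hex strings first differ.
    if not hmac.compare_digest(actual, expected_sha256.strip().lower()):
        raise ValueError(
            f"hash mismatch for {path}: expected {expected_sha256}, got {actual}"
        )
```

Calling `verify_download(local_path, pinned_hash)` immediately after download, and before any parsing, keeps a tampered file from reaching the ingestion code at all.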
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: dataset-supply-chain-security
Download link: https://github.com/maruakshay/mii-ai-security/archive/main.zip#dataset-supply-chain-security
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.