aiml-moderation-content
Community
Benchmark content moderation across prompts and model outputs.
Author: wuyoscar
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides a structured, end-to-end benchmark for evaluating content moderation across multiple data variants, helping teams measure the effectiveness of safety filters in LLM workflows.
Core Features & Use Cases
- Multi-variant evaluation: covers prompt content, user prompts, and model outputs to test recall and robustness.
- Self-contained scoring: uses a lightweight rubric to surface strengths and gaps in model safety without relying on external services.
- Use Case: QA teams can run the benchmark to compare safety performance across models or updates, ensuring consistent guardrails before deployment.
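As a rough illustration of what per-variant scoring could look like, the sketch below computes recall (harmful items flagged divided by all harmful items) for each of the three variants. The variant labels, the record layout, and the `score_variants` function are all hypothetical names chosen for this example; the Skill's actual rubric may differ.

```python
from collections import defaultdict

# Hypothetical labels mirroring the three variants named in the feature list.
VARIANTS = ("prompt_content", "user_prompt", "model_output")


def score_variants(records):
    """Return per-variant recall: harmful items flagged / all harmful items.

    Each record is a dict with these keys (all assumed for this sketch):
      variant -- one of VARIANTS
      harmful -- ground-truth label (bool)
      flagged -- whether the moderation filter flagged the item (bool)
    """
    harmful = defaultdict(int)
    caught = defaultdict(int)
    for record in records:
        if record["harmful"]:
            harmful[record["variant"]] += 1
            if record["flagged"]:
                caught[record["variant"]] += 1
    # A variant with no harmful examples gets None rather than a misleading 0.0.
    return {
        v: (caught[v] / harmful[v]) if harmful[v] else None
        for v in VARIANTS
    }


# Tiny made-up sample, only to show the call shape.
sample = [
    {"variant": "prompt_content", "harmful": True, "flagged": True},
    {"variant": "prompt_content", "harmful": True, "flagged": False},
    {"variant": "user_prompt", "harmful": True, "flagged": True},
    {"variant": "model_output", "harmful": False, "flagged": False},
]
scores = score_variants(sample)
```

Reporting the variants separately, rather than as one pooled number, makes it easier to see whether a filter that performs well on prompts is weaker on model outputs.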
Quick Start
Run the aiml-moderation-content benchmark with the included prompts to evaluate content moderation performance across all three variants.
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: aiml-moderation-content
Download link: https://github.com/wuyoscar/ISC-Bench/archive/main.zip#aiml-moderation-content
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
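For a manual install, the same steps can be sketched in Python. This is a minimal sketch under stated assumptions: it assumes the archive contains a directory literally named `aiml-moderation-content` somewhere in its tree (the repository layout is not verified here), and the helper names `extract_skill` and `install_from_url` are hypothetical.

```python
import urllib.request
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_path, skill_name, skills_dir):
    """Copy the folder named `skill_name` out of the archive into skills_dir.

    Assumption: a directory called `skill_name` exists somewhere inside the
    archive. Returns the list of files written.
    """
    written = []
    with zipfile.ZipFile(zip_path) as zf:
        for info in zf.infolist():
            parts = PurePosixPath(info.filename).parts
            # Skip directory entries and anything outside the skill folder.
            if info.is_dir() or skill_name not in parts[:-1]:
                continue
            # Keep only the path from the skill folder downward.
            rel = Path(*parts[parts.index(skill_name):])
            if ".." in rel.parts:
                continue  # refuse entries that try to escape the target
            target = Path(skills_dir) / rel
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(zf.read(info))
            written.append(target)
    return written


def install_from_url(url, skill_name, skills_dir=".claude/skills"):
    """Download the archive to a temporary file, then extract the skill."""
    archive_path, _headers = urllib.request.urlretrieve(url)
    return extract_skill(archive_path, skill_name, skills_dir)
```

Calling `install_from_url("https://github.com/wuyoscar/ISC-Bench/archive/main.zip", "aiml-moderation-content")` from a project root would place the files under `.claude/skills/aiml-moderation-content/`, provided the assumption about the archive layout holds. An empty return value means no matching folder was found.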