enforcing-skill-rules

Community

Measure and enforce skill rules to 100%.

Authorrbaumier
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Provides a reproducible, data-driven loop to measure per-rule effectiveness of an AI Skill, identify failures, and iterate wording or examples until every rule passes binary evaluation. This eliminates uncertainty about whether a skill's rules are followed, prevents regressions during compression, and documents discriminating vs non-discriminating rules.

Core Features & Use Cases

  • Extracts every rule from a SKILL.md as named assertions and tags them by category for coverage tracking.
  • Produces a single full-sweep trap prompt that violates all assertions, runs baseline and with-skill executions, and saves iteration artifacts for auditability.
  • Uses cross-model grading (separate grader model) to produce strict PASS/FAIL evidence, root-cause failures, discriminating flags per assertion, and benchmark reports.
  • Ideal for improving an existing skill, validating a new skill before deployment, compressing a skill without regression, and measuring variance reduction across runs.

Quick Start

Run a full-sweep evaluation: extract assertions, write one trap prompt that violates every rule, run baseline and three with-skill runs, then grade outputs with a separate cross-model grader and record benchmarks.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: enforcing-skill-rules
Download link: https://github.com/rbaumier/skills/archive/main.zip#enforcing-skill-rules

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.