guardrails-safety-filter-builder
CommunityHard guardrails for safe AI prompts.
Authorpatricio0312rev
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This skill enables teams to implement robust safety filters for LLMs, including PII redaction, policy-driven constraints, prompt-injection detection, and safe refusal templates to reduce risk and protect user data.
Core Features & Use Cases
- Input filtering to block malicious prompts and reduce attack surface.
- Output redaction to mask sensitive information in responses.
- Policy constraints to enforce allowed content and provide safe refusals.
- PII detection and redaction to protect personal data in user interactions.
- Prompt-injection detection to identify and block attempts to manipulate system prompts or dynamics.
Quick Start
Configure the guardrails builder in your ML pipeline to automatically apply safety filters during request handling.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: guardrails-safety-filter-builder Download link: https://github.com/patricio0312rev/skillset/archive/main.zip#guardrails-safety-filter-builder Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.