guardrails-safety-filter-builder

Community

Hard guardrails for safe AI prompts.

Authorpatricio0312rev
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This skill enables teams to implement robust safety filters for LLMs, including PII redaction, policy-driven constraints, prompt-injection detection, and safe refusal templates to reduce risk and protect user data.

Core Features & Use Cases

  • Input filtering to block malicious prompts and reduce attack surface.
  • Output redaction to mask sensitive information in responses.
  • Policy constraints to enforce allowed content and provide safe refusals.
  • PII detection and redaction to protect personal data in user interactions.
  • Prompt-injection detection to identify and block attempts to manipulate system prompts or dynamics.

Quick Start

Configure the guardrails builder in your ML pipeline to automatically apply safety filters during request handling.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: guardrails-safety-filter-builder
Download link: https://github.com/patricio0312rev/skillset/archive/main.zip#guardrails-safety-filter-builder

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.