prompt-injection-red-teaming

Community

Systematically test prompt-injection defenses.

Authormaruakshay
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Red-teaming an LLM-based system is often ad-hoc and brittle. This Skill provides a structured methodology to identify, categorize, and measure prompt-injection vulnerabilities so defenses can be validated and improved over time.

Core Features & Use Cases

  • Systematic attack taxonomy across five classes (direct overrides, indirect injections, obfuscation, multi-turn manipulation, and role/context manipulation).
  • Probing library design with documented payloads, expected outcomes, and quantitative bypass metrics.
  • Continuous evaluation and governance workflow, including regression suites and deployment gating.
  • Use Cases: security teams validating defenses in development, production readiness assessments, and ongoing security assurance for prompts and tools.

Quick Start

Run a red-team evaluation against your model using the defined probe library and document the results.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: prompt-injection-red-teaming
Download link: https://github.com/maruakshay/mii-ai-security/archive/main.zip#prompt-injection-red-teaming

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.