prompt-injection-red-teaming
CommunitySystematically test prompt-injection defenses.
Software Engineering#security-testing#prompt-injection#red-teaming#attack-scenarios#continuous-evaluation#LLM-robustness
Authormaruakshay
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Red-teaming an LLM-based system is often ad-hoc and brittle. This Skill provides a structured methodology to identify, categorize, and measure prompt-injection vulnerabilities so defenses can be validated and improved over time.
Core Features & Use Cases
- Systematic attack taxonomy across five classes (direct overrides, indirect injections, obfuscation, multi-turn manipulation, and role/context manipulation).
- Probing library design with documented payloads, expected outcomes, and quantitative bypass metrics.
- Continuous evaluation and governance workflow, including regression suites and deployment gating.
- Use Cases: security teams validating defenses in development, production readiness assessments, and ongoing security assurance for prompts and tools.
Quick Start
Run a red-team evaluation against your model using the defined probe library and document the results.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: prompt-injection-red-teaming Download link: https://github.com/maruakshay/mii-ai-security/archive/main.zip#prompt-injection-red-teaming Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.