adversarial-resilience
CommunityHarden agents against prompt and data attacks.
Legal & Compliance#audit logging#data exfiltration#secret scanning#agent security#permission boundaries#sandbox escape#adversarial prompt injection
Authoritallstartedwithaidea
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Adversarial resilience prevents AI agents from being hijacked through prompt injection or from leaking sensitive data when processing untrusted inputs.
Core Features & Use Cases
- Defense-in-depth security layers: input sanitization, instruction anchoring, output filtering, permission boundaries, and audit logging.
- Untrusted-data handling: treats user-provided fields (including campaign names, ad copy, and keyword lists) as data only, not instructions.
- Secret and sensitive-data protection: scans outputs for likely credentials/PII patterns and redacts before returning responses.
- Execution guardrails: enforces allowed paths and blocks dangerous command patterns regardless of model intent.
- Forensic traceability: logs violations and actions to support post-incident analysis and compliance.
Quick Start
Use the adversarial-resilience skill to secure your agent by hardening how it ingests untrusted campaign data, restricts tool permissions, and prevents sensitive information from appearing in responses.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: adversarial-resilience Download link: https://github.com/itallstartedwithaidea/agent-skills/archive/main.zip#adversarial-resilience Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.