device-computer-automation
OfficialDrive your desktop UI safely via pixels.
Software Engineering#multi-monitor#screen capture#accessibility permissions#window focus#desktop ui automation#mouse and keyboard#pixel targeting
AuthorMuvon
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Operates your local desktop’s native user interface by taking targeted screenshots and issuing mouse, keyboard, and clipboard actions when no reliable UI semantic tree or automation API is available.
Core Features & Use Cases
- Pixel-anchor desktop targeting: Click and type by visual anchors and region-based recognition rather than accessibility IDs.
- Desktop-automation MCP control: Uses the desktop-automation server to move/click, type keys, press shortcuts, and capture screen regions.
- Safety and reliability guardrails: Enforces a verification loop (snapshot → locate → act → verify → record) and prevents risky actions like screen-lock/biometric bypass or destructive system changes.
Quick Start
Ask the agent to automate a native action on your desktop for a specific window, such as taking you through saving a file from the currently focused app while confirming the correct window focus first.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: device-computer-automation Download link: https://github.com/Muvon/octomind-tap/archive/main.zip#device-computer-automation Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.