device-computer-automation

Official

Drive your desktop UI safely via pixels.

AuthorMuvon
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Operates your local desktop’s native user interface by taking targeted screenshots and issuing mouse, keyboard, and clipboard actions when no reliable UI semantic tree or automation API is available.

Core Features & Use Cases

  • Pixel-anchor desktop targeting: Click and type by visual anchors and region-based recognition rather than accessibility IDs.
  • Desktop-automation MCP control: Uses the desktop-automation server to move/click, type keys, press shortcuts, and capture screen regions.
  • Safety and reliability guardrails: Enforces a verification loop (snapshot → locate → act → verify → record) and prevents risky actions like screen-lock/biometric bypass or destructive system changes.

Quick Start

Ask the agent to automate a native action on your desktop for a specific window, such as taking you through saving a file from the currently focused app while confirming the correct window focus first.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: device-computer-automation
Download link: https://github.com/Muvon/octomind-tap/archive/main.zip#device-computer-automation

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.