browser-automation-agent

Community

Automate web browsers for AI.

Authorbesoeasy
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates web browser interactions for AI agents, enabling tasks like form filling, navigation, and data capture without manual intervention.

Core Features & Use Cases

  • Deterministic Element Selection: Uses accessibility tree snapshots for reliable targeting of web elements.
  • Browser Control: Open URLs, fill forms, click buttons, type text, navigate back/forward, and reload pages.
  • Content Capture: Take screenshots, generate PDFs, and extract page text or HTML.
  • Use Case: An AI agent needs to book a flight. It can use this Skill to open the booking website, fill in passenger details, select dates, click the search button, and then capture a screenshot of the results.

Quick Start

Use agent-browser to open the URL https://example.com and then take a screenshot named output.png.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: browser-automation-agent
Download link: https://github.com/besoeasy/open-skills/archive/main.zip#browser-automation-agent

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.