web-extractor

Community

Capture complete web content from dynamic pages.

Author: Touricks
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Web pages increasingly render content with JavaScript and load it lazily, which makes simple text extraction brittle or incomplete. This skill extracts the full visible text from dynamic pages, including pages behind authentication, by rendering the page, scrolling, intercepting API calls, or transcribing canvas content.

Core Features & Use Cases

  • Extracts text from dynamic, JS-rendered pages, SPAs, lazy-loaded sections, and canvas-based UIs.
  • Supports multiple extraction strategies: Simple DOM read, Lazy Load, Virtual Scroll, API Interception, and Canvas Transcription.
  • Produces deduplicated, output-ready text suitable for downstream processing and analysis.
  • Use cases include scraping articles, dashboards, document editors such as Notion, Google Docs, and Confluence, and Unity WebGL/Canvas apps.
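
The deduplication step mentioned above can be illustrated with a minimal sketch. It assumes the Lazy Load and Virtual Scroll strategies return overlapping text chunks, one per scroll pass. The function name `merge_chunks` is hypothetical and is not taken from the skill's own scripts.

```python
from typing import Iterable, List


def merge_chunks(chunks: Iterable[str]) -> str:
    """Merge text captured over several passes, dropping repeated lines.

    Hypothetical helper: the skill's real deduplication logic may differ.
    """
    seen = set()
    merged: List[str] = []
    for chunk in chunks:
        for line in chunk.splitlines():
            # Collapse runs of whitespace so "a  b" and "a b" compare equal.
            key = " ".join(line.split())
            if not key or key in seen:
                continue  # skip blank lines and lines already captured
            seen.add(key)
            merged.append(key)
    return "\n".join(merged)
```

Line-level deduplication keeps first-seen order, which suits overlapping scroll windows. It would also drop lines that legitimately repeat in the page, so a real implementation might compare larger blocks instead.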

Quick Start

Navigate to the target web page, allow time for rendering, and invoke the extraction workflow to capture and compile the full visible text.
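
As a rough illustration of this workflow, the sketch below scrolls and re-reads a page until its text stops changing. The names `extract_until_stable`, `read_text`, and `scroll_once` are assumptions made for this example. In practice the two callables would wrap whatever browser automation tool is in use. `FakePage` is only a stand-in so the sketch runs without a browser.

```python
from typing import Callable, List


def extract_until_stable(
    read_text: Callable[[], str],
    scroll_once: Callable[[], None],
    max_rounds: int = 20,
    stable_rounds: int = 2,
) -> List[str]:
    """Collect distinct text snapshots until the page stops changing."""
    snapshots: List[str] = []
    unchanged = 0
    for _ in range(max_rounds):
        text = read_text()
        if snapshots and text == snapshots[-1]:
            unchanged += 1
            if unchanged >= stable_rounds:
                break  # nothing new after several scrolls: treat as done
        else:
            unchanged = 0
            snapshots.append(text)
        scroll_once()
    return snapshots


class FakePage:
    """Stand-in for a lazily loading page (illustration only)."""

    def __init__(self, sections: List[str]) -> None:
        self.sections = sections
        self.loaded = 1  # only the first section is visible initially

    def read_text(self) -> str:
        return "\n".join(self.sections[: self.loaded])

    def scroll_once(self) -> None:
        # Each scroll reveals one more section, up to the end of the page.
        self.loaded = min(self.loaded + 1, len(self.sections))
```

Calling `extract_until_stable(page.read_text, page.scroll_once)` on a `FakePage` returns one snapshot per newly revealed section. The `max_rounds` cap guards against pages that never stop loading content.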

Dependency Matrix

Required Modules

None required

Components

scripts, references, assets

💻 Claude Code Installation

Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: web-extractor
Download link: https://github.com/Touricks/fanshi_personal_skills/archive/main.zip#web-extractor

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
