web-extractor
Community
Capture complete web content from dynamic pages.
Author: Touricks
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Web pages increasingly render content via JavaScript and load content lazily, making simple text extraction brittle or incomplete. This skill provides reliable extraction of the full visible text from dynamic pages, even when behind authentication, using strategies that render, scroll, intercept API calls, or transcribe canvas content.
Core Features & Use Cases
- Extracts text from dynamic, JS-rendered pages, SPAs, lazy-loaded sections, and canvas-based UIs.
- Supports multiple extraction strategies: Simple DOM read, Lazy Load, Virtual Scroll, API Interception, and Canvas Transcription.
- Produces deduplicated, output-ready text suitable for downstream processing and analysis.
- Use cases include scraping articles, dashboards, Notion/Google Docs-like docs, Confluence, and Unity WebGL/Canvas apps.
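The deduplication step mentioned above can be illustrated with a minimal sketch. It assumes the page text was captured as several overlapping snapshots (for example, one per scroll position) and merges them by dropping lines already seen. The function name `merge_snapshots` is hypothetical and is not taken from the skill's scripts. Exact-line matching is a simplification: it would also drop lines that legitimately repeat on the page.

```python
from typing import Iterable, List


def merge_snapshots(snapshots: Iterable[str]) -> str:
    """Merge overlapping text snapshots, keeping the first occurrence of each line.

    Hypothetical helper for illustration; not the skill's actual code.
    """
    seen = set()
    merged: List[str] = []
    for snapshot in snapshots:
        for raw_line in snapshot.splitlines():
            # Collapse runs of whitespace so trivially different copies match.
            line = " ".join(raw_line.split())
            if not line or line in seen:
                continue
            seen.add(line)
            merged.append(line)
    return "\n".join(merged)


if __name__ == "__main__":
    first_view = "Title\nParagraph one\nParagraph two"
    second_view = "Paragraph two\nParagraph three"
    print(merge_snapshots([first_view, second_view]))
```

Order is preserved, so text read top to bottom across scroll positions stays in reading order.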
Quick Start
Navigate to the target web page, allow time for rendering, and invoke the extraction workflow to capture and compile the full visible text.
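The "allow time for rendering" step can be sketched as a scroll-and-wait loop for lazily loaded pages: scroll to the bottom, pause, and stop once the page height has stopped growing. This is an assumption about how such a workflow might look, not the skill's implementation. `scroll_until_stable`, `get_height`, `scroll_to_bottom`, and `FakePage` are hypothetical names. With a real browser driver, `get_height` would typically read something like `document.body.scrollHeight`.

```python
import time
from typing import Callable, List


def scroll_until_stable(
    get_height: Callable[[], int],
    scroll_to_bottom: Callable[[], None],
    settle_seconds: float = 0.5,
    max_rounds: int = 50,
    stable_rounds: int = 2,
) -> int:
    """Scroll until the page height stops changing; return the number of scrolls."""
    last_height = get_height()
    unchanged = 0
    rounds = 0
    while rounds < max_rounds and unchanged < stable_rounds:
        scroll_to_bottom()
        time.sleep(settle_seconds)  # give lazy-loaded content time to render
        rounds += 1
        height = get_height()
        if height == last_height:
            unchanged += 1
        else:
            unchanged = 0
            last_height = height
    return rounds


class FakePage:
    """Stand-in for a browser page: each scroll may load more content."""

    def __init__(self, growth: List[int]) -> None:
        self.height = 1000
        self._growth = list(growth)

    def get_height(self) -> int:
        return self.height

    def scroll(self) -> None:
        if self._growth:
            self.height += self._growth.pop(0)


if __name__ == "__main__":
    page = FakePage([500, 500, 500])
    scrolls = scroll_until_stable(page.get_height, page.scroll, settle_seconds=0)
    print(scrolls, page.height)
```

The `max_rounds` cap guards against infinite feeds, and requiring several unchanged rounds reduces the chance of stopping during a slow network response.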
Dependency Matrix
Required Modules
None required
Components
scripts, references, assets
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: web-extractor
Download link: https://github.com/Touricks/fanshi_personal_skills/archive/main.zip#web-extractor
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.