trusted-web-scraper
Community
Securely collect official data from trusted sites.
Data & Analytics
#rate limiting #data collection #web scraping #research automation #information extraction #structured output #trusted sources
Author: ptreezh
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
It helps you reliably collect and structure information from official sources while reducing the risk of using untrusted or unreliable websites.
Core Features & Use Cases
- Trust verification: Checks domain trust signals (e.g., SSL validity and whether the domain looks like education/government/known corporate sites).
- Smart crawl strategy: Chooses a crawl approach based on site type and trust level, while enforcing basic guardrails (rate limiting and robots.txt respect flags).
- Structured extraction & cleaning: Extracts text/tables/images/doc links and cleans/normalizes extracted items into a consistent output schema.
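The trust check and crawl guardrails listed above can be illustrated with a minimal sketch. It uses only the Python standard library and is not the skill's actual implementation: the names `TRUSTED_SUFFIXES`, `looks_official`, `allowed_by_robots`, and `RateLimiter` are hypothetical, the suffix list is an assumed stand-in for the skill's real trust rules, and SSL certificate validation is left out because it needs a live connection.

```python
import time
import urllib.robotparser
from urllib.parse import urlparse

# Hypothetical suffix list; the skill's real trust rules are not documented here.
TRUSTED_SUFFIXES = (".gov", ".edu")


def looks_official(url: str) -> bool:
    """Return True for HTTPS URLs whose host ends in a trusted suffix."""
    parts = urlparse(url)
    host = (parts.hostname or "").lower()
    return parts.scheme == "https" and host.endswith(TRUSTED_SUFFIXES)


def allowed_by_robots(robots_txt: str, url: str, agent: str = "*") -> bool:
    """Check a URL against robots.txt text that was already downloaded."""
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


class RateLimiter:
    """Enforce a minimum delay between consecutive requests."""

    def __init__(self, min_interval: float) -> None:
        self.min_interval = min_interval
        self._last = None  # monotonic timestamp of the previous request

    def wait(self) -> None:
        """Sleep just long enough to keep requests min_interval apart."""
        now = time.monotonic()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                time.sleep(remaining)
        self._last = time.monotonic()
```

In a crawl loop, one would typically call `looks_official` and `allowed_by_robots` before each fetch and `RateLimiter.wait()` immediately before sending the request.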
Quick Start
Tell the skill to crawl a target official URL and extract specific content fields into JSON output.
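As a rough illustration of what "extract specific content fields into JSON output" could look like, the sketch below parses an HTML string and emits a small JSON record. It deliberately uses the standard library's `html.parser` rather than beautifulsoup4 so that it runs without extra installs. The output fields (`source_url`, `title`, `doc_links`) and the names `LinkAndTitleParser` and `extract_to_json` are assumptions for this example, not the skill's documented schema.

```python
import json
from html.parser import HTMLParser


class LinkAndTitleParser(HTMLParser):
    """Collect the page title and document links (PDF only in this sketch)."""

    def __init__(self) -> None:
        super().__init__()
        self.title = ""
        self.doc_links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href") or ""
            if href.lower().endswith(".pdf"):
                self.doc_links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def extract_to_json(html: str, source_url: str) -> str:
    """Return a JSON string with hypothetical, illustrative field names."""
    parser = LinkAndTitleParser()
    parser.feed(html)
    parser.close()
    record = {
        "source_url": source_url,
        # Collapse runs of whitespace as a simple cleaning step.
        "title": " ".join(parser.title.split()),
        "doc_links": parser.doc_links,
    }
    return json.dumps(record, ensure_ascii=False, indent=2)
```

Calling `extract_to_json(page_html, url)` on a fetched page returns a string that can be written straight to a `.json` file or passed to downstream tooling.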
Dependency Matrix
Required Modules
requests, beautifulsoup4
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: trusted-web-scraper
Download link: https://github.com/ptreezh/sscisubagent-skills/archive/main.zip#trusted-web-scraper
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.