shopify-supplier-extract
CommunityTurn supplier pages into canonical product JSON
Data & Analytics#html parsing#json normalization#supplier scraping#product extraction#csv mapping#image downloading#datasheet capture
Authorredbananastudios
Version1.0.0
Installs0
System Documentation
What problem does it solve?
It solves the problem of manually scraping supplier pages and converting messy HTML and images into structured, consistent product data for downstream importing.
Core Features & Use Cases
- CSV-driven extraction workflow: Takes a products.csv with required supplier and SKU fields, filters invalid/blocked rows, and processes each product deterministically.
- Structured raw JSON output: Cleans supplier HTML and extracts factual content into a canonical raw JSON format (tables kept structured, descriptions/bullets/specs/FAQ separated).
- Media asset harvesting: Downloads hi-res images and datasheets into per-product output folders for traceable publishing.
- Standard project workflow: Uses a known input location, produces output in a predictable output/{slug}-{our_sku}/ structure, and reports success/failure totals.
Quick Start
Ask the AI to run shopify-supplier-extract against your products CSV at ./brand-memory/input/products.csv (or ./input/products.csv) to produce output/raw.json plus downloaded images and datasheets for each non-amazon supplier row.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: shopify-supplier-extract Download link: https://github.com/redbananastudios/ai-library/archive/main.zip#shopify-supplier-extract Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.