ocr
Official
Turn images into editable text with OCR.
Author: trpc-group
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This skill automates the extraction of text from images, removing the tedious manual transcription step and enabling rapid data capture.
Core Features & Use Cases
- Language support and multi-format input: OCR on PNG/JPG/JPEG/GIF/BMP/TIFF/WEBP with multi-language capability.
- Preprocessing for accuracy: Optional image preprocessing (grayscale, contrast, sharpening) to improve results.
- Output formats: Plain text or JSON with confidence scores for downstream analytics.
- Use Case: Digitize receipts, scanned documents, or research images to extract searchable text.
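The preprocessing and JSON output described above might look roughly like the following. This is a minimal sketch, not the skill's actual scripts/ocr.py: the function names, the contrast factor of 2.0, and the JSON layout are all assumptions made for illustration. It relies on pytesseract's image_to_data call, which in turn needs the Tesseract binary installed on the system.

```python
import json


def preprocess(image):
    """Grayscale, boost contrast, then sharpen a PIL image (hypothetical helper)."""
    # Imported lazily so the pure-Python helper below works without Pillow.
    from PIL import ImageEnhance, ImageFilter, ImageOps

    gray = ImageOps.grayscale(image)
    # The 2.0 contrast factor is an assumed value, not taken from the skill.
    contrasted = ImageEnhance.Contrast(gray).enhance(2.0)
    return contrasted.filter(ImageFilter.SHARPEN)


def words_with_confidence(data, min_conf=0.0):
    """Convert a pytesseract image_to_data dict into word records.

    Tesseract reports a confidence of -1 for boxes that hold no word,
    so the default threshold of 0 drops them.
    """
    words = []
    for text, conf in zip(data["text"], data["conf"]):
        conf = float(conf)  # some pytesseract versions return strings
        if text.strip() and conf >= min_conf:
            words.append({"text": text, "confidence": conf})
    return words


def ocr_to_json(path, lang="eng"):
    """Run OCR on an image file and return JSON text (assumed layout)."""
    import pytesseract
    from PIL import Image

    with Image.open(path) as img:
        data = pytesseract.image_to_data(
            preprocess(img), lang=lang, output_type=pytesseract.Output.DICT
        )
    words = words_with_confidence(data)
    full_text = " ".join(w["text"] for w in words)
    return json.dumps({"text": full_text, "words": words}, indent=2)
```

Keeping the confidence filtering in its own function makes it easy to raise min_conf when downstream analytics should ignore low-confidence words.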
Quick Start
- Basic local OCR: python3 scripts/ocr.py <image_file> <output_file>
- OCR with language: python3 scripts/ocr.py image.png text.txt --lang eng
- OCR from URL: python3 scripts/ocr_url.py "https://example.com/image.jpg" text.txt --lang eng
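For readers curious how OCR from a URL could work, here is a hedged sketch. It is not the contents of scripts/ocr_url.py; the function names, the 30-second timeout, and the extension check are assumptions. The extension list mirrors the input formats named under Core Features.

```python
import os
from urllib.parse import urlparse

# Extensions taken from the formats listed in this document.
SUPPORTED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".gif", ".bmp", ".tiff", ".webp"}


def has_supported_extension(url):
    """Return True if the URL path ends in a supported image extension."""
    path = urlparse(url).path  # ignores any query string or fragment
    extension = os.path.splitext(path)[1].lower()
    return extension in SUPPORTED_EXTENSIONS


def ocr_from_url(url, lang="eng", timeout=30):
    """Download an image and return its recognized text (hypothetical helper)."""
    if not has_supported_extension(url):
        raise ValueError(f"Unsupported image type: {url}")

    # Third-party imports are kept local so the check above has no dependencies.
    import io

    import pytesseract
    import requests
    from PIL import Image

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # surface HTTP errors instead of OCR-ing an error page
    with Image.open(io.BytesIO(response.content)) as img:
        return pytesseract.image_to_string(img, lang=lang)
```

Checking the extension before downloading is a cheap guard; a stricter variant could inspect the response's Content-Type header instead.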
Dependency Matrix
Required Modules
- pytesseract
- Pillow
- requests
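Before running the scripts, it can help to confirm that the required modules are importable. The check below is an illustrative sketch, not part of the skill; the function name is hypothetical. Note that Pillow is imported as PIL, and that pytesseract additionally needs the Tesseract binary, which this check does not cover.

```python
import importlib.util

# Maps each package name to the module name it is imported under.
REQUIRED_MODULES = {
    "pytesseract": "pytesseract",
    "Pillow": "PIL",
    "requests": "requests",
}


def missing_packages(required=REQUIRED_MODULES):
    """Return the package names whose modules cannot be found."""
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages: " + ", ".join(missing))
    else:
        print("All required modules are available.")
```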
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: ocr
Download link: https://github.com/trpc-group/trpc-agent-go/archive/main.zip#ocr
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.