ocr

Official

Turn images into editable text with OCR.

Authortrpc-group
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This skill automates the extraction of text from images, removing the tedious manual transcription step and enabling rapid data capture.

Core Features & Use Cases

  • Language support and multi-format input: OCR on PNG/JPG/JPEG/GIF/BMP/TIFF/WEBP with multi-language capability.
  • Preprocessing for accuracy: Optional image preprocessing (grayscale, contrast, sharpening) to improve results.
  • Output formats: Plain text or JSON with confidence scores for downstream analytics.
  • Use Case: Digitize receipts, scanned documents, or research images to extract searchable text.

Quick Start

Basic local OCR: python3 scripts/ocr.py <image_file> <output_file> OCR with language: python3 scripts/ocr.py image.png text.txt --lang eng OCR from URL: python3 scripts/ocr_url.py "https://example.com/image.jpg" text.txt --lang eng

Dependency Matrix

Required Modules

pytesseractPillowrequests

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ocr
Download link: https://github.com/trpc-group/trpc-agent-go/archive/main.zip#ocr

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.