ifly-pdf&image-ocr
Official
Turn images and PDFs into editable text instantly.
Author: iflytek
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
It runs OCR on images and PDFs to extract text and converts PDFs into editable formats, so that documents in multiple languages can be digitized and their content reused quickly.
Core Features & Use Cases
- Image OCR (LLM OCR): multi-language text extraction with layout preservation from images.
- PDF OCR: extract text from PDFs and convert to Word, Markdown, or JSON with page-level outputs.
- Document conversion: convert PDFs into editable formats while preserving structure.
- Use Case: quickly digitize contracts or invoices and repurpose content in Word or Markdown.
Quick Start
Run the image_ocr.py script on an image to extract text, or run pdf_ocr.py on a PDF to extract text and convert it to Word or Markdown.
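The listing does not document the scripts' command-line arguments, so the following is only a minimal sketch of how the two scripts might be dispatched by file type. The script names and the scripts folder come from this page; passing the input file as a single positional argument is an assumption, as are the helper names build_ocr_command and run_ocr. Check each script's own help output before relying on it.

```python
import subprocess
import sys
from pathlib import Path

# Script names are taken from the Quick Start text.
# The argument layout (one positional input path) is an ASSUMPTION.
SCRIPTS_BY_SUFFIX = {".pdf": "pdf_ocr.py"}
DEFAULT_SCRIPT = "image_ocr.py"


def build_ocr_command(input_path, scripts_dir="scripts"):
    """Pick the OCR script by file extension and return the argv list."""
    path = Path(input_path)
    script = SCRIPTS_BY_SUFFIX.get(path.suffix.lower(), DEFAULT_SCRIPT)
    return [sys.executable, str(Path(scripts_dir) / script), str(path)]


def run_ocr(input_path, scripts_dir="scripts"):
    """Run the selected script and return whatever it prints to stdout."""
    result = subprocess.run(
        build_ocr_command(input_path, scripts_dir),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the script exits non-zero
    )
    return result.stdout
```

Under these assumptions, run_ocr("contract.pdf") would launch pdf_ocr.py, while any other extension falls through to image_ocr.py.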
Dependency Matrix
Required Modules
requests
Components
scripts
💻 Claude Code Installation
Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: ifly-pdf&image-ocr
Download link: https://github.com/iflytek/iFly-Skills/archive/main.zip#ifly-pdf-image-ocr
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
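For a manual install, the same steps (download, extract, copy into .claude/skills/) can be sketched in Python. This is an illustration only: it assumes the archive contains a folder named ifly-pdf-image-ocr, as the fragment at the end of the download link suggests, and the actual archive layout is not confirmed by this page. The function name extract_skill is hypothetical.

```python
import io
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_bytes, skill_name, dest_root):
    """Copy every file under a folder named `skill_name` to dest_root/skill_name.

    ASSUMPTION: the archive holds the skill in a folder called `skill_name`
    somewhere in its tree. Files outside that folder are ignored.
    """
    target = Path(dest_root) / skill_name
    written = []
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            parts = PurePosixPath(info.filename).parts
            # Skip directory entries and anything not inside the skill folder.
            if info.is_dir() or skill_name not in parts[:-1]:
                continue
            relative = parts[parts.index(skill_name) + 1:]
            if ".." in relative:
                continue  # ignore entries that try to escape the target folder
            out_path = target.joinpath(*relative)
            out_path.parent.mkdir(parents=True, exist_ok=True)
            out_path.write_bytes(archive.read(info))
            written.append(out_path)
    return written
```

To use it, fetch the archive bytes first (for example with urllib.request.urlopen(url).read() on the download link above) and call extract_skill(data, "ifly-pdf-image-ocr", ".claude/skills"). An empty return value means the assumed folder name was not found in the archive.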