ifly-pdf&image-ocr

Official

Turn images and PDFs into editable text instantly.

Authoriflytek
Version1.0.0
Installs0

System Documentation

What problem does it solve?

OCRing images and PDFs to extract text and convert PDFs into editable formats, enabling quick digitization and content reuse across languages.

Core Features & Use Cases

  • Image OCR (LLM OCR): multi-language text extraction with layout preservation from images.
  • PDF OCR: extract text from PDFs and convert to Word, Markdown, or JSON with page-level outputs.
  • Document conversion: convert PDFs into editable formats while preserving structure.
  • Use Case: quickly digitize contracts or invoices and repurpose content in Word or Markdown.

Quick Start

Run the image_ocr.py script on an image to extract text or run pdf_ocr.py on a PDF to extract text and convert it to Word or Markdown.

Dependency Matrix

Required Modules

requests

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ifly-pdf&image-ocr
Download link: https://github.com/iflytek/iFly-Skills/archive/main.zip#ifly-pdf-image-ocr

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.