parse-doc

Name: parse-doc
Availability: InStock
Author: ejoongseok

Community

Turn office docs into usable Markdown

Content & Communication #ocr #pdf #markdown #document conversion #excel #pptx #office tools

Authorejoongseok

Version1.0.0

Installs0

System Documentation

What problem does it solve?

This Skill removes the manual effort of converting scattered office documents into consistent, searchable Markdown, especially when files contain embedded text, tables, or scanned content.

Core Features & Use Cases

Document-to-Markdown conversion by file type: Converts HWP/HWPX/PDF/XLSX/DOCX/PPTX plus legacy XLS/PPT/ODT/ODP/ODS into Markdown saved under parsed output.
OCR for scanned PDFs (and image-based documents): Automatically runs OCR when text extraction appears empty, producing page-wise Markdown.
Image extraction and multimodal interpretation support: Extracts images from PDFs and PPTX and leaves Markdown references so you can interpret them with Claude’s multimodal reading.
Batch and pattern-based parsing: Supports parsing a single file, all files, or matches like *.pdf, with skip behavior for already-parsed outputs.

Quick Start

Place your file in .local.claude/docs/original/ as meeting-notes.pdf, then run /parse-doc meeting-notes.pdf to generate .local.claude/docs/parsed/meeting-notes.md.

parse-doc

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper