Name: docx-reader
Availability: InStock
Author: Stratio

System Documentation

What problem does it solve?

Ingest and extract content from Word documents to obtain prose, tables, images, metadata, comments, and tracked changes, enabling faster content analysis, indexing, and governance of documentation.

Core Features & Use Cases

Two-mode extraction: quick mode for fast, one-shot outputs with a deterministic fallback to a thorough deep mode when needed.
Rich content extraction: text, tables, images, core metadata, and surfaced tracked changes or comments when present.
Legacy support: converts older binary .doc files to modern .docx for reliable parsing.
Markdown output: produces Markdown-ready results suitable for feeding LLMs and downstream pipelines.
Use case: ingest policy documents or contracts into governance workflows with structured outputs.

Quick Start

Run the quick_extract.py script on a DOCX document to obtain a Markdown-formatted summary.

Please help me install this Skill: Name: docx-reader Download link: https://github.com/Stratio/genai-agents/archive/main.zip#docx-reader Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

docx-reader

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper