wiki-chunk

Community

Chunk transcripts into topic JSON for fast search.

Author: cdeistopened
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Chunk transcripts into semantic topic segments to enable fast search, structured retrieval, and efficient downstream embedding for retrieval-augmented workflows.

Core Features & Use Cases

  • Semantic chunking: split transcripts into topic-based chunks with timestamps, topic types, and key entities.
  • Gemini-powered: leverages Gemini to generate coherent topic boundaries and context.
  • Output format: emits data/chunks/{episode_id}.json with a structured array of chunks for embedding and indexing.
  • Use case: support RAG pipelines and search over large transcript corpora by surfacing relevant topic segments quickly.
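
To make the output format above more concrete, here is a minimal sketch of how one chunk file might be written. Only the data/chunks/{episode_id}.json path comes from this page. The `Chunk` class, its field names (`start`, `end`, `topic_type`, `entities`, `text`), and the `write_chunks` helper are assumptions for illustration, so the skill's actual schema may differ.

```python
import json
from dataclasses import dataclass, asdict, field
from pathlib import Path


@dataclass
class Chunk:
    # Hypothetical schema: these field names are illustrative, not the skill's own.
    start: str        # timestamp where the topic begins, e.g. "00:00:00"
    end: str          # timestamp where the topic ends
    topic_type: str   # label for the kind of topic, e.g. "intro"
    entities: list[str] = field(default_factory=list)  # key entities mentioned
    text: str = ""    # transcript text covered by this chunk


def write_chunks(episode_id: str, chunks: list[Chunk],
                 root: Path = Path("data/chunks")) -> Path:
    """Write one episode's chunks as a JSON array to root/{episode_id}.json."""
    root.mkdir(parents=True, exist_ok=True)
    out_path = root / f"{episode_id}.json"
    payload = [asdict(chunk) for chunk in chunks]
    out_path.write_text(json.dumps(payload, indent=2), encoding="utf-8")
    return out_path
```

Keeping one file per episode, each holding a flat array, means a downstream embedding job can process episodes independently and re-run a single episode without touching the rest.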

Quick Start

Run the wiki-chunk pipeline to generate topic-based JSON chunks from transcripts.
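
Once the pipeline has produced chunk files, they can be consumed with a few lines of code. The sketch below assumes each file is a JSON array of objects and that each object carries an `entities` list. That key name, and the `load_chunks` and `find_by_entity` helpers, are hypothetical and not part of the skill.

```python
import json
from pathlib import Path


def load_chunks(chunks_dir):
    """Yield (episode_id, chunk) pairs from every JSON file in chunks_dir."""
    for path in sorted(Path(chunks_dir).glob("*.json")):
        # The file stem is taken as the episode id, per the {episode_id}.json naming.
        for chunk in json.loads(path.read_text(encoding="utf-8")):
            yield path.stem, chunk


def find_by_entity(chunks_dir, name):
    """Return chunks whose (assumed) 'entities' list contains name, ignoring case."""
    wanted = name.casefold()
    return [
        (episode_id, chunk)
        for episode_id, chunk in load_chunks(chunks_dir)
        if any(wanted == str(entity).casefold() for entity in chunk.get("entities", []))
    ]
```

For example, `find_by_entity("data/chunks", "Gemini")` would return every chunk that lists that entity, together with the episode it came from. A real retrieval setup would more likely embed the chunk text and query a vector index; the exact-match filter here is only the simplest stand-in.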

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install the skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: wiki-chunk
Download link: https://github.com/cdeistopened/skill-stack-skills/archive/main.zip#wiki-chunk

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
