tts-gen

Community

Turn Chinese SRT subtitles into synced WAV audio.

Author: zlovexforever
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill eliminates the manual effort of recording and aligning Chinese voice audio to subtitle timestamps by generating a single time-synchronized WAV track directly from a Chinese SRT file.

Core Features & Use Cases

  • Chinese SRT → Time-synced TTS WAV: Synthesizes speech for each subtitle entry and places each segment into the correct timeline position.
  • Duration correction for synchronization: Measures the duration of each synthesized segment, then speeds it up (by at most 1.3x) when it overruns the subtitle slot or pads it with silence when it falls short.
  • Edge TTS voice selection: Supports multiple Microsoft Edge TTS Chinese voice options (default: zh-CN-XiaoxiaoNeural), making it easy to match a desired narration style.
  • Use Case: After translating subtitles to Chinese (ZH SRT), generate the dubbed narration WAV aligned to the translated SRT so it can be used in a YouTube dubbing workflow.
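
The duration-correction rule described above can be sketched in plain Python. This is a minimal illustration, not the Skill's actual code: the names `FitPlan` and `plan_fit` are hypothetical, and reporting the leftover overrun when 1.3x is still not enough is an assumption, since the listing does not say how that case is handled.

```python
from typing import NamedTuple

MAX_SPEED = 1.3  # upper bound on speed-up, as stated in the feature list


class FitPlan(NamedTuple):
    """Hypothetical result type: how to fit one segment into its slot."""
    speed: float      # playback speed factor to apply (1.0 = unchanged)
    pad_ms: int       # silence to append after the segment
    overflow_ms: int  # time still overrunning the slot (assumed behavior)


def plan_fit(synth_ms: int, slot_ms: int, max_speed: float = MAX_SPEED) -> FitPlan:
    """Decide whether a synthesized segment needs speeding up or padding."""
    if slot_ms <= 0:
        raise ValueError("slot_ms must be positive")

    if synth_ms <= slot_ms:
        # Segment is short enough: keep its speed and fill the gap with silence.
        return FitPlan(speed=1.0, pad_ms=slot_ms - synth_ms, overflow_ms=0)

    # Segment overruns the slot: speed it up, but never beyond max_speed.
    speed = min(synth_ms / slot_ms, max_speed)
    fitted_ms = round(synth_ms / speed)
    return FitPlan(speed=speed, pad_ms=0, overflow_ms=max(0, fitted_ms - slot_ms))


if __name__ == "__main__":
    print(plan_fit(2000, 3000))  # short segment: padded with silence
    print(plan_fit(3000, 2500))  # mild overrun: sped up to fit
    print(plan_fit(4000, 2000))  # large overrun: capped at max_speed
```

Keeping the decision separate from the audio processing makes the rule easy to check without synthesizing any speech. The resulting speed factor and padding would then be applied by whatever audio tooling the pipeline uses.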

Quick Start

Run the tts-gen skill to generate a synchronized dubbed audio file from your Chinese subtitle file named video_en_zh.srt.
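
To illustrate what the Skill has to read from a file such as video_en_zh.srt, here is a rough sketch of turning SRT text into timeline slots using only the standard library. The `Cue` type and `parse_srt` function are hypothetical names for illustration, and the Skill's real parser may differ.

```python
import re
from typing import List, NamedTuple


class Cue(NamedTuple):
    """Hypothetical container for one subtitle entry."""
    start_ms: int
    end_ms: int
    text: str


# Matches an SRT timing line such as "00:00:01,000 --> 00:00:03,500".
_TIMING = re.compile(
    r"(\d+):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2})[,.](\d{3})"
)


def _to_ms(hours: str, minutes: str, seconds: str, millis: str) -> int:
    """Convert timestamp fields to milliseconds."""
    return ((int(hours) * 60 + int(minutes)) * 60 + int(seconds)) * 1000 + int(millis)


def parse_srt(content: str) -> List[Cue]:
    """Parse SRT text into cues; blocks without a timing line are skipped."""
    cues: List[Cue] = []
    normalized = content.replace("\r\n", "\n").strip()
    # SRT entries are separated by blank lines.
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        for index, line in enumerate(lines):
            match = _TIMING.search(line)
            if match:
                fields = match.groups()
                # Everything after the timing line is subtitle text.
                text = " ".join(part.strip() for part in lines[index + 1:] if part.strip())
                cues.append(Cue(_to_ms(*fields[:4]), _to_ms(*fields[4:]), text))
                break
    return cues


if __name__ == "__main__":
    sample = "1\n00:00:01,000 --> 00:00:03,500\n你好，世界\n\n2\n00:00:04,000 --> 00:00:06,000\n欢迎收看\n"
    for cue in parse_srt(sample):
        print(cue.start_ms, cue.end_ms, cue.text)
```

Each cue's `end_ms - start_ms` gives the slot that the synthesized speech for that entry needs to fill.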

Dependency Matrix

Required Modules

edge-tts, ffmpeg, pydub
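
Before running the Skill, it may help to confirm that these dependencies are present. The sketch below is one possible check and not part of the Skill itself. It assumes the Python packages are importable as `edge_tts` and `pydub` and that ffmpeg is available as an `ffmpeg` executable on PATH. The function name `missing_dependencies` is hypothetical.

```python
import importlib.util
import shutil
from typing import List, Sequence


def missing_dependencies(
    modules: Sequence[str] = ("edge_tts", "pydub"),
    binaries: Sequence[str] = ("ffmpeg",),
) -> List[str]:
    """Return the names of modules and executables that cannot be found."""
    missing: List[str] = []
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(name) is None:
            missing.append(name)
    for name in binaries:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(name) is None:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing dependencies:", ", ".join(missing))
    else:
        print("All required modules found.")
```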

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: tts-gen
Download link: https://github.com/zlovexforever/zlx-claude-skills/archive/main.zip#tts-gen

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
