carocut-media-audio

Official

Automate AI voiceovers, BGM, and SFX for videos.

Author: bilibili
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill streamlines audio production for video projects: it generates voiceovers via TTS, automatically retrieves background music and sound effects, and produces timing data to synchronize audio with visuals.

Core Features & Use Cases

  • Voiceover generation: Generate TTS-based voice tracks with Edge TTS according to storyboard pacing.
  • BGM and SFX retrieval: Download royalty-free music and effects from Freesound to match mood and pacing.
  • Durations and timing: Produce durations.json for frame-accurate audio-visual synchronization in Remotion.
  • Incremental updates: Re-generate only changed VO segments while preserving overall timing when needed.
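One way the incremental-update behavior could work is to fingerprint each voiceover segment and re-generate only those whose fingerprint changed. The following is a minimal sketch, not the Skill's actual implementation: the segment fields (`id`, `text`, `voice`), the helper names, and the stored-fingerprint mapping are all hypothetical, and the voice name is only an example value.

```python
import hashlib


def segment_fingerprint(text: str, voice: str) -> str:
    """Hash the inputs that determine a segment's audio (assumed: text and voice)."""
    payload = f"{voice}\n{text}".encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def changed_segments(segments, previous):
    """Return ids of segments whose fingerprint differs from the stored one.

    `segments` is a list of dicts with hypothetical keys id/text/voice;
    `previous` maps segment id -> fingerprint from the last run.
    """
    changed = []
    for seg in segments:
        fp = segment_fingerprint(seg["text"], seg["voice"])
        if previous.get(seg["id"]) != fp:
            changed.append(seg["id"])
    return changed


# Hypothetical storyboard segments.
segments = [
    {"id": "s1", "text": "Welcome to the demo.", "voice": "en-US-AriaNeural"},
    {"id": "s2", "text": "Here is the second scene.", "voice": "en-US-AriaNeural"},
]

# Fingerprints recorded on a previous run.
previous = {s["id"]: segment_fingerprint(s["text"], s["voice"]) for s in segments}

# Edit one segment; only that segment should need new TTS audio.
segments[1]["text"] = "Here is the revised second scene."
print(changed_segments(segments, previous))
```

Segments not reported as changed can keep their existing audio files and durations, so only the edited segment's timing needs to be recomputed.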

Quick Start

Run the audio workflow to generate voiceovers, fetch BGM and SFX, and produce durations.json.
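To illustrate how per-segment durations might be turned into frame offsets for Remotion, here is a minimal sketch. The schema of durations.json is not documented in this listing, so the mapping of segment id to seconds, the 30 fps default, and the `build_timeline` helper are assumptions; the `from` and `durationInFrames` keys simply mirror the props of Remotion's `Sequence` component.

```python
import json
import math


def build_timeline(durations_sec, fps=30):
    """Convert per-segment durations (seconds) into consecutive frame ranges.

    Durations are rounded up so a clip is never cut off before its audio ends.
    """
    timeline = []
    start = 0
    for seg_id, seconds in durations_sec.items():
        frames = math.ceil(seconds * fps)
        timeline.append({"id": seg_id, "from": start, "durationInFrames": frames})
        start += frames  # the next segment begins where this one ends
    return timeline


# Hypothetical contents of durations.json: segment id -> length in seconds.
durations = {"s1": 2.5, "s2": 4.0}
print(json.dumps(build_timeline(durations, fps=30), indent=2))
```

With these inputs, the first segment occupies frames 0 to 74 and the second starts at frame 75 and lasts 120 frames.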

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: carocut-media-audio
Download link: https://github.com/bilibili/carocut/archive/main.zip#carocut-media-audio

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
