visual-summary-analysis

Community

Turn videos/images into smooth scene descriptions.

Authorsmyx-sunjinhui
Version1.0.0
Installs0

System Documentation

What problem does it solve?

When you have a video clip or image and need an accurate, natural-language summary of what’s happening, this Skill removes the need for manual description work by automatically generating a coherent scene narrative.

Core Features & Use Cases

  • Scene understanding & description generation: Analyzes uploaded video/image content and outputs a smooth, logically coherent scene description.
  • Multimodal reasoning for key elements: Identifies key visual elements such as subjects/objects, environment/background, actions/behaviors, and lighting/atmosphere, then expresses them in Chinese.
  • History report listing via cloud API: Supports keyword-triggered retrieval of historical visual summary reports and returns results as a Markdown table with clickable report links.

Quick Start

Send a request with a video or image attachment and ask for a visual summary scene description, e.g., "请对我上传的视频内容做视觉摘要智述分析并生成一段场景描述。"

Dependency Matrix

Required Modules

requests

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: visual-summary-analysis
Download link: https://github.com/smyx-sunjinhui/smyx-open-claw-skills/archive/main.zip#visual-summary-analysis

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.