Name: qwencloud-vision
Author: QwenCloud

System Documentation

What problem does it solve?

Analyze images and videos using Qwen VL and VL-OCR models to understand scenes, extract text, answer questions, and produce structured outputs for automation and agents.

Core Features & Use Cases

Image and video understanding (including thinking-mode support) for descriptions, Q&A, and reasoning.
OCR text extraction with structured data outputs and language support.
Multi-image comparison and visual reasoning for charts, scenes, and visual problems.
JSON Schema or JSON object outputs for easy integration with pipelines and agents.
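As a minimal sketch of how a pipeline might consume a JSON object output, the snippet below parses a response string and checks a few required fields before passing the data on. The field names (`invoice_number`, `total`) and the sample response are hypothetical and chosen only for illustration; the skill's actual output shape depends on the prompt or schema you supply.

```python
import json

# Hypothetical field names for illustration only; the real output
# shape depends on the prompt or schema given to the model.
REQUIRED_FIELDS = {"invoice_number": str, "total": (int, float)}


def parse_structured_output(raw: str) -> dict:
    """Parse a JSON-object response and check required fields and types."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object at the top level")
    for name, expected_type in REQUIRED_FIELDS.items():
        if name not in data:
            raise ValueError(f"missing field: {name}")
        if not isinstance(data[name], expected_type):
            raise ValueError(f"wrong type for field: {name}")
    return data


# Assumed sample response, not real model output.
sample = '{"invoice_number": "INV-001", "total": 42.5}'
result = parse_structured_output(sample)
print(result["total"])
```

Validating at the boundary like this lets downstream automation fail early with a clear message instead of propagating a malformed record.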

Quick Start

Describe an image or video by running python scripts/analyze.py with a prompt and the media file.
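Before handing a file to the script, a caller may want to confirm that it looks like an image or a video. The helper below is a hedged sketch using only Python's standard `mimetypes` module; `classify_media` is a hypothetical name and is not part of the skill, and the check relies on the file extension rather than the file's contents.

```python
import mimetypes


def classify_media(path: str) -> str:
    """Return "image" or "video" based on the file extension.

    Hypothetical pre-flight helper; not part of the skill itself.
    """
    mime, _ = mimetypes.guess_type(path)
    if mime is None:
        raise ValueError(f"cannot determine media type for: {path}")
    kind = mime.split("/")[0]
    if kind not in ("image", "video"):
        raise ValueError(f"unsupported media type {mime} for: {path}")
    return kind


print(classify_media("photo.png"))
print(classify_media("clip.mp4"))
```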

Please help me install this Skill:
Name: qwencloud-vision
Download link: https://github.com/QwenCloud/qwencloud-ai/archive/main.zip#qwencloud-vision
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

