Name: VLM
Availability: InStock
Author: zakarooo

System Documentation

What problem does it solve?

This Skill enables vision-based AI chat, allowing teams to analyze and describe visual content through natural language within a backend-driven workflow.

Core Features & Use Cases

Vision chat with image_url and base64 image support for multimodal analysis.
Build conversational interfaces that describe, summarize, or answer questions about images.
Use cases include product image descriptions, accessibility alt text generation, and visual QA in apps.

Quick Start

Open the skill's script at skills/VLM/scripts/vlm.ts and run it with a sample image URL to see a basic vision analysis flow. Example CLI usage for testing: z-ai vision --prompt "Describe this image" --image "https://example.com/photo.jpg" You can adapt the script to your own image sources and prompts; ensure backend usage of the z-ai-web-dev-sdk as shown in the examples.

Please help me install this Skill: Name: VLM Download link: https://github.com/zakarooo/dataup/archive/main.zip#vlm Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

VLM

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper