ai-vision-cli

Community

AI vision CLI for image and video analysis.

Authortan-yong-sheng
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Analyze images and videos with AI vision models to detect objects with bounding boxes, compare multiple images, audit design compliance, and analyze video content using Google Gemini or Vertex AI. Supports CLI and MCP modes to streamline developer workflows.

Core Features & Use Cases

  • Detect objects in images with bounding boxes and confidence scores
  • Compare multiple images to identify visual differences and regressions
  • Audit design quality and accessibility attributes across UI assets
  • Analyze video content frame-by-frame or in segments using Gemini or Vertex AI
  • Operate in both CLI mode and MCP server integration for automation

Quick Start

Install and run ai-vision-cli to analyze an image or video with Google Gemini or Vertex AI.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ai-vision-cli
Download link: https://github.com/tan-yong-sheng/ai-vision-mcp/archive/main.zip#ai-vision-cli

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 510,000+ vetted skills library on demand.