minimax-m3-multimodal-input

Community

Ground multimodal inputs for grounded decisions.

Authormadebyaris
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Ground multimodal inputs (images and videos) to anchor visual claims in task reasoning.

Core Features & Use Cases

  • Ground visual claims by reading attached media and citing exact file paths.
  • Compare pre- and post-state media to verify UI or design changes.
  • Scope includes UI reviews, bug reports, and design parity checks with screenshots or clips.

Quick Start

Instruct the model to read the attached media and generate a grounded visual-fidelity verdict.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: minimax-m3-multimodal-input
Download link: https://github.com/madebyaris/advance-minimax-m3-cursor-rules/archive/main.zip#minimax-m3-multimodal-input

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 510,000+ vetted skills library on demand.