robotwin-image-to-env
CommunityTurn an image into a RoboTwin env.
Software Engineering#rendering#robotwin#image-to-env#asset-matching#tabletop-localization#pose-preset
AuthorGeuan666
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This skill converts a single image into a reviewed RoboTwin environment by detecting visible tabletop objects, matching assets from a built-in library, localizing anchors on the tabletop with locate MCP, drafting the layout, and generating a renderable first frame.
Core Features & Use Cases
- Object discovery on the tabletop using qwen3-vl-plus, followed by per-object cropping.
- Asset candidate retrieval from the built-in multimodal asset library via qwen3-vl-embedding, with instance-level or class-level matching.
- Pose preset selection per object using asset docs, then authoritative localization via locate-tabletop-contacts MCP.
- Estimation of a conservative tabletop footprint and projection into a RoboTwin layout, followed by a render pass and VLM-based refinement.
- Honest reporting for unresolved objects and generation of a draft RoboTwin env for user review.
Quick Start
Provide a source image; the system will generate a draft RoboTwin env and a first render for review.
Dependency Matrix
Required Modules
httpxPillownumpy
Components
scriptsreferencesassets
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: robotwin-image-to-env Download link: https://github.com/Geuan666/skills-robotwin/archive/main.zip#robotwin-image-to-env Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.