gemini-imagegen

Community

Generate & edit images with Gemini AI, effortlessly.

Authorsidsarasvati
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This skill provides a powerful, flexible interface to Google's Gemini API for all image generation and manipulation needs. It simplifies complex tasks like creating images from text, editing existing visuals, applying style transfers, or composing multiple images, saving designers and content creators significant time and effort.

Core Features & Use Cases

  • Text-to-Image Generation: Create high-quality images from simple text prompts, supporting various styles and resolutions up to 4K.
  • Image Editing & Refinement: Modify existing images with conversational instructions or through iterative multi-turn chat for precise adjustments.
  • Advanced Composition: Combine up to 14 reference images into a single, coherent output, ideal for product mockups, group photos, or complex scenes.
  • Use Case: You need a logo for "Acme Corp" with a coffee bean motif. This skill can generate the initial logo, then refine it through chat to make the text bolder and add a blue gradient, saving you multiple design iterations.

Quick Start

Generate an image of "A cat wearing a wizard hat" and save it as output.png.

Dependency Matrix

Required Modules

google-genaipillow

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: gemini-imagegen
Download link: https://github.com/sidsarasvati/dotfiles/archive/main.zip#gemini-imagegen

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.