baoyu-danger-gemini-web

Community

Gemini-based text/image generation with vision

Authorlisposter
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This skill enables generating Gemini-based text and images through a reverse-engineered Gemini Web API. It provides a backend for text generation, image creation from prompts, vision-input via reference images, and multi-turn conversations, useful when other skills need Gemini-backed generation capabilities.

Core Features & Use Cases

  • Gemini-powered text generation with optional image generation from prompts.
  • Vision-enabled generation using reference images for contextual input.
  • Multi-turn chat support to maintain conversational context across prompts.

Quick Start

Use the Gemini Web skill to generate text or images. Examples:

  • npx -y bun ${SKILL_DIR}/scripts/main.ts "Describe a sunset over the ocean"
  • npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Create an illustration of a robot" --image robot.png
  • npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Describe this image" --reference input.png

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: baoyu-danger-gemini-web
Download link: https://github.com/lisposter/creator/archive/main.zip#baoyu-danger-gemini-web

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.