Claude Code Media Generator¶
AI-Powered Video & Image Generation
Create stunning videos and images using Google Veo & Imagen APIs
Why Claude Code Media Generator?¶
-
8 Video Generation Modes
From text-to-video to advanced video extension, style transfer, and object manipulation. All powered by Google Veo 3.1.
-
Reference Image System
Use up to 3 reference images for subject preservation or apply style transfer from any image.
-
Cloud-Native Architecture
Direct GCS storage integration. Auto-upload for seamless video extension workflows.
-
Smart Presets
Quick, Quality, Extend, Budget — choose the right balance of speed, quality, and cost.
Video Generation Modes¶
-
Text-to-Video
Generate video from text prompt only
-
Image-to-Video
Animate a single image with AI
-
First & Last Frames
Interpolate between two keyframes
-
Video Extension
Add 7 more seconds to any video
-
Reference Asset
Preserve subject across generations
-
Reference Style
Apply artistic style from an image
-
Insert Objects
Add objects using mask (Veo 3.1)
-
Remove Objects
Remove objects using mask (Veo 3.1)
Quick Start¶
The easiest way - use the /generative skill in Claude Code:
# Generate an image
/generative image "A beautiful sunset over mountains"
# Generate a video
/generative video "Ocean waves crashing on the beach"
# With options
/generative video "Cinematic sunset" --preset quality
For automation and advanced users:
# Option 1: Environment variable
export GOOGLE_API_KEY="your-api-key"
# Generate video
python video_gen.py "A cat playing in a sunlit garden"
# With quality preset
python video_gen.py "Cinematic sunset over mountains" --preset quality
Important: GCS Bucket Naming¶
Mandatory Naming Convention
When using Vertex AI API Key, your GCS bucket MUST follow this exact pattern:
gs://YOUR_PROJECT_ID-media-output/
Using any other bucket name will result in permission denied errors. This is a security boundary enforced by Vertex AI.
Example:
# ✅ Correct pattern
--storage-uri "gs://gen-lang-client-0344941103-media-output/videos/"
# ❌ Wrong - will fail
--storage-uri "gs://my-custom-bucket/videos/"
Pricing Overview¶
| Model | Type | Price/Second | 5-sec Video |
|---|---|---|---|
| veo-3.1-generate-preview | Standard | $0.35 | $1.75 |
| veo-3.1-fast-generate-preview | Fast | $0.15 | $0.75 |
| veo-3.0-generate-preview | Previous Gen | $0.25 | $1.25 |
| veo-2.0-generate-preview | Legacy | $0.35 | $1.75 |
Cost Saving Tip
Use --preset budget to automatically select the most cost-effective model for your use case.
Project Status¶
| Component | Version | Status |
|---|---|---|
| video_gen.py | v2.27 | Production Ready |
| image_gen.py | v1.4 | Stable |
| check_api.py | v2.1 | Stable |
| Documentation | v3.10 | Updated |
| Claude Code Skill | v3.2.1 | Available |
| Agent | v1.0 | Available |
| MCP Server | — | Planned |
Development Roadmap¶
-
Phase 1: Core Features ✅
- 8 video generation modes
- Image generation (Imagen/Gemini)
- Smart validation & auto-correction
- GCS storage integration
-
Phase 2: Enhanced Workflows ✅
- Video extension from URLs
- Auto FPS conversion (24fps)
- Reference image system
- Smart defaults from metadata
-
Phase 3: Claude Code ✅
/generativeunified skill- Help system (5 topics)
- Config wizard
- Agent for auto-detection
-
Phase 4: Planned Features
- Remix Mode (
--remix video.mp4) - Outpainting (
--outpaint left,right) - Camera Control (
--camera pan-left) - Audio Control (style/mood)
- Remix Mode (
Explore Documentation¶
-
Using Skills
The easiest way to generate media -
/generativeskill for Claude Code. -
Getting Started
Installation, authentication setup, and creating your first video in minutes.
-
Video Generation
All 8 modes, model comparison, reference images, and advanced features.
-
Image Generation
Text-to-image with Gemini & Imagen models, aspect ratios, and templates.
-
Advanced Usage
Direct scripts, automation, CI/CD integration for power users.
-
Guides
GCS storage, presets, troubleshooting, and more.
Made with using Google Vertex AI