Create videos where a person appears to speak or sing using only a photo and an audio file. This skill picks the best AI model for your project, whether you need a virtual presenter, a dubbed product demo, or a lip-synced character. No technical background required.
Behind the scenes the skill uses the RunComfy CLI to call models like ByteDance OmniHuman, Wan 2-7, HappyHorse, and Seedance v2. Each model handles different types of input. The skill reads your intent and chooses the right one automatically.
You can start with a written script or an existing audio recording. The result is a video with natural mouth movements and gestures. This is useful for UGC content, virtual spokespersons, or any project that needs a face to match a voice.
Global
mkdir -p ~/.claude/skills/ai-avatar-videoProject
mkdir -p .claude/skills/ai-avatar-videoSource Repository
Remotion Best Practicesremotion-dev/skills
Best practices for building videos with React and Remotion
Ai Video Generationqu-skills/skills
Make AI videos from text, images, or references using 40+ models
Remotion Renderqu-skills/skills
Render animated videos directly from your React component code using Remotion
Ai Image Generationqu-skills/skills
Create AI images quickly with over 50 models via your terminal
Image To Videoagentspace-so/runcomfy-agent-skills
Pick the right AI model to animate your image into video
Flux Kontextagentspace-so/runcomfy-agent-skills
Flux Kontext Pro edits one image part while keeping the rest identical
Nano Banana 2agentspace-so/runcomfy-agent-skills
Create rapid image drafts with strong typography using Google Nano Banana 2 on RunComfy
Gpt Image Editagentspace-so/runcomfy-agent-skills
Edit images with OpenAI GPT Image 2 on RunComfy using smart prompting patterns