🎥

ai-avatar-video

Safe 🌐 Network access

AI Talking Head Video Generation

Create professional talking head videos with AI avatars from text, audio, or images. Perfect for product demos, tutorials, and virtual presenters, with no filming required.

Supports: Claude, Codex, Claude Code (CC)
📊 70 Adequate
1. Download the skill ZIP

2. Import into Claude

   Go to Settings → Capabilities → Skills → Import a skill

3. Enable and start using

Test

Using "ai-avatar-video": Generate a talking head video from a portrait photo with a voice-over explaining a product feature.

Expected result:

Uses infsh app run bytedance/omnihuman-1-5 --input '{"image_url": "https://portrait.jpg", "audio_url": "https://speech.mp3"}' to create the video. Result: The AI avatar from portrait.jpg speaks the audio track with synchronized lip movements.
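The single-command example above can be wrapped in a small helper. This is a hypothetical sketch: the model slug (`bytedance/omnihuman-1-5`) and the `--input` JSON fields come from the example, but the URLs are placeholders you would swap for real, publicly reachable assets.

```shell
#!/bin/sh
# Build the infsh command from variables instead of hand-editing JSON.
# URLs are placeholders, not real assets.
IMAGE_URL="https://example.com/portrait.jpg"
AUDIO_URL="https://example.com/speech.mp3"

build_cmd() {
  printf "infsh app run bytedance/omnihuman-1-5 --input '%s'" \
    "{\"image_url\": \"$IMAGE_URL\", \"audio_url\": \"$AUDIO_URL\"}"
}

# Optional sanity check before spending render time (needs real URLs):
# curl -fsSIo /dev/null "$IMAGE_URL" && curl -fsSIo /dev/null "$AUDIO_URL" || exit 1

build_cmd; echo
```

Printing the command first lets you review the JSON before running it, which matters once the URLs point at large assets.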

Using "ai-avatar-video": Create a dubbed version of an educational video in a new language.

Expected result:

1. Transcribe the original video with infsh/fast-whisper-large-v3.
2. Generate speech in the new language with kokoro-tts.
3. Sync it to the original video with latentsync-1-6.

Result: the original video with AI avatars speaking the new language.
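The three-step dubbing pipeline can be sketched as a dry run that prints each planned command. The model slugs are the ones named in the expected result; the `--input` field names and the way one step's output URL feeds the next are assumptions, since the real inference.sh schemas may differ.

```shell
#!/bin/sh
# Dry-run plan for the transcribe -> TTS -> lipsync pipeline.
# URLs and the translated transcript are placeholders.
VIDEO_URL="https://example.com/lesson.mp4"
TRANSLATED_TEXT="<translated transcript goes here>"

step1="infsh app run infsh/fast-whisper-large-v3 --input '{\"audio_url\": \"$VIDEO_URL\"}'"
step2="infsh app run kokoro-tts --input '{\"text\": \"$TRANSLATED_TEXT\"}'"
step3="infsh app run latentsync-1-6 --input '{\"video_url\": \"$VIDEO_URL\", \"audio_url\": \"<tts output url>\"}'"

# Print the plan instead of executing, so each step can be reviewed first.
printf '%s\n' "$step1" "$step2" "$step3"
```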

Security audit

Safe
v1 • 2/4/2026

Documentation file containing example CLI commands. Static scanner misidentified markdown code blocks as code execution patterns. No actual security vulnerabilities found.

Files analyzed: 1
Lines analyzed: 154
Findings: 1
Total audits: 1
Audited by: claude

Quality score

Architecture: 38
Maintainability: 100
Content: 87
Community: 21
Security: 100
Spec compliance: 91

What you can build

Product demo videos

Create professional product demos with an AI presenter, no filming required. Ideal for SaaS companies showcasing features.

Educational content

Generate consistent explainer videos and tutorials. Perfect for online courses and education platforms.

Multilingual content

Dub existing videos into multiple languages, with AI avatars speaking the new language. Expand your global audience.

Try these prompts

Basic avatar video generation
Generate a talking head video from this image: [IMAGE_URL] using the specified audio: [AUDIO_URL]. Use the OmniHuman 1.5 model for best quality.
Lipsync video generation
Create a lipsync video where this static image speaks the provided audio. Use the PixVerse Lipsync model for realistic lip movements.
Multilingual dubbing
Create a dubbed version of this video in English: [VIDEO_URL] using these translated subtitles as audio: [TRANSLATED_AUDIO_URL]. Use LatentSync for video-to-audio sync.
TTS + avatar workflow
First generate speech from this text: [TEXT] using kokoro-tts. Then create a talking head video from this image: [IMAGE_URL] with the generated speech as audio.
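The last prompt chains two models, which can be sketched as a pair of commands. `kokoro-tts` and `bytedance/omnihuman-1-5` are the models named on this page; the audio URL hand-off between the steps is a placeholder, since how you capture one model's output for the next depends on your inference.sh setup.

```shell
#!/bin/sh
# Two-step TTS-then-avatar plan; text, image, and audio URLs are placeholders.
TEXT="Welcome to the product tour."
IMAGE_URL="https://example.com/portrait.jpg"

tts_cmd="infsh app run kokoro-tts --input '{\"text\": \"$TEXT\"}'"
# Assume the TTS step returned this audio URL:
AUDIO_URL="https://example.com/generated-speech.mp3"
avatar_cmd="infsh app run bytedance/omnihuman-1-5 --input '{\"image_url\": \"$IMAGE_URL\", \"audio_url\": \"$AUDIO_URL\"}'"

printf '%s\n%s\n' "$tts_cmd" "$avatar_cmd"
```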

Best practices

  • Use high-quality front-facing portrait photos with good lighting for best avatar results
  • Ensure audio is clear with minimal background noise for accurate lipsync
  • Use Claude Code's multi-step workflow for complex video generation tasks
  • Verify URL accessibility before using images or audio in commands
  • Test with small segments before generating full-length videos
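One way to follow the "test with small segments" tip is to cut a short audio sample with ffmpeg before paying for a full-length render. This is a sketch, not part of the skill itself: file names are placeholders, and the script only prints the command when ffmpeg or the input file is missing.

```shell
#!/bin/sh
# Trim the first 10 seconds of the audio for a quick lipsync test.
IN="speech.mp3"
SAMPLE="sample.mp3"
TRIM_CMD="ffmpeg -y -i $IN -t 10 -c copy $SAMPLE"   # stream copy, no re-encode

if command -v ffmpeg >/dev/null 2>&1 && [ -f "$IN" ]; then
  $TRIM_CMD
else
  echo "dry run: $TRIM_CMD"
fi
```

Generating the avatar video from `sample.mp3` first lets you judge lip-sync quality cheaply before running the full track.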

Avoid

  • Don't use low-quality images with poor lighting; avatar quality will suffer
  • Avoid using copyrighted images for commercial avatar videos without permission
  • Don't mix languages in the same video without proper dubbing workflow
  • Don't forget to save output files immediately after generation
  • Avoid using very long audio tracks without testing lip sync quality

Frequently asked questions

What is inference.sh and why do I need it?
Inference.sh is a cloud platform that provides AI model APIs. The inference.sh CLI lets you run these models from your terminal without managing infrastructure. You need it to access the OmniHuman, Fabric, and PixVerse models for avatar video generation.
Which model should I use for my avatar videos?
Use OmniHuman 1.5 for multi-character videos and best quality. Use Fabric 1.0 for static images that need to speak. Use PixVerse Lipsync for highly realistic lip sync on existing images.
Can I create videos in languages other than English?
Yes, you can create videos in any language supported by the kokoro-tts text-to-speech model. Use the dubbing workflow to create multilingual versions of existing videos.
How do I authenticate with inference.sh?
Run infsh login in your terminal. You'll need to authenticate with your inference.sh account credentials. This provides access to all available AI models.
What image formats work best for avatar videos?
Use high-resolution portrait photos (400x400 or larger) with front-facing orientation. JPG and PNG formats work well. Good lighting and neutral background produce better results.
Can I use this skill with Claude Code?
Yes, this skill is compatible with Claude, Codex, and Claude Code. You can use the command examples in SKILL.md with Claude Code to build automated video generation workflows.

Developer details

File structure

📄 SKILL.md