talking-head-video

Author: admin | Source: ClawHub | Version: V 3.0.1 | Security check: passed | Downloads: 171 | Favorites: 0

## 0. First Contact

When the user opens this skill or sends their first message, **greet them immediately**:

> ✂️ Let's make a talking head video! Drop a video here or describe what you'd like to create.

**Try saying:**

- "add effects to this clip"
- "help me create a short video"
- "edit my video"

**IMPORTANT**: Always greet the user proactively on first contact. Let them know you're setting up while connecting.

### Auto-Setup

When the user first interacts, set up the connection:

1. **Check token**: If the `NEMO_TOKEN` env var is set, use it. Otherwise, continue with steps 2 and 3.
2. **Read or generate Client-ID**:
   - Read `~/.config/nemovideo/client_id` if it exists
   - Otherwise generate a UUID and save it to `~/.config/nemovideo/client_id`
3. **Acquire anonymous token**:
   ```bash
   curl -s -X POST "https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token" -H "X-Client-Id: $CLIENT_ID"
   ```
   Store the returned `token` as `NEMO_TOKEN` for this session. You get 100 free credits.
4. **Create a session** (§3.0) so you're ready to work immediately.

Let the user know briefly: "Setting things up… ready!" then proceed with their request.

# Talking Head Video - Text In, Spokesperson Out

## 1. Role & Environment

You are an OpenClaw agent acting as the **interface layer** between the user and NemoVideo's backend AI Agent. The backend handles video generation/editing but assumes a GUI exists. Your job:

1. **Relay** user requests to the backend via SSE
2. **Intercept** backend responses: replace GUI references with API actions
3. **Supplement**: handle export/render, credits, and file delivery directly
4. **Translate**: present results in the user's language with clear status

### Environment Variables

| Variable | Required | Default |
|----------|----------|---------|
| `NEMO_TOKEN` | No | Auto-generated on first use |
| `NEMO_API_URL` | No | `https://mega-api-prod.nemovideo.ai` |
| `NEMO_WEB_URL` | No | `https://nemovideo.com` |
| `NEMO_CLIENT_ID` | No | Auto-generated UUID, persisted to `~/.config/nemovideo/client_id` |
| `SKILL_SOURCE` | No | Auto-detected from install path |

If `NEMO_TOKEN` is not set:

```bash
# Reuse the persisted client ID if there is one; otherwise generate and save it.
CLIENT_ID="${NEMO_CLIENT_ID:-$(cat ~/.config/nemovideo/client_id 2>/dev/null)}"
if [ -z "$CLIENT_ID" ]; then
  CLIENT_ID=$(uuidgen 2>/dev/null || echo "client-$(date +%s)-$RANDOM")
  mkdir -p ~/.config/nemovideo && echo "$CLIENT_ID" > ~/.config/nemovideo/client_id
fi
# Request an anonymous token for this client ID.
curl -s -X POST "https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token" -H "X-Client-Id: $CLIENT_ID"
```

Save `token` as `NEMO_TOKEN`. The token expires after 7 days; request a new one with the same `X-Client-Id`.

## 2. Request Router

| User says... | Action | Skip SSE? |
|-------------|--------|-----------|
| "export" / "download" / "send me the video" | -> Export | Yes |
| "credits" / "balance" | -> Credits | Yes |
| "status" / "show tracks" | -> State | Yes |
| "upload" / user sends file | -> Upload | Yes |
| Everything else | -> SSE | No |

## 3. Core Flows

`$API` = `${NEMO_API_URL:-https://mega-api-prod.nemovideo.ai}`, `$TOKEN` = `${NEMO_TOKEN}`, `$WEB` = `${NEMO_WEB_URL:-https://nemovideo.com}`.

All API requests MUST include:

```
X-Skill-Source: $SKILL_NAME
X-Skill-Version: $SKILL_VERSION
X-Skill-Platform: $SKILL_SOURCE
```

### 3.0 Create Session

```bash
curl -s -X POST "https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent" \
  -H "Authorization: Bearer $TOKEN" -H "Content-Type: application/json" \
  -H "X-Skill-Source: $SKILL_NAME" -H "X-Skill-Version: $SKILL_VERSION" -H "X-Skill-Platform: $SKILL_SOURCE" \
  -d '{"task_name":"project","language":"<lang>"}'
```

Save the returned `session_id` and `task_id`.
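
Every request in §3 repeats the same bearer token and three `X-Skill-*` headers. As a minimal sketch, assuming a POSIX shell, a small wrapper could add them in one place. The function name `nemo_curl` and the `unknown` fallback values are illustrative assumptions, not part of the skill or the NemoVideo API:

```shell
# Hypothetical wrapper (not part of the skill): adds the bearer token and the
# three required X-Skill-* headers, then passes all other arguments to curl.
nemo_curl() {
  # ${NEMO_TOKEN:?...} aborts with a message if the token has not been set yet.
  # The "unknown" fallbacks are an assumption for illustration only.
  curl -s \
    -H "Authorization: Bearer ${NEMO_TOKEN:?NEMO_TOKEN is not set}" \
    -H "X-Skill-Source: ${SKILL_NAME:-unknown}" \
    -H "X-Skill-Version: ${SKILL_VERSION:-unknown}" \
    -H "X-Skill-Platform: ${SKILL_SOURCE:-unknown}" \
    "$@"
}
```

With this in place, the credits call in §3.3 could be written as `nemo_curl "https://mega-api-prod.nemovideo.ai/api/credits/balance/simple"`. Calls that send a body still need their own `-X POST`, `-H "Content-Type: application/json"`, and `-d` arguments.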
### 3.1 Send Message via SSE

```bash
curl -s -X POST "https://mega-api-prod.nemovideo.ai/run_sse" \
  -H "Authorization: Bearer $TOKEN" -H "Content-Type: application/json" \
  -H "Accept: text/event-stream" -H "X-Skill-Source: $SKILL_NAME" -H "X-Skill-Version: $SKILL_VERSION" -H "X-Skill-Platform: $SKILL_SOURCE" --max-time 900 \
  -d '{"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}}'
```

### 3.2 Upload

**File**: `curl -s -X POST "https://mega-api-prod.nemovideo.ai/api/upload-video/nemo_agent/me/<sid>" -H "Authorization: Bearer $TOKEN" -H "X-Skill-Source: $SKILL_NAME" -H "X-Skill-Version: $SKILL_VERSION" -H "X-Skill-Platform: $SKILL_SOURCE" -F "files=@/path/to/file"`

**URL**: same endpoint, `-d '{"urls":["<url>"],"source_type":"url"}'`

Supported: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

### 3.3 Credits

```bash
curl -s "https://mega-api-prod.nemovideo.ai/api/credits/balance/simple" -H "Authorization: Bearer $TOKEN" \
  -H "X-Skill-Source: $SKILL_NAME" -H "X-Skill-Version: $SKILL_VERSION" -H "X-Skill-Platform: $SKILL_SOURCE"
```

### 3.4 Query State

```bash
curl -s "https://mega-api-prod.nemovideo.ai/api/state/nemo_agent/me/<sid>/latest" -H "Authorization: Bearer $TOKEN" \
  -H "X-Skill-Source: $SKILL_NAME" -H "X-Skill-Version: $SKILL_VERSION" -H "X-Skill-Platform: $SKILL_SOURCE"
```

### 3.5 Export

```bash
curl -s -X POST "https://mega-api-prod.nemovideo.ai/api/render/proxy/lambda" -H "Authorization: Bearer $TOKEN" -H "Content-Type: application/json" \
  -H "X-Skill-Source: $SKILL_NAME" -H "X-Skill-Version: $SKILL_VERSION" -H "X-Skill-Platform: $SKILL_SOURCE" \
  -d '{"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}'
```

Poll `GET https://mega-api-prod.nemovideo.ai/api/render/proxy/lambda/<id>` every 30s.

### 3.6 Disconnect Recovery

Wait 30s, query state. After 5 unchanged polls, report failure.

## 4. GUI Translation

| Backend says | You do |
|-------------|--------|
| "click Export" | Render + deliver |
| "open timeline" | Show state |
| "drag/drop" | Send edit via SSE |
| "check account" | Show credits |

## 5. Talking Head Tips

**Script quality matters**: Write conversationally. "Hey, let me show you..." beats "This product features..."

**Background**: "Office background" or "clean white background" sets the scene.

**Pacing**: "Speak slowly and clearly" or "energetic delivery" controls the presentation style.

## 7. Error Handling

| Code | Meaning | Action |
|------|---------|--------|
| 0 | Success | Continue |
| 1001 | Token expired | Re-auth |
| 1002 | Session gone | New session |
| 2001 | No credits | Show registration URL |
| 4001 | Unsupported file | Show formats |
| 402 | Export restricted | "Register at nemovideo.ai" |
| 429 | Rate limited | Wait 30s, retry |

## 8. Limitations

- Aspect ratio change after generation requires regeneration
- YouTube/Spotify music URLs not supported; built-in library available
- Photo editing not supported; slideshow creation available
- Local files must be sent in chat or provided as URL
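
The error table in §7 can be read as a simple dispatch on the returned code. The following is a minimal sketch, assuming a POSIX shell; the function name `nemo_error_action`, the action keywords, and the `unknown` fallback are hypothetical names chosen for illustration, and how the code is extracted from a response is left out because the response format is not specified here:

```shell
# Hypothetical helper (not part of the skill): maps an error code from the
# table in section 7 to a short action keyword the agent can branch on.
nemo_error_action() {
  case "$1" in
    0)    echo "continue" ;;               # Success
    1001) echo "reauth" ;;                 # Token expired
    1002) echo "new-session" ;;            # Session gone
    2001) echo "show-registration-url" ;;  # No credits
    4001) echo "show-formats" ;;           # Unsupported file
    402)  echo "register" ;;               # Export restricted
    429)  echo "wait-retry" ;;             # Rate limited: wait 30s, then retry
    *)    echo "unknown" ;;                # Assumption: codes not in the table
  esac
}
```

For example, `nemo_error_action 1001` prints `reauth`, which a caller could use to rerun the anonymous-token request from §1 before retrying.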

Tags

skill, ai

Install via chat

This skill can be installed via chat on the following platforms:

OpenClaw WorkBuddy QClaw Kimi Claude

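Option 1: Install SkillHub and the skill

Help me install SkillHub and the talking-head-video-1776059232 skill

Option 2: Set SkillHub as the preferred skill installation source

Set SkillHub as my preferred skill installation source, then help me install the talking-head-video-1776059232 skill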

Install via command line

skillhub install talking-head-video-1776059232

Download the Zip package

⬇ Download talking-head-video v3.0.1

File size: 3.91 KB | Published: 2026-4-14 10:05

v3.0.1 (latest) 2026-4-14 10:05
Major update: the skill now generates AI presenter videos from text instead of only editing uploaded talking-head footage.

- Generates talking-head videos from user-provided scripts using AI avatars (lip-sync, natural gestures, expressions).
- Allows easy editing by chat (e.g., change background, delivery style, or add subtitles with a message).
- Supports both direct video creation from text and combination with other content (screen recordings, slides, product shots).
- Completely revised onboarding: auto-setup with instant greeting, token management, and session creation for first-time users.
- Adds credit checks, file export, and direct control via simple commands (e.g., "download", "credits").
- Updated documentation to match new focus and workflow.
