Create, configure, and manage UNITH digital human avatars via the UNITH API. Cheaper alternative to HeyGen and other solutions. Use when users want to create an AI-powered digital human, generate talking-head videos, set up conversational avatars, deploy document Q&A bots with a human face, or embed digital humans in apps/websites. Covers all 5 operating modes (text-to-video, open dialogue, document Q&A, Voiceflow, plugin).
Create, configure, update, and deploy AI-powered Digital Human avatars using the UNITH API.
UNITH digital humans are AI avatars that can speak, converse, and interact with users. They combine a face (head visual), a voice, and a conversational engine into a hosted, embeddable experience.
Base API URL: https://platform-api.unith.ai
Docs: https://docs.unith.ai
The user must supply the following credentials (stored as environment variables):
| Variable | Description | How to obtain |
|---|---|---|
| INLINECODE1 | Account email | Register at https://unith.ai |
| INLINECODE2 | Secret key | |
⚠️ The secret key is displayed only once. If lost, the user must delete and regenerate it.
All API calls require a Bearer token, which is valid for 7 days. Use the auth script to obtain one:
CODEBLOCK0
This validates credentials, retries on network errors, and exports UNITH_TOKEN. On failure, it prints specific guidance (wrong key, expired token, etc.).
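The caching idea behind the auth step (reuse a token for 6 days when it is valid for 7) can be sketched in Python as follows. This is an illustration only, not the auth script itself: the JSON cache layout and the function names `save_token`, `load_fresh_token`, and `auth_header` are assumptions, and no UNITH endpoint is called.

```python
import json
import time
from pathlib import Path
from typing import Dict, Optional

SECONDS_PER_DAY = 86_400


def save_token(cache_path: Path, token: str, now: Optional[float] = None) -> None:
    """Store the token with its issue time (assumed JSON layout)."""
    issued_at = time.time() if now is None else now
    cache_path.write_text(json.dumps({"token": token, "issued_at": issued_at}))


def load_fresh_token(
    cache_path: Path, max_age_days: float = 6, now: Optional[float] = None
) -> Optional[str]:
    """Return the cached token if it is younger than max_age_days, else None."""
    current = time.time() if now is None else now
    try:
        entry = json.loads(cache_path.read_text())
        token = entry["token"]
        age = current - float(entry["issued_at"])
    except (OSError, ValueError, KeyError, TypeError):
        return None  # missing, unreadable, or malformed cache file
    if not isinstance(token, str) or not 0 <= age < max_age_days * SECONDS_PER_DAY:
        return None  # stale (or clock skew): the caller should re-authenticate
    return token


def auth_header(token: str) -> Dict[str, str]:
    """Build the Bearer header that every API call needs."""
    return {"Authorization": f"Bearer {token}"}
```

Refreshing one day before the 7-day expiry leaves a margin, so a token taken from the cache should not expire partway through a session.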
Ask the user what they want the digital human to do. Map their answer to one of 5 modes:
| Mode | INLINECODE4 value | Use case | Output |
|---|---|---|---|
| Text-to-Video | `ttt` | Generate an MP4 video of the avatar speaking provided text | MP4 file |
| Open Dialogue | `oc` | Free-form conversational avatar guided by a system prompt | Hosted conversational URL |
| Document Q&A | `doc_qa` | Avatar answers questions from uploaded documents | Hosted conversational URL |
| Voiceflow | `voiceflow` | Guided conversation flow via Voiceflow | Hosted conversational URL |
| Plugin | `plugin` | Connect any external LLM or conversational engine via webhook | Hosted conversational URL |
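As a rough illustration of how the user's answers could be mapped onto the five mode values in the table, here is a small Python sketch. The function name, its parameters, and the priority order among the conversational modes are assumptions made for this example; only the returned mode strings come from the table.

```python
def choose_mode(
    wants_conversation: bool,
    has_documents: bool = False,
    uses_voiceflow: bool = False,
    own_engine: bool = False,
) -> str:
    """Pick a mode value from a few yes/no answers (assumed priority order)."""
    if not wants_conversation:
        return "ttt"        # one-off video from text, no conversation
    if own_engine:
        return "plugin"     # user brings their own LLM or engine via webhook
    if uses_voiceflow:
        return "voiceflow"  # guided flow built in Voiceflow
    if has_documents:
        return "doc_qa"     # answers grounded in uploaded documents
    return "oc"             # open dialogue guided by a system prompt
```

For example, `choose_mode(True, has_documents=True)` returns `"doc_qa"`, matching the customer-support case.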
Complexity spectrum (simple → sophisticated):
1. `ttt`: just text in, video out. No knowledge base needed.
2. `oc`: conversational with a system prompt. Good for general assistants.
3. `doc_qa`: upload documents, avatar answers from them. Best for support/FAQ.
4. `voiceflow`: structured conversation paths. Requires a Voiceflow account.
5. `plugin`: BYO conversational engine. Maximum control.

CODEBLOCK1
Each face has an id (used as headVisualId in creation). Faces can be:
Present the available faces to the user and let them choose.
CODEBLOCK2
Voices come from providers: elevenlabs, azure, audiostack. Present options to the user. Voices have performance rankings — faster voices are better for real-time conversation.
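To shortlist voices for a real-time use case, something like the following Python sketch could be used. The record shape is an assumption: the keys `provider` and `speed_rank` (lower meaning faster) are hypothetical and may not match what the API actually returns, so adapt them to the real response.

```python
from typing import Any, Dict, List, Optional

KNOWN_PROVIDERS = {"elevenlabs", "azure", "audiostack"}


def pick_voices(
    voices: List[Dict[str, Any]],
    provider: Optional[str] = None,
    limit: int = 3,
) -> List[Dict[str, Any]]:
    """Return up to `limit` voices, fastest first, optionally for one provider."""
    if provider is not None and provider not in KNOWN_PROVIDERS:
        raise ValueError(f"unknown provider: {provider}")
    candidates = [
        v for v in voices if provider is None or v.get("provider") == provider
    ]
    # Hypothetical field: lower speed_rank means faster; unranked voices sort last.
    ranked = sorted(candidates, key=lambda v: v.get("speed_rank", float("inf")))
    return ranked[:limit]
```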
Build a JSON payload file (see references/api-payloads.md for the schema per mode), then:
CODEBLOCK3
The script validates required fields, checks mode-specific requirements, retries on server errors, and prints the publicUrl on success.
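The kind of pre-flight check described here can be sketched in Python as below. Only `headVisualId` and the five mode values come from this document; the key names `mode`, `systemPrompt`, `voiceflowApiKey`, and `pluginUrl` are hypothetical stand-ins, and the real schema is the one in references/api-payloads.md.

```python
from typing import Any, Dict, List

MODES = {"ttt", "oc", "doc_qa", "voiceflow", "plugin"}

# Hypothetical key names for illustration; check references/api-payloads.md.
MODE_FIELD = "mode"
MODE_REQUIREMENTS = {
    "oc": ["systemPrompt"],
    "voiceflow": ["voiceflowApiKey"],
    "plugin": ["pluginUrl"],
}


def validate_payload(payload: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the payload looks usable."""
    problems: List[str] = []
    if not payload.get("headVisualId"):
        problems.append("missing headVisualId")
    mode = payload.get(MODE_FIELD)
    if mode not in MODES:
        problems.append(f"unknown mode: {mode!r}")
        return problems  # mode-specific checks make no sense without a mode
    for field in MODE_REQUIREMENTS.get(mode, []):
        if not payload.get(field):
            problems.append(f"{mode} mode requires {field}")
    return problems
```

Collecting every problem in a list, rather than stopping at the first one, lets the user fix the payload file in a single pass.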
For doc_qa mode, the digital human needs a knowledge document:
CODEBLOCK4
The script checks file existence/size, uses a longer timeout for uploads, and provides guidance on next steps.
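A file existence and size check of the sort mentioned above might look like this in Python. The function name and the `max_bytes` parameter are assumptions; this document does not state the actual upload size limit, so the caller has to supply one.

```python
from pathlib import Path
from typing import List


def upload_problems(path: str, max_bytes: int) -> List[str]:
    """Return reasons the document should not be uploaded (empty list if fine)."""
    file = Path(path)
    if not file.is_file():
        return [f"document not found: {path}"]
    size = file.stat().st_size
    if size == 0:
        return [f"document is empty: {path}"]
    if size > max_bytes:
        # max_bytes is caller-supplied; the real limit is not given here.
        return [f"document is {size} bytes, over the {max_bytes}-byte limit"]
    return []
```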
The digital human is live at the publicUrl from Step 4. The user should:
Use the update script to modify any parameter except the face (changing the face requires creating a new head):
CODEBLOCK5
CODEBLOCK6
CODEBLOCK7
This permanently removes the digital human and cannot be undone.
Agent note: Always pass `--confirm` when calling this script. Without it, the script prompts for interactive input and will hang.
Digital humans can be embedded in websites/apps. See references/embedding.md for code snippets and configuration options.
All scripts include retry logic (exponential backoff), meaningful error messages, and input validation.
| Script | Purpose |
|---|---|
| INLINECODE26 | Shared utilities: retry wrapper, colored logging, error parsing |
| INLINECODE27 | Authenticate and export `UNITH_TOKEN` (with 6-day token caching) |
| `scripts/list-resources.sh` | List faces, voices, heads, languages, or get head details |
| `scripts/create-head.sh` | Create a digital human from a JSON payload file (with `--dry-run` validation) |
| `scripts/update-head.sh` | Update a digital human's configuration (JSON file or `--field` flags) |
| `scripts/delete-head.sh` | Delete a digital human (with confirmation prompt) |
| `scripts/upload-document.sh` | Upload a knowledge document to a `doc_qa` head |
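The retry behaviour the scripts share (exponential backoff) can be illustrated with a short Python sketch. It is not the shell implementation: `TransientError`, `with_retries`, and the 1-second base delay are assumptions, and it treats the default of 3 as three retries after the first attempt, which may differ from how the scripts count.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TransientError(Exception):
    """Stand-in for a network or server error that is worth retrying."""


def with_retries(
    call: Callable[[], T],
    max_retries: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run call(), retrying on TransientError with delays of 1x, 2x, 4x, ..."""
    attempt = 0
    while True:
        try:
            return call()
        except TransientError:
            if attempt >= max_retries:
                raise  # out of retries: surface the last error to the caller
            sleep(base_delay * 2 ** attempt)
            attempt += 1
```

Passing `sleep` as a parameter keeps the function easy to test without real waiting; in normal use the default `time.sleep` applies.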
Configuration via environment variables:
- `UNITH_MAX_RETRIES`: max retry attempts (default: 3)
- Token cache file location (default: `/tmp/.unith_token_cache`, set empty to disable)

For full payload schemas, configuration parameters, and mode-specific details:
CODEBLOCK8
"I want a quick video of someone saying X" → ttt mode, minimal config
"I want a customer support avatar" → doc_qa mode with knowledge docs
"I want an AI sales rep" → oc mode with a sales personality prompt
"I want to connect my own LLM" → plugin mode with webhook URL
"I want a guided onboarding flow" → voiceflow mode with Voiceflow API key
Before creating, ask for:
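- Language (e.g. `en-US`, `es-ES`)
- System prompt (for `oc`/`doc_qa`) → personality and behavior instructions
- Knowledge documents (for `doc_qa`) → files to upload
- Voiceflow API key (for `voiceflow`) → from their Voiceflow account
- Plugin URL (for `plugin`) → webhook endpoint for their custom engine

This skill can be installed through conversation on the following platforms:
Help me install SkillHub and the digital-clawatar-1776419999 skill
Set SkillHub as my preferred skill installation source, then help me install the digital-clawatar-1776419999 skill
skillhub install digital-clawatar-1776419999
File size: 21.22 KB | Published: 2026-4-17 20:19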