
ali-minimax-toolkit: MiniMax Multimodal Toolkit

MiniMax multimodal generation via API. Use when user wants voice, music, image, image-to-image, or video generation with MiniMax. Supports TTS, music, image (t2i + i2i), and video (t2v, i2v, sef, ref). Cross-platform Python scripts.

Author: admin | Source: ClawHub
Version: V 1.0.0 | Security check: Passed | Downloads: 90 | Price: Free | Favorites: 0

ali-minimax-toolkit

MiniMax Multimodal Toolkit

Generate voice, music, image, and video content via MiniMax APIs. Pure Python — works on Windows, Mac, and Linux without any third-party dependencies.

Prerequisites

  • MINIMAX_API_KEY environment variable (starts with sk-)
  • INLINECODE2 environment variable (optional, default: https://api.minimaxi.com)
  • Python 3.6+
  • For video duration detection: ffprobe (optional)
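A minimal sketch of how a script might read these settings, using only the standard library. The name `MINIMAX_BASE_URL` is a placeholder chosen for illustration, since the real name of the optional base-URL variable is not shown here, and `load_config` is a hypothetical helper rather than part of the toolkit.

```python
import os

DEFAULT_BASE_URL = "https://api.minimaxi.com"
# Hypothetical name: the actual base-URL variable name is not given in this document.
BASE_URL_ENV = "MINIMAX_BASE_URL"


def load_config(env=None):
    """Return (api_key, base_url) read from an environment-like mapping."""
    env = os.environ if env is None else env
    api_key = env.get("MINIMAX_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("MINIMAX_API_KEY is not set")
    if not api_key.startswith("sk-"):
        raise RuntimeError("MINIMAX_API_KEY should start with 'sk-'")
    # Fall back to the documented default when the optional variable is absent.
    base_url = env.get(BASE_URL_ENV, "").strip() or DEFAULT_BASE_URL
    return api_key, base_url.rstrip("/")
```

Passing a plain dict instead of `os.environ` makes the check easy to try without touching the real environment.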

Quick Start

CODEBLOCK0

Or use CLI directly:
CODEBLOCK1

Output Convention

All generated files MUST be saved to minimax-output/ under the agent's working directory.
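One way to follow this convention is sketched below. The helper `output_path` and the timestamped file-name pattern are assumptions made for illustration; only the `minimax-output/` directory name comes from the rule above.

```python
import time
from pathlib import Path

OUTPUT_DIR_NAME = "minimax-output"


def output_path(kind, extension, workdir=None):
    """Return a timestamped file path inside <workdir>/minimax-output/."""
    base = Path(workdir) if workdir is not None else Path.cwd()
    out_dir = base / OUTPUT_DIR_NAME
    # Create the directory on first use so callers can write immediately.
    out_dir.mkdir(parents=True, exist_ok=True)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    return out_dir / "{}-{}.{}".format(kind, stamp, extension.lstrip("."))
```

For example, `output_path("tts", "mp3")` yields a path such as `minimax-output/tts-<timestamp>.mp3` under the current working directory.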

TTS (Text-to-Speech)

Endpoint: POST /v1/t2a_v2. The response carries hex-encoded audio, which is decoded and saved to a file.

Models: speech-2.8-hd (recommended, best quality), speech-2.8-turbo (faster), speech-02-hd, INLINECODE10

CODEBLOCK2

Common voice IDs: female-shaonv, male-qn-qingse, male-qn-jingying, presenter_male, presenter_female
Emotions: happy, sad, angry, fearful, disgusted, surprised, calm, fluent, whisper (empty = auto)
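The request and decoding steps can be sketched roughly as follows. The payload field names (`voice_setting`, `voice_id`, `emotion`), the Bearer authorization header, and the `data.audio` response path are assumptions about the API, not details confirmed by this document; both function names are hypothetical.

```python
import json
import urllib.request


def build_tts_request(api_key, base_url, text, voice_id="female-shaonv",
                      model="speech-2.8-hd", emotion=""):
    """Build (but do not send) a POST request for /v1/t2a_v2."""
    voice_setting = {"voice_id": voice_id}
    if emotion:  # an empty emotion leaves the choice to the service
        voice_setting["emotion"] = emotion
    # Assumed payload shape; check the API reference before relying on it.
    payload = {"model": model, "text": text, "voice_setting": voice_setting}
    return urllib.request.Request(
        base_url.rstrip("/") + "/v1/t2a_v2",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Authorization": "Bearer " + api_key,
                 "Content-Type": "application/json"},
        method="POST",
    )


def decode_hex_audio(response_body):
    """Turn the hex string at data.audio (assumed path) into raw audio bytes."""
    return bytes.fromhex(response_body["data"]["audio"])
```

The request would be sent with `urllib.request.urlopen(req)`, the JSON body parsed, and the decoded bytes written to a file under minimax-output/.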

Music Generation

Endpoint: POST /v1/music_generation — lyrics required, returns audio URL. Takes 30-300 seconds.

CODEBLOCK3

Image Generation (Text-to-Image)

Endpoint: POST /v1/image_generation — returns image URLs (immediate).

CODEBLOCK4

Aspect ratios: 1:1 (default), 16:9, 4:3, 3:2, 2:3, 3:4, 9:16, INLINECODE34

Image-to-Image Generation

Endpoint: POST /v1/image_generation with image_file — generate new images from a reference.

CODEBLOCK5

Video Generation

Endpoint: POST /v1/video_generation (async) + GET /v1/query/video_generation — polling required.

CODEBLOCK6

Models: MiniMax-Hailuo-2.3 (default), MiniMax-Hailuo-2.3-Fast (i2v), MiniMax-Hailuo-02 (1080P, 10s)
Modes: t2v, i2v, sef (start-end frame), ref (subject reference)
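Because video generation is asynchronous, a client has to poll the query endpoint until the task settles. The loop below is a sketch of that pattern: `fetch_status` is a caller-supplied function that performs the GET request, and the status strings (`Success`, `Fail`) and the `file_id` key are assumptions about the response shape.

```python
import time


def poll_until_done(fetch_status, interval=10.0, timeout=900.0, sleep=time.sleep):
    """Call fetch_status() until the task succeeds, fails, or times out.

    fetch_status should perform GET /v1/query/video_generation and return
    the parsed JSON as a dict. The status values checked here are assumed.
    """
    waited = 0.0
    while True:
        result = fetch_status()
        status = str(result.get("status", "")).lower()
        if status == "success":
            return result
        if status in ("fail", "failed"):
            raise RuntimeError("video generation failed: {!r}".format(result))
        if waited >= timeout:
            raise TimeoutError("gave up after {:.0f} seconds".format(waited))
        # Any other status is treated as still in progress.
        sleep(interval)
        waited += interval
```

Injecting `sleep` keeps the loop testable without real waiting; in normal use the default `time.sleep` applies.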

Video Prompt Tips

Main subject + Scene + Movement + Camera motion + Aesthetic. For i2v: describe motion only, don't repeat what's in the image.
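The prompt formula can be expressed as a small helper. This is one possible reading of the tip, with a hypothetical function name: for i2v it keeps only the movement and camera components, on the assumption that subject, scene, and aesthetic are already visible in the image.

```python
def build_video_prompt(subject="", scene="", movement="", camera="",
                       aesthetic="", mode="t2v"):
    """Join the non-empty prompt components in the recommended order."""
    if mode == "i2v":
        # The input image already shows the subject and scene: motion only.
        parts = [movement, camera]
    else:
        parts = [subject, scene, movement, camera, aesthetic]
    return ", ".join(p.strip() for p in parts if p and p.strip())
```

For instance, a t2v call with subject "a red fox", scene "snowy forest at dawn", movement "trotting between trees", camera "slow tracking shot", and aesthetic "cinematic, soft light" joins all five parts in that order, while the same call with `mode="i2v"` keeps only the last motion-related parts.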

Generate & Send to Feishu

Use generate_and_send.py to generate content and prepare for Feishu delivery via the feishu-media skill:

CODEBLOCK7

After generation, the script outputs file paths and feishu-media send instructions. Use the feishu-media skill to actually deliver the content.

Legacy PowerShell Script

The original scripts/minimax-api.ps1 is preserved for backward compatibility but is deprecated. Use the Python scripts instead.

Error Handling

| Error Code | Meaning | Solution |
| --- | --- | --- |
| 2061 | Plan doesn't support model | Try speech-02-turbo for TTS |
| 1008 | Insufficient balance | Top up MiniMax account |
| 2013 | Invalid params | Check required fields |
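A sketch of how a client might map these codes to actionable messages. It assumes the code is reported at `base_resp.status_code` in the JSON body and that 0 means success; neither detail is stated in this document, and `MiniMaxAPIError` is a hypothetical exception type.

```python
ERROR_HINTS = {
    2061: "Plan doesn't support model: try speech-02-turbo for TTS",
    1008: "Insufficient balance: top up the MiniMax account",
    2013: "Invalid params: check required fields",
}


class MiniMaxAPIError(Exception):
    """Hypothetical exception type used only in this sketch."""

    def __init__(self, code, message):
        super().__init__("[{}] {}".format(code, message))
        self.code = code


def raise_for_status(response_body):
    """Return the body unchanged on success, otherwise raise MiniMaxAPIError.

    Assumes the code lives at base_resp.status_code and that 0 means success.
    """
    base = response_body.get("base_resp") or {}
    code = base.get("status_code", 0)
    if code == 0:
        return response_body
    # Prefer the documented hint, then the server message, then a fallback.
    hint = ERROR_HINTS.get(code) or base.get("status_msg") or "unknown error"
    raise MiniMaxAPIError(code, hint)
```

Calling `raise_for_status` right after parsing each response keeps the error handling in one place for all four generation types.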

References

See references/ folder for detailed API docs, voice catalogs, and prompt guides.

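Tags

skill ai

Install via Chat

This skill can be installed via chat on the following platforms:

OpenClaw WorkBuddy QClaw Kimi Claude

Option 1: Install SkillHub and the skill

Help me install SkillHub and the ali-minimax-toolkit-1775942374 skill

Option 2: Set SkillHub as the preferred skill installation source

Set SkillHub as my preferred skill installation source, then help me install the ali-minimax-toolkit-1775942374 skill

Install via Command Line

skillhub install ali-minimax-toolkit-1775942374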

Download

Download ali-minimax-toolkit v1.0.0 (free)

File size: 28.21 KB | Released: 2026-4-12 08:51

v1.0.0 (latest) 2026-4-12 08:51
- Initial release of ali-minimax-toolkit for MiniMax multimodal generation.
- Supports TTS, music, image (text-to-image & image-to-image), and video (text-to-video, image-to-video, start-end frame, subject reference) generation via MiniMax APIs.
- Pure Python implementation: cross-platform, no third-party dependencies required.
- Provides both Python module and CLI usage; all outputs saved to minimax-output/ directory.
- Includes quick-start guides, error handling info, API references, and Feishu integration instructions.
