返回顶部
a

acestep-lyrics-transcription音频转歌词

Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API. Outputs LRC, SRT, or JSON with word-level timestamps. Use when users want to transcribe songs, generate LRC files, or extract lyrics with timestamps from audio.

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.0.1
安全检测
已通过
814
下载量
免费
免费
0
收藏
概述
安装方式
版本历史

acestep-lyrics-transcription

Lyrics Transcription Skill

Transcribe audio files to timestamped lyrics (LRC/SRT/JSON) via OpenAI Whisper or ElevenLabs Scribe API.

API Key Setup Guide

Before transcribing, you MUST check whether the user's API key is configured. Run the following command to check:

CODEBLOCK0

This command only reports whether the active provider's API key is set or empty — it does NOT print the actual key value. NEVER read or display the user's API key content. Do not use config --get on key fields or read config.json directly. The config --list command is safe — it automatically masks API keys as *** in output.

If the command reports the key is empty, you MUST stop and guide the user to configure it before proceeding. Do NOT attempt transcription without a valid key — it will fail.

Use AskUserQuestion to ask the user to provide their API key, with the following options and guidance:

  1. 1. Tell the user which provider is currently active (openai or elevenlabs) and that its API key is not configured. Explain that transcription cannot proceed without it.
  2. Provide clear instructions on where to obtain a key:
- OpenAI: Get an API key at https://platform.openai.com/api-keys — requires an OpenAI account with billing enabled. The Whisper API costs ~$0.006/min. - ElevenLabs: Get an API key at https://elevenlabs.io/app/settings/api-keys — requires an ElevenLabs account. Free tier includes limited credits.
  1. 3. Also offer the option to switch to the other provider if they already have a key for it.
  2. Once the user provides the key, configure it using:
   cd "{project_root}/{.claude or .codex}/skills/acestep-lyrics-transcription/" && bash ./scripts/acestep-lyrics-transcription.sh config --set <provider>.api_key <KEY>
   
  1. 5. If the user wants to switch providers, also run:
   cd "{project_root}/{.claude or .codex}/skills/acestep-lyrics-transcription/" && bash ./scripts/acestep-lyrics-transcription.sh config --set provider <provider_name>
   
  1. 6. After configuring, re-run config --check-key to verify the key is set before proceeding.

If the API key is already configured, proceed directly to transcription without asking.

Quick Start

CODEBLOCK3

Prerequisites

  • - curl, jq, python3 (or python)
  • An API key for OpenAI or ElevenLabs

Script Usage

CODEBLOCK4

Post-Transcription Lyrics Correction (MANDATORY)

CRITICAL: After transcription, you MUST manually correct the LRC file before using it for MV rendering. Transcription models frequently produce errors on sung lyrics:

  • - Proper nouns: "ACE-Step" → "AC step", "Spotify" → "spot a fly"
  • Similar-sounding words: "arrives" → "eyes", "open source" → "open sores"
  • Merged/split words: "lighting up" → "lightin' nup"

Correction Workflow

  1. 1. Read the transcribed LRC file using the Read tool
  2. Read the original lyrics from the ACE-Step output JSON file
  3. Use original lyrics as a whole reference: Do NOT attempt line-by-line alignment — transcription often splits, merges, or reorders lines differently from the original. Instead, read the original lyrics in full to understand the correct wording, then scan each LRC line and fix any misrecognized words based on your knowledge of what the original lyrics say.
  4. Fix transcription errors: Replace misrecognized words with the correct original words, keeping the timestamps intact
  5. Write the corrected LRC back using the Write tool

What to Correct

  • - Replace misrecognized words with their correct original versions
  • Keep all [MM:SS.cc] timestamps exactly as-is (timestamps from transcription are accurate)
  • Do NOT add structure tags like [Verse] or [Chorus] — the LRC should only have timestamped text lines

Example

Transcribed (wrong):
CODEBLOCK5

Original lyrics reference:
CODEBLOCK6

Corrected (right):
CODEBLOCK7

Configuration

Config file: INLINECODE9

CODEBLOCK8

OptionDefaultDescription
INLINECODE10INLINECODE11Active provider: openai or INLINECODE13
INLINECODE14
lrc | Default output: lrc, srt, or json |
| openai.api_key | "" | OpenAI API key |
| openai.api_url | https://api.openai.com/v1 | OpenAI API base URL |
| openai.model | whisper-1 | OpenAI model (whisper-1 for word timestamps) |
| elevenlabs.api_key | "" | ElevenLabs API key |
| elevenlabs.api_url | https://api.elevenlabs.io/v1 | ElevenLabs API base URL |
| elevenlabs.model | scribe_v2 | ElevenLabs model |

Provider Notes

ProviderModelWord TimestampsPricing
OpenAIwhisper-1Yes (segment + word)$0.006/min
ElevenLabs
scribe_v2 | Yes (word-level) | Varies by plan |
  • - OpenAI whisper-1 is the only OpenAI model supporting word-level timestamps
  • ElevenLabs scribe_v2 returns word-level timestamps with type filtering
  • Both support multilingual transcription

Examples

CODEBLOCK9

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 acestep-lyrics-transcription-1776419933 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 acestep-lyrics-transcription-1776419933 技能

通过命令行安装

skillhub install acestep-lyrics-transcription-1776419933

下载

⬇ 下载 acestep-lyrics-transcription v1.0.1(免费)

文件大小: 8.44 KB | 发布时间: 2026-4-17 18:34

v1.0.1 最新 2026-4-17 18:34
- config.example default provider change to elevenlabs

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部