🎁 Get the FREE AI Skills Starter Guide — Subscribe →

🦀 ClawHub

Audio Video To Text

by @ivan830826

音视频转文字技能，使用 Whisper 进行语音识别。支持多种音视频格式，可输出纯文本、SRT/VTT 字幕或 JSON 格式。适用于会议记录、视频字幕生成、采访整理、播客转录等场景。

Versionv1.0.0

Use this skill with your agent

Most visitors already have an agent. Pick your environment, install or copy the workflow, then run the smoke-test prompt above.

Task-oriented agent. Great for testing AI skills end-to-end.

Local-first agent. Install skills via ClawHub CLI.

Set up OpenClaw →

Anthropic's coding agent. Paste the prompt or SKILL.md into your session.

Claude Code docs →

AI-powered IDE. Use the smoke-test prompt in Cursor Agent.

Open Cursor →

Open-source AI code assistant. Add SKILL.md as a custom tool.

Continue docs →

Agentic IDE by Codeium. Paste the prompt into Cascade.

Try Windsurf →

VS Code extension for autonomous coding with MCP tools.

Cline on GitHub →

Copilot Workspace

GitHub's AI dev environment. Suitable for code-generation skills.

Copilot Workspace →

What to do next

Skills are meant to be used inside your own AI agent. Install it, run a quick smoke test, then ask your agent to apply it to your real task.

1

Install into your agentCopy the ClawHub install command and run it where your OpenClaw/agent environment is configured.

2

Run a smoke testUse the test prompt below to confirm the skill loads and understands the workflow before relying on it.

3

Use it in your own agentPaste your actual task into Manus, OpenClaw, Claude Code, Cursor, or another agent that supports skills.

I just installed the Audio Video To Text skill. Please run a quick smoke test: explain what this skill can do, ask me for the minimum input it needs, then produce one small sample output for a realistic task.