🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain
πŸ¦€ ClawHub

Web Video Transcribe DOCX

by @c-narcissus

Offline-first workflow for turning Chinese web page video or audio into text and Word deliverables. Use when Codex needs to (1) extract playable media stream...

Versionv1.0.2
πŸ’‘ Examples

1. Run python {baseDir}/scripts/bootstrap_env.py once in the target environment. 2. For a generic web page URL, run python {baseDir}/scripts/pipeline_web_to_docx.py --output-dir

. 3. For a direct media URL, run python {baseDir}/scripts/download_url.py and then python {baseDir}/scripts/transcribe_sensevoice.py --input --output-txt --output-docx . 4. For a local media file, run python {baseDir}/scripts/transcribe_sensevoice.py --input --output-txt --output-docx . 5. If the user asks for a polished reading version rather than a raw transcript, read references/cleanup-guidelines.md, produce a refined .txt, and then render it with python {baseDir}/scripts/transcript_to_docx.py.

πŸ”’ Constraints

  • Run python {baseDir}/scripts/bootstrap_env.py before first use in a fresh environment.
  • Validate the skill folder with skill-creator/scripts/quick_validate.py.
  • Prefer testing --help and one representative happy path after changing the scripts.
  • If extraction fails on a page, capture a direct media URL with browser tooling and continue with the downloader + transcriber.
  • Do not promise support for DRM-protected streams, authenticated cookies, or sites that only expose encrypted EME playback.
  • View on ClawHub
    TERMINAL
    clawhub install web-video-transcribe-docx

    πŸ§ͺ Use this skill with your agent

    Most visitors already have an agent. Pick your environment, install or copy the workflow, then run the smoke-test prompt above.

    πŸ” Can't find the right skill?

    Search 60,000+ AI agent skills β€” free, no login needed.

    Search Skills β†’