Video To Text Transcription
by @vcarolxhberger
Get text transcripts ready to post, without touching a single slider. Upload your video files (MP4, MOV, AVI, WebM, up to 500MB), say something like "transcr...
> Send me your video files and I'll handle the AI speech transcription. Or just describe what you're after.
Try saying:
First-Time Connection
When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").
Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.
1. Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN β 100 free credits, valid 7 days.
2. Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer , Content-Type: application/json, and body {"task_name":"project","language":". Store the returned session_id for all subsequent requests.
Keep setup communication brief. Don't display raw API responses or token values to the user.
Video to Text Transcription β Convert Video Speech to Text
This tool takes your video files and runs AI speech transcription through a cloud rendering pipeline. You upload, describe what you want, and download the result.
Say you have a 10-minute interview recorded on a smartphone and want to transcribe the spoken dialogue into a text document β the backend processes it in about 1-2 minutes and hands you a 1080p MP4.
Tip: shorter clips under 5 minutes produce faster and more accurate transcripts.
clawhub install video-to-text-transcription