🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain

All Skills

13 skills total matching "transcription audio video image analysis"

πŸ¦€ ClawHub2.2k dl
Pollinations
Pollinations.ai API for AI generation and analysis - text, images, videos, audio, vision, and transcription. Use when user requests AI-powered content (text...
⭐ GitHub⭐ 35.1k
azure-ai-transcription-py
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
⭐ GitHub⭐ 35.1k
azure-ai-transcription-py
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
⭐ GitHub⭐ 35.1k
azure-ai-transcription-py
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
⭐ GitHub⭐ 1.4k
transcription-generate
ζδΊ€ιŸ³ι’‘ζˆ–θ§†ι’‘θ½¬ε†™δ»»εŠ‘οΌŒη”Ÿζˆι€ε­—η¨Ώζˆ–ε­—εΉ•δ»»εŠ‘γ€‚
⭐ GitHub⭐ 392
audio-transcription
Transcribe audio and video files into structured notes. Activate this skill when users want to transcribe recordings, meetings, podcasts, voice memos, or any audio/video content in their vault.
⭐ GitHub⭐ 376
elevenlabs-stt
ElevenLabs speech-to-text with Scribe models and forced alignment via inference.sh CLI. Models: Scribe v1/v2 (98%+ accuracy, 90+ languages). Capabilities: transcription, speaker diarization, audio event tagging, word-level timestamps, forced alignment, subtitle generation. Use for: meeting transcription, subtitles, podcast transcripts, lip-sync timing, karaoke. Triggers: elevenlabs stt, elevenlabs transcription, scribe, elevenlabs speech to text, forced alignment, word alignment, subtitle timing, diarization, speaker identification, audio event detection, eleven labs transcribe
⭐ GitHub⭐ 363
english-to-katakana-transcription
AutoSkill: Experience-Driven Lifelong Learning via Skill Self-Evolution
⭐ GitHub⭐ 363
dna-to-mrna-transcription
AutoSkill: Experience-Driven Lifelong Learning via Skill Self-Evolution
⭐ GitHub⭐ 358
english-to-katakana-transcription
Transcribes English sentences into Japanese Katakana characters based on phonetic syllables without translating the meaning.
⭐ GitHub⭐ 255
transcription
Audio/video transcription using OpenAI Whisper. Covers installation, model selection, transcript formats (SRT, VTT, JSON), timing synchronization, and speaker diarization. Use when transcribing media or generating subtitles.
πŸ¦€ ClawHub1.1k dl
Youtube Knowledge Extractor
Multimodal YouTube video analysis through both audio (transcript) and visual (frame extraction + image analysis) channels. Especially powerful for HowTo vide...
πŸ¦€ ClawHub324 dl
Echosaw Media Intelligence
Analyze video, audio, and image files using AI. Produces structured intelligence reports including transcripts, content moderation signals, sentiment analysi...