BytesAgain

Find the Right AI Skill for Any Job

Browse 26+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case → Pick My Role

All Skills: audio

26 skills in "audio" matching "extraction"

🦀 ClawHub
FFmpeg CLI
Process video and audio using FFmpeg CLI for transcoding, cutting, merging, audio extraction, thumbnails, GIFs, speed, filters, subtitles, and watermarks.
🦀 ClawHub
AudioPod
Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API
🦀 ClawHub
Boxed FFmpeg
Audio/video information extraction, format conversion, and audio extraction using FFmpeg WASM sandbox.
🦀 ClawHub
Alibabacloud Video Translation
Alibaba Cloud IMS (Intelligent Media Services) based video translation Skill. Supports subtitle extraction (ASR/OCR), translation, and speech synthesis trans...
🦀 ClawHub
cutmv
Video processing tool using FFmpeg for cutting, format conversion, compression, frame/audio extraction, watermarking, and subtitle addition.
🦀 ClawHub
Document Intelligence Mcp
Document OCR, classification, table extraction, and summarization using local AI vision. Supports invoices, contracts, forms, reports.
🦀 ClawHub
douyin-research-kit
Extract and analyze Douyin (抖音) content using yt-dlp. Supports video metadata, caption extraction, user profile analysis, music/sound info, and engagement st...
🦀 ClawHub
Local GLM OCR with llama.cpp on AIPC(no API Key)
Image OCR, text recognition, extract text from image, scan document, read image text, invoice OCR, receipt OCR, contract recognition, table extraction, busin...
🦀 ClawHub
Video Reader
Tool-driven video question answering with frame extraction, sub-agent analysis, and audio transcription
🦀 ClawHub
Music Seperator (Demucs)
Separate vocals and instrument stems from audio files with Demucs CLI. Use when the user asks for vocal extraction, accompaniment generation, stem splitting,...
🦀 ClawHub
image-ocr-local-AIPC
Image OCR, text recognition, extract text from image, scan document, read image text, invoice OCR, receipt OCR, contract recognition, table extraction, busin...
🦀 ClawHub
Meeting Summarizer
Transcribe meetings with SenseAudio ASR speaker diarization, timestamps, and meeting-note extraction workflows. Use when users need meeting transcription, me...
🦀 ClawHub
Youtube Knowledge Extractor
Multimodal YouTube video analysis through both audio (transcript) and visual (frame extraction + image analysis) channels. Especially powerful for HowTo vide...
🦀 ClawHub
Nanonets OCR
Document extraction API by Nanonets. Convert PDFs and images to markdown, JSON, or CSV with confidence scoring. Use when you need to OCR documents, extract invoice fields, parse receipts, or convert tables to structured data.
🦀 ClawHub
baml-codegen
Use when generating BAML code for type-safe LLM extraction, classification, RAG, or agent workflows - creates complete .baml files with types, functions, clients, tests, and framework integrations from natural language requirements. Queries official BoundaryML repositories via MCP for real-time patterns. Supports multimodal inputs (images, audio), Python/TypeScript/Ruby/Go, 10+ frameworks, 50-70% token optimization, 95%+ compilation success.
🦀 ClawHub
Hum2Song
Hum2Song turns a hummed or sung melody into a complete song with local audio processing, MIDI extraction, and optional AI-assisted arrangement, without uploa...
🦀 ClawHub
Veryfi Documents AI
Real-time OCR and data extraction API by Veryfi (https://veryfi.com). Extract structured data from receipts, invoices, bank statements, W-9s, purchase orders...
🦀 ClawHub
ClawHub - YouTube Downloader & Clipper
Clip and download specific time ranges or full YouTube videos in various qualities, including audio-only MP3 extraction, using precise timestamps.
🦀 ClawHub
Akashic Doc Analyzer
Parse, analyze, and extract content from documents (PDF, DOCX, PPTX, audio). Supports OCR, table extraction, and semantic chunking.
🦀 ClawHub
Invoice Scan
AI-powered invoice OCR, scanning, and data extraction. Use when: (1) user needs OCR or text extraction from invoice images, scanned documents, or PDFs, (2) s...
🦀 ClawHub
Meta Video Ad Analyzer
Extract and analyze content from video ads using Gemini Vision AI. Supports frame extraction, OCR text detection, audio transcription, and AI-powered scene analysis. Use when analyzing video creative content, extracting text overlays, or generating scene-by-scene descriptions.
🦀 ClawHub
Pub Brave
Web search and content extraction via the Brave Search API, plus 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, ch...
🦀 ClawHub
Banner Youtube Translate Workflow
Automates downloading YouTube audio, launching Doubao, playing audio, and capturing translations for full video subtitle extraction.
🦀 ClawHub
cutmv Video Tool
Perform video/audio cutting, format conversion, compression, frame/audio extraction, watermarking, and subtitle addition using FFmpeg.
🦀 ClawHub
Nex Voice
Voice note transcription and intelligent action item extraction for capture and organization of verbal communication. Record and transcribe voice notes, voic...
🦀 ClawHub
tiktok-research-kit
Extract and analyze TikTok content using yt-dlp. Supports video metadata, caption extraction, sound/music info, user profile analysis, and engagement stats....