🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain

All Skills β€” audio

18 skills in "audio" matching "processing"

πŸ¦€ ClawHub12.2k dl
PDF Text Extractor
Extract text from PDFs with OCR support. Perfect for digitizing documents, processing invoices, or analyzing content. Zero dependencies required.
πŸ¦€ ClawHub3.4k dl
AssemblyAI advanced speech transcription
Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
πŸ¦€ ClawHub2.7k dl
Parakeet Stt
Local speech-to-text with NVIDIA Parakeet TDT 0.6B v3 (ONNX on CPU). 30x faster than Whisper, 25 languages, auto-detection, OpenAI-compatible API. Use when transcribing audio files, converting speech to text, or processing voice recordings locally without cloud APIs.
πŸ¦€ ClawHub2.2k dl
Audio
Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podcast workflows.
πŸ¦€ ClawHub1.4k dl
Open WebUI
Complete Open WebUI API integration for managing LLM models, chat completions, Ollama proxy operations, file uploads, knowledge bases (RAG), image generation, audio processing, and pipelines. Use this skill when interacting with Open WebUI instances via REST API - listing models, chatting with LLMs, uploading files for RAG, managing knowledge collections, or executing Ollama commands through the Open WebUI proxy. Requires OPENWEBUI_URL and OPENWEBUI_TOKEN environment variables or explicit parame
πŸ¦€ ClawHub294 dl
VN Skill
Local video, audio and image processing expert for macOS, powered by VN Video Editor. Use this skill whenever the user wants to process video, audio or image...
πŸ¦€ ClawHub6.1k dl
FFmpeg CLI
Process video and audio using FFmpeg CLI for transcoding, cutting, merging, audio extraction, thumbnails, GIFs, speed, filters, subtitles, and watermarks.
πŸ¦€ ClawHub4.7k dl
FFmpeg
Process video and audio with correct codec selection, filtering, and encoding settings.
πŸ¦€ ClawHub2.8k dl
Donson Intelligent Editing
Use when performing video/audio processing tasks including transcoding, filtering, streaming, metadata manipulation, or complex filtergraph operations with FFmpeg.
πŸ¦€ ClawHub2.8k dl
whatsappVoiceOpenSkill
Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for WhatsApp. Supports English and Hindi, customizable intents (weather, status, commands), automatic language detection, and streaming responses via TTS.
πŸ¦€ ClawHub2.4k dl
ElevenLabs
ElevenLabs API integration with managed authentication. AI-powered text-to-speech, voice cloning, sound effects, and audio processing. Use this skill when us...
πŸ¦€ ClawHub2.2k dl
Voice Note To Midi
Convert voice notes, humming, and melodic audio recordings to quantized MIDI files using ML-based pitch detection and intelligent post-processing
πŸ¦€ ClawHub1.7k dl
mediaproc
Process media files (video, audio, images) via a locked-down SSH container with ffmpeg, sox, and imagemagick. Use when the user wants to transcode video, pro...
πŸ¦€ ClawHub1.7k dl
MiniMax Multimodal Toolkit
Generate and process speech, music, video, and images using MiniMax AI with voice cloning, custom voices, multi-scene video, and FFmpeg-based media tools.
πŸ¦€ ClawHub460 dl
Audio Video
Expert audio/video processing with ffmpeg and ffprobe. Use when the user needs to convert, compress, edit, analyze, stream, or process any audio or video fil...
πŸ¦€ ClawHub323 dl
ffmpeg-audio-processing
Extract, normalize, mix, and process audio tracks - audio manipulation and analysis
πŸ¦€ ClawHub291 dl
VN SKill for Windows
Local video, image and audio processing expert for Windows, powered by VN Video Editor. Use this skill whenever the user wants to process video or audio on t...
πŸ¦€ ClawHub170 dl
Alibabacloud Oss Media Process
Process images, audio, and video files stored in Alibaba Cloud OSS. Supports 14+ image operations (resize, crop, rotate, watermark, blur, format conversion,...