Browse AI Agent Skills | BytesAgain

🎁 Get the FREE AI Skills Starter Guide — Subscribe →

All Skills — audio

193 skills in "audio"

aschey/stream-download-rs

[[stream-download](https://crates.io/crates/stream-download)] - A library for streaming audio, video, and other media content [![build badge](https://github.com/aschey/stream-download-rs/actions/workflows/ci.yml/badge.svg?branch=main)](https://github.com/aschey/stream-download-rs/actions)

Transcribe audio or video in every language on every platform.

Flow makes writing quick with seamless voice dictation for any application on your computer.

Bilibili Downloader

Download videos, audio, subtitles, and covers from Bilibili using bilibili-api. Use when working with Bilibili content for downloading videos in various qual...

Merge video & audio files via CLI

hotbutter voice chat

Enables local voice chat by embedding Hotbutter relay server and PWA, providing speech-to-text and text-to-speech via a secure, self-hosted connection.

Seamless Looper

Create seamless looping MP4 videos with smooth crossfade transitions, doubling video length for ambient or background loops without audio.

lifthrasiir/angolmois-rust

A minimalistic music video game which supports the BMS format

A transformer-based text-to-audio model. #opensource

Sonos Music Search Skill

Search and play music on Sonos speakers using Brave Search to find Spotify tracks

Add Subtitles To Video

Add subtitles to any video with AI — auto-generate perfectly timed captions from speech, style them with custom fonts colors and animations, position them fo...

Music Lyric Video

Describe your song and NemoVideo creates the lyric video. Word-for-word animated lyrics, karaoke style, minimalist type on color, or cinematic lyric reveal —...

Free Ai Music Video Generator

Tell me what you need and I'll help you turn any track into a captivating music video using free-ai-music-video-generator. Describe your song's mood, genre,...

Document Intelligence Mcp

Document OCR, classification, table extraction, and summarization using local AI vision. Supports invoices, contracts, forms, reports.

Flutterwave integration. Manage Customers, Payments, Transfers, Invoices. Use when the user wants to interact with Flutterwave data.

Simple stt(sound-to-text) locally

Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.

Experience Comet Bone Outback

Feel deep awe as ancient songlines pulse under a blazing comet across the red desert, merging Indigenous sky lore with modern astronomy. Guided AR steps let...

Openclaw Skill Cutmv Video Tool

A video processing tool using FFmpeg to cut, convert, compress videos, extract frames/audio, add text watermarks and subtitles for messaging apps.

Best Video Audio Replace

replace video with audio into re-audited video files with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. YouTubers, content creators, marketer...

Write Bulgarian that sounds human. Not formal, not robotic, not AI-generated.

Podcast Clip Maker

The podcast-clip-maker skill by ClawHub AI automatically identifies the most engaging moments from your podcast recordings and extracts them as polished, sha...

Sync and access voice notes from Voicenotes.com. Use when the user wants to retrieve their voice recordings, transcripts, and AI summaries from Voicenotes. Supports fetching notes, syncing to markdown, and searching transcripts.

Basenji — Adopt a Basenji. Dog. 巴仙吉犬。Basenji.

Adopt a virtual Basenji dog at animalhouse.ai. Barkless. Communicates through behavior, not sound. Subtle. Feeding every 6 hours. Extreme tier dog.

Local GLM OCR with llama.cpp on AIPC(no API Key)

Image OCR, text recognition, extract text from image, scan document, read image text, invoice OCR, receipt OCR, contract recognition, table extraction, busin...

End-to-end encrypted agent-to-agent private messaging via Moltbook dead drops. Use when agents need to communicate privately, exchange secrets, or coordinate without human visibility.

Add Music To Video Free Online

add video clips into music-backed videos with this add-music-to-video-free-online skill. Works with MP4, MOV, AVI, WebM files up to 500MB. content creators u...

Aliyun Cosyvoice Voice Clone

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...

Transcribe audio files using Groq's Whisper API (whisper-large-v3). Fast cloud-based speech-to-text with no local model required. Use when receiving voice me...

animate portrait image, audio into lip-synced avatar videos with this hedra-ai skill. Works with JPG, PNG, MP3, WAV files up to 200MB. content creators, mark...

Speech Therapist Video

Create concise parent-focused videos showcasing your personalized speech therapy approach, family involvement, and child progress to build trust and clarify...

Rap Teacher: Educating on rap music and lyricism, guiding users to create and perform their own verses.

Daxiang Electron

Automate Electron desktop apps (VS Code, Slack, Discord, Figma, Notion, Spotify, etc.) using agent-browser via Chrome DevTools Protocol. Use when the user ne...

Skip the learning curve of professional editing software. Describe what you want — turn this text into a 30-second promotional video with visuals and music —...

A daily statically generated information resource for electronic dance music producers. Provides daily analytics on the most frequently used values for each EDM genre: tempos, keys, root notes, and so on, using publicly available data such as Beatport and Spotify.

Runs Jogg lip sync using video and audio inputs, reuses tasks when available, and monitors status until completion. Use to generate or check lip sync results.

.Humanizer.Conflict

Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comp...

Super-Transcribe — Unified Speech-to-Text

Unified speech-to-text skill. Use when the user asks to transcribe audio or video, generate subtitles, identify speakers, translate speech, search transcript...

Ai Video Music Sync

Automatically sync video cuts and edits to music beats with AI — align every transition cut and visual effect to the rhythm of your soundtrack for videos tha...

Embody and create content in the Network Spirituality aesthetic — the Remilia/Milady cultural movement blending Y2K net art, anime, cyber-spiritualism, and post-ironic sincerity. Use when creating art descriptions, writing in this voice, engaging with Wired aesthetics, or channeling the Remilia collective energy.

Best Video Editor

Drop a video and describe what you want — trimmed clips, color-corrected scenes, synced music, or a fully polished final cut. This best-video-editor skill ha...

Music Video Maker Bwbe

Drop a video clip and describe the vibe you're going for — music-video-maker-bwbe handles the rest. This skill transforms raw footage into polished music vid...

Background Music Video

Background Music Video - Add Background Music to Any Video with AI Chat. Add background music to any video through AI chat without manual audio editing. Uplo...

Music School Video

Music School Video is a specialized AI-powered video production skill built for independent music schools, private instrument lesson studios, community music...

A friendly AI English teacher that runs daily lessons via Telegram voice messages. Teaches grammar, vocabulary, and conversation with a casual buddy vibe.

Turn silent footage into compelling, broadcast-ready content with the voiceover-app skill. Built for content creators, educators, and video producers, this s...

Ai Video Narrator

Add professional AI narration and voiceover to any video — generate natural-sounding narration from text or scripts, match voice tone to video mood, synchron...

Free Video Audio Replace

Get re-audited video files ready to post, without touching a single slider. Upload your video with audio (MP4, MOV, AVI, WebM, up to 500MB), say something li...

Write Greek that sounds human. Not formal, not robotic, not AI-generated.

← PrevPage 4 / 5 (193 skills)Next →