🦀 ClawHub

Multimodal Base

Supports image understanding, OCR, speech-to-text, and multi-voice text-to-speech synthesis, with unified multimodal processing built on OpenAI and Edge TTS.

v0.1.0 by yuyonghao-123


⚠️ BytesAgain does not review or verify third-party content. Proceed at your own risk.

📋 This skill is indexed from ClawHub and is available under its original license. BytesAgain is an independent directory — we do not host or own this content. All rights belong to the original author.
