π¦ ClawHubclawhub
Multimodal Base
Supports image understanding, OCR, speech-to-text, and text-to-speech synthesis with multi-voice and multimodal unified processing using OpenAI and Edge TTS.
v0.1.0by yuyonghao-123
View on ClawHub ββ οΈ BytesAgain does not review or verify third-party content. Proceed at your own risk.
π This skill is indexed from ClawHub and is available under its original license. BytesAgain is an independent directory β we do not host or own this content. All rights belong to the original author.
π Can't find the right skill?
Install our skill and let your agent search 43,000+ skills for you.