π¦ ClawHub
TurboQuant+ KV Cache Compression
by @wukai8289
TurboQuant+ compresses llama.cpp KV caches on Apple Silicon up to 6.4x with minimal quality loss, enabling larger models and longer contexts efficiently.
TERMINAL
clawhub install turboquant-plus