(Yet Another) KV cache calculator - kvanta.vcerny.cz

r/LocalLLaMA
Generative AI AI Tools

Hello everyone, I thought all public web-based KV cache calculators kinda suck. so I decided to create one I would like to use myself - KVANTA It should any LLM/VLM from Hugging Face, if not let me know! (also, it's Apache 2.0) submitted by /u/Fun-Purple-7737 [link] [comments]