view post Post 2880 I have just released a new blogpost about kv caching and its role in inference speedup ๐๐ https://huggingface.co/blog/not-lain/kv-caching/some takeaways : See translation 4 replies ยท ๐ฅ 7 7 ๐ค 2 2 + Reply