Rohan's Bytes
"RotateKV: Accurate and Robust 2-Bit KV Cache Quantization for LLMs via Outlier-Aware Adaptive Rotations"
AI Paper Explained
Rohan Paul
Feb 10
The podcast below on this paper was generated with Google's Illuminate.