Rohan's Bytes
Subscribe
Sign in
AI Paper Explained
EAGLE-2: Faster Inference of Language Models…
Rohan Paul
Nov 11, 2024
LLM inference becomes 4x faster by predicting future tokens using dynamic confidence trees
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
EAGLE-2: Faster Inference of Language Models…
LLM inference becomes 4x faster by predicting future tokens using dynamic confidence trees