Rohan's Bytes
Subscribe
Sign in
Your Mixture-of-Experts LLM Is Secretly an…
Rohan Paul
Nov 7, 2024
MoE models secretly contain powerful embedding capabilities within their routing mechanisms.
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Your Mixture-of-Experts LLM Is Secretly an…
MoE models secretly contain powerful embedding capabilities within their routing mechanisms.