Rohan's Bytes
"Adjoint sharding for very long context training of state space models"
AI Paper Explained
Rohan Paul
Jan 27
The podcast below was generated with Google's Illuminate.