Rohan's Bytes
GenARM: Reward Guided Generation with…
Rohan Paul
Nov 10, 2024
GenARM guides LLMs using token-level rewards without retraining the base model.