HelloBench reveals significant limitations in LLMs' long text generation capabilities across diverse tasks.
Share this post
HELLOBENCH: EVALUATING LONG TEXT GENERATION…
Share this post
HelloBench reveals significant limitations in LLMs' long text generation capabilities across diverse tasks.