"More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives"

The podcast below, covering this paper, was generated with Google's Illuminate.

Rohan Paul

Jan 13, 2025

Smart weighting of examples helps LLMs maintain peak performance with more context.

DR-ICL (Differentiated In-Context Learning) improves LLM performance in many-shot settings by combining differentiated learning with advantage-based reweighting, addressing the performance decline that appears as the number of demonstration examples grows.

-----

https://arxiv.org/abs/2501.04070

🤔 Original Problem:

LLM performance can decline as the number of in-context examples grows from few-shot to many-shot. The paper attributes this to suboptimal negative log-likelihood (NLL) optimization and to the noise that accumulates in larger demonstration sets.

-----

🔧 Solution in this Paper:

→ DR-ICL uses differentiated learning to optimize globally, ensuring many-shot performance exceeds zero-shot levels

→ Implements advantage-based reweighting locally to filter noise in many-shot demonstrations

→ Divides sequences into reweighting windows and calculates advantages from previous window samples

→ Integrates advantages into NLL computation for dynamic weight adjustment

→ Combines global and local perspectives through a refined training objective
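
To make the steps above concrete, here is a minimal Python sketch of window-based advantage reweighting plus a global many-shot vs. zero-shot term. This is not the paper's exact formulation. The function names, the exponential weighting, the `temperature`, `margin`, and `alpha` parameters, and the hinge-style global penalty are all assumptions made for illustration. The sketch also assumes that a demonstration whose NLL is well above the previous window's average is likely noise and should be down-weighted.

```python
import numpy as np

def window_advantages(nll, window_size):
    """Assumed advantage: previous window's mean NLL minus this example's NLL.

    The first window has no predecessor, so its advantages stay at zero.
    """
    nll = np.asarray(nll, dtype=float)
    adv = np.zeros_like(nll)
    for start in range(window_size, len(nll), window_size):
        baseline = nll[start - window_size:start].mean()
        end = start + window_size
        adv[start:end] = baseline - nll[start:end]
    return adv

def reweighted_nll(nll, window_size, temperature=1.0):
    """Weighted average NLL, with weights exp(advantage / temperature).

    The exponential form is an illustrative choice, not taken from the paper.
    """
    nll = np.asarray(nll, dtype=float)
    adv = window_advantages(nll, window_size)
    weights = np.exp(adv / temperature)
    loss = float(np.sum(weights * nll) / np.sum(weights))
    return loss, weights

def combined_objective(many_shot_nll, zero_shot_nll, window_size,
                       margin=0.0, alpha=1.0):
    """Local reweighted loss plus a hypothetical global hinge penalty.

    The penalty is positive only when the mean many-shot NLL is not
    below the zero-shot NLL by at least `margin`.
    """
    local_loss, _ = reweighted_nll(many_shot_nll, window_size)
    gap = float(np.mean(many_shot_nll)) - float(zero_shot_nll) + margin
    return local_loss + alpha * max(0.0, gap)

# Toy per-demonstration NLL values: the third example looks noisy.
toy_nll = [1.0, 1.0, 3.0, 1.0]
loss, weights = reweighted_nll(toy_nll, window_size=2)
print(weights)  # the third weight is smaller than the others
print(loss)     # lower than the unweighted mean of 1.5
```

In this toy run the high-NLL example in the second window receives a weight below 1, so it contributes less to the loss than it would under a plain average. Whether the paper normalizes or clips its weights in this way is not stated in this summary.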

-----

💡 Key Insights:

→ Many-shot doesn't always mean better performance in LLMs

→ Performance plateaus and then declines as the number of demonstrations increases

→ Noise accumulation significantly impacts model effectiveness