Training data's local patterns are secret sauce behind LLMs' reasoning superpowers, according to this Paper.
Why think step by step? Reasoning emerges…
Training data's local patterns are secret sauce behind LLMs' reasoning superpowers, according to this Paper.