Discussion about this post

User's avatar
Somdatta Bhattacharya's avatar

All well and good, the only thing I find suspicious is R1 identifies as o1 when prompted for its identity. Whereas in their entire paper there is not a whiff of o1. Also suspicious is the fact that in all the benchmarks it scores precisely the same numbers as o1 or less. If it was truly independent then why this exact matching, why not for instance, scores closer to o3, or even otherwise, why not more varied scores that do not match those of o1? Also, why does it repeat the same jokes as o1?

Expand full comment

No posts