Discussion about this post

Neural Foundry

The even-handedness results are quite revealing. It's interesting to see that both Opus 4.1 and Sonnet 4.5 show near-perfect centrist scores (0.95 and 0.94), while GPT-5 and other models come in slightly lower at 0.89. This structured scoring approach seems like a solid step toward transparently assessing AI bias. I wonder how this methodology would hold up across different cultural and linguistic contexts.

