OpenAI breaks ARC benchmark records while Anthropic reveals AI alignment risks, ElevenLabs achieves 75ms TTS, and Claude gets Excel boost.
π« OpenAI's next generation o3 models evalsβ¦
OpenAI breaks ARC benchmark records while Anthropic reveals AI alignment risks, ElevenLabs achieves 75ms TTS, and Claude gets Excel boost.