This paper evaluates LLMs' social abilities through complex multi-agent interactions and private information handling
AgentSense: Benchmarking Social Intelligence…
This paper evaluates LLMs' social abilities through complex multi-agent interactions and private information handling