Tagged
AI Evaluation

Policy & Safety
OpenAI Unveils Shared Playbook for Trustworthy Third-Party AI Evaluations
OpenAI has released guidance for third-party AI evaluations, offering a standardized framework for assessing the capabilities, safeguards, and validity of frontier AI systems. The guidance aims to foster greater trust and responsibility in AI development and deployment.
Jun 3, 2026
AI Agents
Unlocking the Future of AI Agents: IBM Research and Hugging Face Launch the Open Agent Leaderboard
IBM Research and Hugging Face have unveiled the Open Agent Leaderboard, a new platform for benchmarking and evaluating the performance of AI agents. The initiative aims to bring transparency, standardization, and common ground to developers and researchers working on agentic AI.
May 27, 2026