AI Safety & Benchmarking: Building Trustworthy Evaluation Ecosystems

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

AI Safety & Benchmarking: Building Trustworthy Evaluation Ecosystems

Listen for free

View show details

About this listen

Effective AI supervision requires reliable benchmarking ecosystems. Nicholas Miailhe discusses why benchmarks matter, how they should be constructed, and what regulators need to know about safety evaluations. The conversation highlights emerging international efforts to standardise safety testing and ensure comparability across models.

Speaker: Nicholas Miailhe (PRISM Eval)

Interviewer: Doaa Abu Elyounes, Programme Specialist, Ethics of AI Unit, UNESCO

Hosted on Ausha. See ausha.co/privacy-policy for more information.

No reviews yet