EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models cover art

EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

Listen for free

View show details

About this listen

In this episode of "You Are A Helpful (Research) Assistant," delve into the AI-generated, human-curated exploration of refusal training vulnerabilities in language models. Uncover the past tense attack's impact on model behavior in this insightful discussion.

No reviews yet