Researchers present IatroBench, a pre-registered benchmark that measures potential harms caused by AI safety interventions themselves. The study examines whether safety measures inadvertently create negative effects that warrant consideration in AI system design.
Hacker News
https://arxiv.org/abs/2604.07709
#AI #MachineLearning #tech