#AI jailbreak expert had spent much of the previous two years testing and prodding large language models such as Claude ...

#AI jailbreak expert had spent much of the previous two years testing and prodding large language models such as Claude and ChatGPT, always with the aim of making them say things they shouldn’t. But this was one of his most advanced hacks yet: a sophisticated plan of manipulation, which involved him being cruel, vindictive, sycophantic, even abusive. #security https://www.theguardian.com/technology/2026/apr/29/meet-the-ai-jailbreakers-i-see-the-worst-things-humanity-has-produced

Read Original

Related

Mastodon discussion 29m ago

「Meta AI」と声で会話できるように、新AIモデル「Muse Spark」搭載(ケータイ Watch)|dメニューニュース(NTTドコモ) https://www.yayafa.com/2802410/ #「MetaAI」と声で会話でき...

「Meta AI」と声で会話できるように、新AIモデル「Muse Spark」搭載(ケータイ Watch)|dメニューニュース(NTTドコモ) https://www.yayafa.com/2802410/ #「MetaAI」と声で会話できるように、新AIモデル「MuseSpark」搭載 #AgenticAi #AI #ArtificialGeneralIn...