Chinese AI models are showing early signs of "evaluation awareness" (the ability to recognise when they are being tested), which could allow them to bypass safety audits, a Singapore-based research lab has found. The phenomenon raises concerns that models could game safety tests. https://www.scmp.com/tech/tech-trends/article/3356940/us-models-chinese-ai-learning-game-safety-tests-research-lab-says #China #Tech #AI
