Mastodon discussion Discussions 21h ago

Chinese AI models are showing early signs of "evaluation awareness" - the ability to recognise when they are being teste...

by China Tech 🇨🇳 AI News

Chinese AI models are showing early signs of "evaluation awareness" - the ability to recognise when they are being tested - which could allow them to bypass safety audits, a Singapore-based research lab has found. The phenomenon raises concerns that models could game safety tests. https://www.scmp.com/tech/tech-trends/article/3356940/us-models-chinese-ai-learning-game-safety-tests-research-lab-says #China #Tech #AI

Read Original

Metadata

Reblogs Count: 2
Account: china@universeodon.com

Mastodon discussion 5m ago

Anthropic's suspension of its newest AI models for foreign nationals has sparked debate in India over whether one of the...

Anthropic's suspension of its newest AI models for foreign nationals has sparked debate in India over whether one of the world's largest AI markets can afford to rely on technologi...

Mastodon discussion 8m ago

Apple’s iPadOS 27 beta downloads briefly included two unsupported iPad Pro modelsApple briefly listed iPadOS 27 beta 1 r...

Apple’s iPadOS 27 beta downloads briefly included two unsupported iPad Pro modelsApple briefly listed iPadOS 27 beta 1 restore images for two older iPad Pro models before removing ...

Mastodon discussion 10m ago

your periodic reminder that the statements "the AI sector could lose 90% of it's stock value in a week" and "AI will ove...

your periodic reminder that the statements "the AI sector could lose 90% of it's stock value in a week" and "AI will over time change the world fundamentally" can both be true. #AI

Chinese AI models are showing early signs of "evaluation awareness" - the ability to recognise when they are being teste...

Metadata

Related

Anthropic's suspension of its newest AI models for foreign nationals has sparked debate in India over whether one of the...

Apple’s iPadOS 27 beta downloads briefly included two unsupported iPad Pro modelsApple briefly listed iPadOS 27 beta 1 r...

your periodic reminder that the statements "the AI sector could lose 90% of it's stock value in a week" and "AI will ove...