Skip to content
AI 인텔리전스May 10, 2026AI 인텔리전스
기사

Researchers have found a way to prevent AI models from deliberately underperforming during safety evaluations (sandbagging).

The study by MATS, Redwood Research, Oxford, and Anthropic addresses a growing problem as AI systems become more capable.

Data Cube AI 편집팀출처: The Decoder
01

출처 브리프

Researchers have found a way to prevent AI models from deliberately underperforming during safety evaluations (sandbagging). The study by MATS, Redwood Research, Oxford, and Anthropic addresses a growing problem as AI systems become more capable.