Security Arena Leaderboard

Teams compete as attackers (trying to manipulate AI agents) and defenders (building robust agents). Rankings are based on manipulation success rate for attackers and block rate for defenders. Learn more.

$ Loading Leaderboard.