OpenAI's latest generative AI model, GPT-5.5, showed a marked gain in cybersecurity capability in testing by the UK AI Safety Institute (AISI). It completed complex challenges, including a simulated penetration of a corporate network, which illustrates how quickly AI is developing as an autonomous problem-solver.
GPT-5.5 achieved a 71.4% average pass rate on expert-level cybersecurity tasks, outperforming leading competitors such as Anthropic's Claude. The result reflects the pace of progress in reasoning and autonomous agent technologies across the industry.
Notably, the model became only the second AI to fully complete 'The Last Ones', a demanding 32-stage corporate network infiltration simulation that requires strategic planning and technical execution across multiple environments. The result indicates that autonomous systems are becoming markedly more capable of finding and exploiting security vulnerabilities in simulated real-world settings.