Anthropic’s browser agent got hijacked 31.5% of the time before safeguards engaged

3 min read1 views

Across the frontier labs, the highest prompt injection figures published this spring are Anthropic’s. Point a red-teamer at its newest model in a browser, and the attacker hijacked it 31.

What Happened

Across the frontier labs, the highest prompt injection figures published this spring are Anthropic’s. Point a red-teamer at its newest model in a browser, and the attacker hijacked it 31.5% of the time before safeguards engaged. OpenAI, Google, and Meta never gave security leaders a comparable number to set beside it. That figure looks like a liability. In this comparison, it is the opposite. It's the one solid piece of ground. Four frontier labs each shipped a prompt injection disclosure, and n

This story caught our attention because it speaks to a broader shift happening across the tech industry right now. Companies large and small are rethinking how they approach AI — and the results are starting to show.

Why It Matters

The implications here go beyond the headline. We're seeing a pattern where AI capabilities that seemed years away are arriving much sooner than expected. That's creating both opportunities and real challenges for teams trying to keep up.

For developers and businesses, the practical question is straightforward: how do you take advantage of these advances without getting burned by the hype? The answer, as usual, depends on context — but the direction is clear.

The Bigger Picture

It's worth stepping back and looking at where this fits in the broader arc of AI development. We've moved past the "wow, it can do that?" phase and into the "okay, but can we actually use this?" phase. That's a healthy transition.

The companies that figure out how to build reliable, production-ready AI systems — not just impressive demos — are going to be the ones that matter in the next few years.

What to Watch For

Keep an eye on how this plays out over the coming months. The real test isn't whether the technology works in a lab setting, but whether it holds up under the messy, unpredictable conditions of the real world. That's where things get interesting.

Related Articles

AI

Latest: GitHub Copilot users see token-based price hikes

Since its announcement in April this year, the proposed changes to billing methods on GitHub Copilot were a source of much speculation: how much more or less would a pay-a-you-use AI cost an organisation or individual compared to a flat-rate, monthly subscription? Just a day into the changeover to token-based billing for the LLM-based service, […] The post GitHub Copilot users see token-based price hikes appeared first on AI News.

AI

AI in video game development: How artificial intelligence is reshaping the industry

A Google Cloud survey found that 90% of developers are already integrating AI into their daily work, and on Steam, 7,818 titles disclosed AI use in 2025 alone, a 681% increase over the previous year. AI in video game development is not a side experiment.

AI

Claude Mythos exposed a hard truth: Your enterprise patching process is way too slow

In 2024, researchers from the University of Illinois found that GPT-4, when provided with a common vulnerabilities and exposures (CVE) description, could autonomously exploit 87% of a curated 15-vulnerability one-day dataset. Without the description, it could only exploit 7%.

AI

AI agents are entering their rebuild era as enterprises confront the reliability problem

As enterprise AI agents move into production, organizations are confronting a growing reliability problem. Many teams are discovering that LLM performance alone does not determine whether agents succeed in production.

AI

MIT's MeMo lets teams swap in a better LLM without retraining — and performance jumps 26%

Enabling LLMs to acquire new knowledge after training remains a major hurdle for enterprise AI — current solutions are either too expensive, too slow, or constrained by context window limits. MeMo, a framework from researchers at multiple universities, encodes new knowledge into a dedicated smaller memory model that operates separately from the main LLM.

AI

Anthropic's Claude Opus 4.8 is here with 3X cheaper fast mode and near-Mythos level alignment

Anthropic today released Claude Opus 4.8, an upgrade to its flagship model that ships at the same price as its predecessor, alongside a dramatically cheaper "fast mode" tier and a new feature that lets the model spawn hundreds of parallel subagents for codebase-scale work.

AI

How DeepSeek’s radical architecture is shattering Silicon Valley's token moat

DeepSeek’s announcement over the weekend that it has made its 75% price cut permanent on its flagship V4 Pro model is a disruptive assault on the capital-heavy business models of Silicon Valley’s frontier labs.  The reduction on DeepSeek V4 Pro directly undercuts comparable Western models used as workhorses for enterprise production.

AI

The Download: climate tech goes public and the AI Hype Index returns

This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology. Climate tech companies are going public.

Comments

Leave a Comment

Loading comments...