ai-security

12 items

New York Times 2026-05-14-1

Google Says Criminal Hackers Used A.I. to Find a Major Software Flaw

Google's criminal AI zero-day confirms the new attack topology: AI compressed bug discovery to near-zero cost, but the attacker still needed credentials and the patch cycle still ran in days. The asymmetric trade sits in IAM hardening and patch-velocity infrastructure. The AI-security pure-plays are already priced for the headline; the credential layer is what actually moved.

The New York Times 2026-05-12-2

Google Says Criminal Hackers Used A.I. to Find a Major Software Flaw

AI compressed vulnerability discovery to near-zero cost; credentialed access remained the second gate. Google's disclosure of the first criminal AI-enabled zero-day is the empirical confirmation that the offense-side binding constraint has shifted from bug-finding to credential acquisition, which re-rates the IAM stack more cleanly than the AI-security pure-plays. Rob Joyce's "fingerprint at the crime scene" line points to a parallel category in forensic AI-authorship detection that remains structurally unfilled.

blog.himanshuanand.com 2026-05-11-3

The 90 Day Disclosure Policy Is Dead

Coordinated disclosure was an information-containment regime, and containment fails when discovery diffuses. Eleven independent researchers landed the same critical bug in six weeks; Copy Fail took roughly an hour of AI-assisted scanning to find; Dirty Frag's embargo collapsed within hours via unrelated rediscovery, with Microsoft Defender confirming in-the-wild exploitation a day later. The offense side has integrated LLMs into exploit pipelines. The defense and policy layer largely has not, and that asymmetry is the actual risk — CVE feeds are now lagging artifacts, and patch-diff intelligence is the signal that matters.

WIRED · 2026-05-07 2026-05-09-w3

5,000 Vibe-Coded Apps Are Leaking on the Open Web — and the S3 Analogy Misses the Legal Novelty

RedAccess found over 5,000 exposed apps across the four leading vibe-coding platforms, with roughly 2,000 leaking real PHI, customer chat logs, and internal strategy decks. These aren't misconfigured storage buckets; they're auth logic the platform generated and the user never saw. The S3 analogy that's circulating misses the legal novelty: AWS could credibly disclaim your bucket policy because you wrote it. Lovable, Replit, and Base44 wrote the auth logic that isn't there. That shifts where liability attaches, and the first court to hold a code-generation platform partially liable for a generated vulnerability resets every product roadmap in the category overnight. It's the same verification failure the hedge fund and interpretability stories surface from different angles: the layer that was supposed to enforce quality or security has been dissolved by the technology it was meant to govern. The people building trust infrastructure for that layer, across all three markets, are the ones with a durable position.

Bloomberg 2026-05-09-2

AI Is Making Digital Fraud Easier, Faster and Harder to Stop

Breach notifications to victims fell 79% last year while breaches hit a record high — the disclosure regime didn't get repealed, it decayed through underuse. Companies underdisclose, states underenforce, and the cost lands on consumers and small banks while AI defense vendors capture the rents. The structural fix — continuous identity attestation at the rails layer — is the same control plane the agentic enterprise stack needs, which means two demand vectors pointing at the same consolidation.

WIRED 2026-05-07-3

5,000 Vibe-Coded Apps Are Leaking on the Open Web — and the S3 Analogy Misses the Legal Novelty

RedAccess found 5,000-plus exposed apps on the four leading vibe-coding platforms with around 2,000 leaking real PHI, customer chat logs, and strategy decks. The S3 analogy is reaching for the right pattern but missing the legal twist: AWS could credibly say it didn't write your bucket policy. Lovable, Replit, and Base44 wrote the auth logic that doesn't exist. The first court that holds a code-generation platform partially liable for a generated vulnerability resets the entire industry's product roadmap overnight.

The New Yorker 2026-04-26-1

When Your Digital Life Vanishes

DriveSavers' ransomware recoveries went 6x in two years: under 50 in 2023, nearly 300 in 2025, with the firm's ransomware lead naming AI directly as the multiplier turning unsophisticated IT operators into sophisticated attackers. Buried in the same New Yorker piece: data center proliferation is wildly inflating storage costs, AI agents are now "notorious" for accidental deletions, and HDD lifespan stays flat at seven years even as Seagate ships 44TB drives. The cloud-abundance narrative has the order book pointed the wrong way — the AI revolution is also a data destruction revolution, and the recovery industry is the only place reading the signal correctly.

tanyaverma.sh 2026-04-13-1

The Closing of the Frontier

Two-thirds of MATS symposium research posters ran on Chinese open-source models because Anthropic's Mythos restrictions closed off Western frontier access to independent safety researchers. The safety case for restricted access is degrading the safety research pipeline it claims to protect. The policy question isn't content moderation: it's whether frontier model access needs due process obligations the way utilities do.

UK AI Security Institute 2026-04-13-3

AISI Evaluation of Claude Mythos Preview's Cyber Capabilities

A UK government lab confirmed Mythos can autonomously execute a 32-step corporate network attack end-to-end, outperforming every tested model including GPT-5, with performance still scaling at the 100M token ceiling. The evaluation tested capability against undefended ranges, so what AISI validated is threat potential, not operational impact against a real defended environment. The structural shift is that government evaluation infrastructure is becoming the third-party verification layer for frontier AI claims, sitting between self-reported lab benchmarks and the market the way FDA trials sit between pharma and prescribers.

Alex Kim's Blog 2026-04-04-2

Claude Code Source Leak: Anti-Distillation DRM, KAIROS Autonomous Mode, and the Defensive Architecture

The Claude Code source leak is most interesting for what the defensive architecture reveals: anti-distillation via fake tool injection, Zig-level client attestation below the JS runtime, and undercover mode that strips AI attribution from open-source commits — each individually bypassable within hours by anyone who reads the activation logic. The more significant find is KAIROS, an unreleased autonomous daemon with GitHub webhooks, nightly memory distillation, and cron-scheduled refresh every five minutes, showing Anthropic is building always-on background agents, not session-based assistants. The leak itself was a known Bun bug left unpatched for 20 days — the gap between what Anthropic built and what it shipped is the operational risk signal, not the defensive code.

Sockpuppet.org 2026-04-01-3

Vulnerability Research Is Cooked

Every IT department runs on a hidden subsidy: the scarcity of people smart enough to hack them. Anthropic's Frontier Red Team just demonstrated 500 validated high-severity vulnerabilities from a trivial bash script and Claude Opus 4.6, no fuzzers, no specialized tooling, just raw model inference. The Bitter Lesson is about to hit security like a brick: 80% of exploit development was jigsaw-puzzle grinding, and now everyone has a universal solver. The scarce resource isn't intelligence anymore; it's the ability to patch faster than agents can find what's broken.

WIRED 2026-03-18-3

Justice Department Says Anthropic Can't Be Trusted With Warfighting Systems

The DOJ's filing reveals a dependency it was supposed to prevent: Claude is currently the only AI model cleared for classified DOD systems, which means the supply-chain risk designation is partly a self-inflicted wound. The government's argument that Anthropic "could" sabotage warfighting systems conflates a vendor's contractual right to set usage terms with criminal sabotage, and the distinction matters for every AI company negotiating enterprise AUPs. The real signal is structural: safety restrictions are now priced as commercial liability in the defense market, and the replacement vendors inheriting these contracts gain not just revenue but classified use-case intelligence that compounds for years.