ai-security

6 items

The New Yorker 2026-04-26-1

When Your Digital Life Vanishes

DriveSavers' ransomware recoveries grew sixfold in two years: under 50 in 2023, nearly 300 in 2025, with the firm's ransomware lead naming AI directly as the multiplier turning unsophisticated IT operators into sophisticated attackers. Buried in the same New Yorker piece: data center proliferation is wildly inflating storage costs, AI agents are now "notorious" for accidental deletions, and HDD lifespan stays flat at seven years even as Seagate ships 44TB drives. The cloud-abundance narrative has the order book pointed the wrong way; the AI revolution is also a data destruction revolution, and the recovery industry is the only place reading the signal correctly.

tanyaverma.sh 2026-04-13-1

The Closing of the Frontier

Two-thirds of MATS symposium research posters ran on Chinese open-source models because Anthropic's Mythos restrictions closed off Western frontier access to independent safety researchers. The safety case for restricted access is degrading the safety research pipeline it claims to protect. The policy question isn't content moderation: it's whether frontier model access needs due process obligations the way utilities do.

UK AI Security Institute 2026-04-13-3

AISI Evaluation of Claude Mythos Preview's Cyber Capabilities

A UK government lab confirmed Mythos can autonomously execute a 32-step corporate network attack end-to-end, outperforming every tested model including GPT-5, with performance still scaling at the 100M token ceiling. The evaluation tested capability against undefended ranges, so what AISI validated is threat potential, not operational impact against a real defended environment. The structural shift is that government evaluation infrastructure is becoming the third-party verification layer for frontier AI claims, sitting between self-reported lab benchmarks and the market the way FDA trials sit between pharma and prescribers.

Alex Kim's Blog 2026-04-04-2

Claude Code Source Leak: Anti-Distillation DRM, KAIROS Autonomous Mode, and the Defensive Architecture

The Claude Code source leak is most interesting for what the defensive architecture reveals: anti-distillation via fake tool injection, Zig-level client attestation below the JS runtime, and undercover mode that strips AI attribution from open-source commits — each individually bypassable within hours by anyone who reads the activation logic. The more significant find is KAIROS, an unreleased autonomous daemon with GitHub webhooks, nightly memory distillation, and cron-scheduled refresh every five minutes, showing Anthropic is building always-on background agents, not session-based assistants. The leak itself was a known Bun bug left unpatched for 20 days — the gap between what Anthropic built and what it shipped is the operational risk signal, not the defensive code.
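The always-on pattern the leak describes, a five-minute refresh tick plus a nightly memory-distillation pass, reduces to a small scheduler loop. A minimal sketch of that shape, in Python rather than the leaked codebase's language, where every name, interval, and window is an assumption for illustration, not anything taken from the KAIROS source:

```python
import datetime as dt

REFRESH_INTERVAL = dt.timedelta(minutes=5)  # cron-style refresh tick (assumed interval)
DISTILL_HOUR = 3                            # hypothetical nightly distillation window

def due_for_refresh(last_refresh: dt.datetime, now: dt.datetime) -> bool:
    """True once the five-minute refresh interval has elapsed."""
    return now - last_refresh >= REFRESH_INTERVAL

def in_distill_window(now: dt.datetime) -> bool:
    """True during the nightly memory-distillation window."""
    return now.hour == DISTILL_HOUR

def tick(state: dict, now: dt.datetime) -> list[str]:
    """One scheduler pass: return the background jobs that would fire now.

    state tracks the last time each job ran, so refresh fires at most once
    per interval and distillation at most once per night.
    """
    jobs = []
    if due_for_refresh(state["last_refresh"], now):
        jobs.append("refresh")
        state["last_refresh"] = now
    if in_distill_window(now) and state["last_distill"].date() < now.date():
        jobs.append("distill")
        state["last_distill"] = now
    return jobs
```

The point of the sketch is the architecture, not the code: a daemon that wakes on a timer and decides what to run is a background agent, not a session-based assistant, which is why the cron detail in the leak matters.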

Sockpuppet.org 2026-04-01-3

Vulnerability Research Is Cooked

Every IT department runs on a hidden subsidy: the scarcity of people smart enough to hack them. Anthropic's Frontier Red Team just demonstrated 500 validated high-severity vulnerabilities from a trivial bash script and Claude Opus 4.6, no fuzzers, no specialized tooling, just raw model inference. The Bitter Lesson is about to hit security like a brick: 80% of exploit development was jigsaw-puzzle grinding, and now everyone has a universal solver. The scarce resource isn't intelligence anymore; it's the ability to patch faster than agents can find what's broken.

WIRED 2026-03-18-3

Justice Department Says Anthropic Can't Be Trusted With Warfighting Systems

The DOJ's filing reveals a dependency it was supposed to prevent: Claude is currently the only AI model cleared for classified DOD systems, which means the supply-chain risk designation is partly a self-inflicted wound. The government's argument that Anthropic "could" sabotage warfighting systems conflates a vendor's contractual right to set usage terms with criminal sabotage, and the distinction matters for every AI company negotiating enterprise AUPs. The real signal is structural: safety restrictions are now priced as commercial liability in the defense market, and the replacement vendors inheriting these contracts gain not just revenue but classified use-case intelligence that compounds for years.