ai-reliability

2 items

The Economist 2026-04-11-1

AI mathematicians: By devising and verifying proofs, AI is changing how maths is done

Four independent groups racing to formalize proofs in Lean, and Math Inc. translated Viazovska's sphere-packing work in weeks rather than the decade Hales needed for peer review, but DARPA's Shafto names the real bottleneck as trust, not computation. AI's primary value in mathematics is making claims auditable at scale. That separation between generation and formal verification is the architecture every enterprise AI system will eventually need.

MIT Technology Review 2026-04-03-1

There are more AI health tools than ever — but how well do they work?

Oxford researchers found non-expert users with LLM assistance identify medical conditions only a third of the time, even when the model alone gets it right. The binding constraint on health AI isn't model capability: it's the interaction gap between what the model knows and what users can extract. Companies racing to ship health chatbots are optimizing the wrong layer; the ones building structured intake UX will outperform the ones chasing benchmark scores.