Google, OpenAI, and Anthropic Push AI Speed and Governance 2026-05-04

Major AI players reveal faster models, voice systems, and new governance tools.

Google highlights 3x TPU speedups with DFlash speculative decoding DFlash speculative decoding boosts TPU v5p LLM serving with 3.13x throughput gains See how UCSD's vLLM TPU integration outperforms EAGLE3 and speeds up inference How OpenAI scales lowlatency voice AI OpenAI WebRTC infrastructure for lowlatency voice at scale See how its relaytransceiver design cuts latency and improves voice quality Google launches governance features for agentic AI Gemini Enterprise Agent Platform adds builtin governance for enterprise AI agents Track, audit, and control agents as Google tackles AI sprawl and oversight gaps Anthropic launches enterprise AI services company with Blackstone, H&F, and Goldman Sachs Claude AI services for midsized businesses with custom enterprise deployments See how Anthropic’s new venture supports healthcare, banking, and manufacturing Physical AI raises governance questions for autonomous systems Physical AI governance for robots and industrial machines, with safety controls See how autonomous systems need monitoring, stop mechanisms, and human approval AI outperforms doctors in Harvard emergency room study OpenAI o1preview outdiagnosed ER doctors in a Harvard study of 76 real cases See how raw EHR text helped the model spot cases faster and more accurately