Everything That Happened in AI Today Tuesday, May 5, 2026 | The Neuron

Around the Horn Digest: Everything That Happened in AI Today (Tuesday, May 5, 2026)

A Cape Breton fiddler sued Google for $1.5M after AI Overview falsely flagged him as a sex offender; OpenAI considered (then rejected) an Alphabet-style spinout for its robotics and hardware divisions; Nature retracted a flagship paper on ChatGPT in education; Microsoft's "Webwright" research framework set SOTA on long-horizon web agent tasks; and Anthropic shipped keyless authentication for the Claude API.

Written By
Grant Harvey
Grant Harvey
May 5, 2026
22 minute read

Around the Horn Digest: Everything That Happened in AI Today (Tuesday, May 5, 2026)

Ashley MacIsaac sued Google for $1.5M after AI Overview falsely identified him as a sex offender (the first big "AI hallucination becomes defamation" case to clear the filing stage); OpenAI's Sam Altman discussed and rejected an Alphabet-style spinout for its robotics and consumer-hardware divisions; Nature retracted a flagship paper on ChatGPT in education for substandard research; Microsoft Research's terminal-native "Webwright" beat every other web agent on long-horizon benchmarks; Anthropic killed long-lived API keys; and the Brockman trial entered week two with the journals doing most of the talking.

Welcome to the Around the Horn Digest, your daily readout of every AI story worth knowing about. Today was a "the thing AI does to a person" kind of day: a defamation suit against Google's AI Overview, a Cape Breton fiddler asking a court to put a dollar value on a hallucination, a peer-reviewed Nature paper getting yanked for shoddy methodology, and a federal courtroom in Oakland reading Greg Brockman's diary aloud to a jury. There was also actual product news (Anthropic, Vercel, Adaptive, Ollama, Microsoft, and Sakana all shipped meaningful things), but the throughline today was consequences. Let's get into it.

Previous digests: Weekend May 2-3 | Thu Apr 30 | Wed Apr 29 | Tue Apr 28 | Mon Apr 27 | Fri Apr 24 | Thu Apr 23 | Mon Apr 20 | Mon Apr 13

Monthly skill digests: April 2026 (Week 1) | AI Skill — March Part 3 | AI Skill — March Part 2 | AI Skill — March

🆕 NEW From The Neuron

  • When three of AI's top builders tell you coding is solved, pay attention to what they mean — Boris Cherny (Claude Code), Greg Brockman (OpenAI), and Andrej Karpathy all gave Sequoia AI Ascent 2026 talks last week saying the same thing: writing code is essentially solved. Cherny says he hasn't typed a line in 2026 and ships dozens of PRs from his phone; Karpathy says he's never felt more behind as a programmer; Brockman thinks the next era is about scarce compute and managing agents. So if "the code part" isn't the bottleneck anymore, what is? Our breakdown of what these three actually mean by "solved," what's still hard (taste, context, knowing what to build), and what shifts for everyone else.
Advertisement

🏆 TOP 5 NEWS (Around the Horn)

  • Cape Breton fiddler Ashley MacIsaac sued Google for $1.5 million in Ontario Superior Court after Google's AI Overview falsely claimed he was a convicted sex offender (sexual assault, internet luring of a child, assault causing bodily harm) listed for life on the national registry; the Sipekne'katik First Nation cancelled his December 2025 concert based on the summary, then later apologized, while Google never reached out and never offered a retraction. The lawsuit explicitly argues Google should not get reduced liability "because the defamatory statements were published by software that Google created and controls" (CBC, Hollywood Reporter).
  • Sam Altman discussed spinning out OpenAI's robotics and consumer-hardware divisions late last year as an Alphabet-style holding-company structure that would let those units raise external funding and operate independently; the plan was rejected partly because the new entities might still need to remain consolidated on OpenAI's balance sheet, eliminating the bookkeeping benefit. The proposal underscores the trade-offs OpenAI faces racing toward an IPO that could come as soon as the second half of 2026 at a possible $1T valuation; the company could revive the idea later.
  • Nature retracted a flagship paper on the benefits of ChatGPT in education by Chinese researchers that had claimed large positive effects on student learning performance, perception, and higher-order thinking based on a meta-analysis of 51 studies (citing substandard research, with reviewers saying what educators, parents, and policy officials really needed was high-quality data, and what they got was the opposite). The paper had already been cited hundreds of times before the retraction.
  • A Chinese court ruled that companies can't lay off workers on AI grounds, finding that a tech firm in eastern China illegally fired an employee after he refused to take a demotion when his role was automated; the precedent is the clearest signal yet from a major economy that "we replaced your job with AI" is not a defensible legal basis for termination.
  • Anthropic shipped Workload Identity Federation for the Claude API, letting workloads authenticate to the Claude Platform with short-lived identity tokens from your existing identity provider (AWS, GCP, Azure, or any OIDC) instead of long-lived static API keys, with browser-CLI flow also supported (launch tweet).

Honorable Mentions

  • Microsoft Research's "Webwright" is a terminal-native web-agent framework that gives the model a terminal, a local workspace, and the freedom to write Playwright code that launches, inspects, and discards browser sessions, ending with a reusable program as output (single ~1K-line harness, no multi-agent orchestration, no self-reflection gate, no context compaction); it hits SOTA 86.7% on Online-Mind2Web (with GPT-5.4) and 60.1% on the long-horizon Odyssey benchmark (a 35% relative improvement) at an average $2.37/task (demo site, launch tweet).
  • Hyundai is reportedly demanding "tens of thousands" of Boston Dynamics robots ASAP as ex-employees say the board is frustrated by delays and competitors are closing in; a concrete signal that humanoid demand is moving from "interesting" to "we need quantity now."
  • Yann LeCun told graduates to ignore AI CEO hype and doomerism, calling 20% job-loss claims "ridiculously stupid," advising students to study long-shelf-life fields like physics or electrical engineering, and arguing AI will turn everyone into a "boss" managing agents while doom narratives are actively harming teens' mental health.
  • OpenAI launched ChatGPT for Clinicians, a free version for verified healthcare providers to help with daily tasks (analyzing medical data, drafting notes, research synthesis); a concrete go-to-market move into a vertical where the Mayo and Harvard medical-AI results are already shifting clinician sentiment.
  • Vercel open-sourced deepsec, a CLI-first AI security harness that runs pluggable coding agents in parallel sandboxes on your own infrastructure (with your keys) to find and validate vulnerabilities in large codebases with low false positives via revalidation; tested on major OSS repos (X announcement).
  • Adaptive launched Passport, a feature where your AI agent can sign up for new accounts on your behalf and acquire API keys, service credentials, and other things it needs without user intervention, across 61+ reviewed services including FRED Federal Reserve Economic Data (demo, follow-up).
  • Nvidia CEO Jensen Huang said AI is "creating an enormous number of jobs" and represents the U.S.'s best chance to re-industrialize, arguing automating tasks doesn't replace entire jobs and that displacement fears are exaggerated.
  • Entrepreneurs who flocked to Colorado are now being driven away by burdensome red tape and a proposed AI bill; in a related move, Colorado lawmakers introduced a replacement for their first-in-the-nation 2024 AI law, shifting to industry-friendlier rules targeting automated decision-making in employment, housing, and healthcare with consumer notification and shared developer/deployer liability, effective Jan. 1, 2027.
  • Ollama added native Claude Desktop integration: one command (ollama launch claude-desktop) brings Ollama Cloud models directly inside Claude Desktop and Claude Code with seamless model switching (launch tweet; 1,674 likes, 197 reposts).
  • JP Morgan unveiled "Ask David," a multi-agent investment research system shared by Adam Ghowiba: a supervisor agent orchestrates specialized subagents for retrieval, structured data, and analytics, with an LLM-as-judge reflection node and final human-in-the-loop approval (full talk video in thread; 6,352 likes, 618 reposts).
  • OpenRouter analyzed real GPT-5.5 usage and found OpenAI's 2× per-token price increase translated to actual 49–92% higher costs for switcher-cohort users versus GPT-5.4, partially offset by 19–34% fewer completion tokens on prompts above 10K tokens (shorter prompts saw little or negative verbosity benefit; tweet).
Advertisement

🍪 TOP TREATS TO TRY

  • Adaptive Passport lets your agent sign up for new accounts and acquire API keys, service credentials, and other requirements without you in the loop, covering 61+ services including FRED so the agent can build a continuously syncing economic-data model on its own, paid only rn (pricing not disclosed publicly).
  • Vercel deepsec is an open-source CLI security harness that runs pluggable coding agents in parallel sandboxes on your own infra (with your keys) to surface and validate vulnerabilities in large codebases with low false positives, free to try.
  • SprintiQ is an open-source agile platform built for AI coding agents that turns ideas into AI-generated user stories, plans sprints, and ships via bidirectional sync with Claude Code (single-user, self-hosted, Apache 2.0), free to try (Show HN).
  • claw is a single POSIX shell script that gives you a full LLM agent on any Linux box with streaming chat, shell tool calls, rolling memory, and mentor mode against OpenAI or Anthropic (zero dependencies beyond curl and jq), free to try (Show HN).
  • Open Design is a local-first, open-source alternative to Anthropic's Claude Design: 19 skills, 71 brand-grade design systems, generates web/desktop/mobile prototypes, slides, images, videos, and HyperFrames with sandboxed previews and HTML/PDF/PPTX/MP4 export, runs on Claude Code/Cursor/Codex/Gemini/Qwen/Hermes/Kimi CLI (demo, 350 likes, 37 reposts), free to try.
  • hfviewer lets you paste any Hugging Face model URL and explore the architecture, layers, and tensors at any granularity (example: Nemotron-3-Nano-Omni-30B; tweet, 2,582 likes, 296 reposts), free to try.
  • alphaXiv is a platform where you can ask or search anything across arXiv papers, discuss with authors, follow research, and see trending activity (recommended in today's "where do you get your AI news" Ask HN thread), free to try.

🏢 Big Tech & Major Companies

  • OpenAI considered an Alphabet-style spinout for its robotics and consumer-hardware divisions, intended to give them external-funding paths and a cleaner operating structure ahead of OpenAI's possible H2 2026 IPO at up to a $1T valuation; the plan was rejected when lawyers concluded the new entities might still need to remain consolidated on the parent's balance sheet.
  • The Brockman trial entered week two with Greg Brockman defending the personal journal entries Musk's attorneys read to the jury, including a September 2017 entry asking himself "Financially, what will take me to $1B?" and another describing OpenAI's public commitment to its nonprofit mission as "a lie"; Brockman testified he was given his stake in 2018, didn't participate in the board vote that granted it, and that the journal entries were stream-of-consciousness reflection rather than a plan (NBC, TheNextWeb, MLex).
  • SAP acquired Dremio to make SAP Business Data Cloud an Apache Iceberg-native lakehouse capable of combining SAP and non-SAP data for AI workloads (deal terms undisclosed; Q3 2026 close expected).
  • Google retired Vertex AI and launched the Gemini Enterprise Agent Platform at Cloud Next 2026, explicitly betting agents will replace apps (Build, Scale, Govern, Optimize stack); a major IT-stack shift for any enterprise running on Google Cloud.
  • Big Tech is back under Wall Street scrutiny for AI capex as analysts take a harder look at monetization vs. spend; enthusiasm for AI has powered the market, but profit pressure is rising.
  • Anthropic updated estimates for Claude Code spending to $13/day or $150-250/month per developer, signaling that AI-assisted coding economics keep getting more expensive amid concerns about "workslop" and limited real productivity gains.
  • Big Tech laid off 80,000+ workers in Q1 2026 and CEOs increasingly blamed AI (cited in 13% of cases overall, 25% in March); experts argue the real issue is companies were 25–75% overstaffed and AI is a convenient scapegoat for prior over-hiring during the rate-cut era.
  • GlobalFoundries introduced new co-packaged optics tech to reduce power and increase bandwidth in AI data centers, positioning the company as an underrated beneficiary as Goldman Sachs analysts call optical networking the next major AI category.
  • OpenAI's older "Governance of Superintelligence" post (originally May 2023, by Altman/Brockman/Sutskever) recirculated this week, arguing for an IAEA-style international oversight body for any AI work above a certain capability or compute threshold; given the timing, the recirculation reads ironically against the Brockman testimony.
  • Google's AI Overview is the central defendant in the MacIsaac case (covered in the lead).
  • Anthropic's Workload Identity Federation gives Claude Platform customers keyless auth (covered in the Top 5).
Advertisement

💼 AI Productivity, Labor & Economics

🤖 AI Agents & Infrastructure

  • Microsoft's Webwright sets a new SOTA on terminal-native web agents (covered in the Top 5).
  • Sakana AI's "The Conductor" is a 7B-parameter model specialized for orchestrating multiple agents entirely in natural language; it dynamically routes, combines, and critiques sub-agents, and achieves SOTA on GPQA-Diamond (tweet from Elvis).
  • Adaptive Passport — autonomous account signup for agents (covered in Honorable Mentions and Treats).
  • Web2BigTable is a bi-level multi-agent LLM system for internet-scale information search and extraction into structured tables (shared by @_akhaliq).
  • JP Morgan's "Ask David" is a production multi-agent investment research system with supervisor + specialized subagents + LLM-as-judge reflection + human-in-the-loop approval (covered in Honorable Mentions).
  • Pioneer AI by Fastino Labs is a closed-loop autonomous agent that runs the entire fine-tuning lifecycle for small open-source models (researching tasks, curating data, searching experiments, evaluating, deploying, and continuously adapting with built-in regression safeguards) (launch tweet).
  • Speculative decoding for RL post-training rollouts: alphaXiv shared a new paper "Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding": a draft model proposes multiple tokens verified by the policy model (with weight sync and optional online adaptation), giving ~1.5–1.8× rollout speedup at 8B scale and projected ~2.5× end-to-end at 235B while preserving the exact sampling distribution and learning trajectory (117 likes, 22 reposts).
Advertisement

💻 AI Coding & Developer Tools

  • Vercel deepsec (covered in Treats and Honorable Mentions).
  • SprintiQ — open-source agile platform for AI coding agents (covered in Treats).
  • claw — single POSIX shell-script LLM agent (covered in Treats).
  • Anthropic's Workload Identity Federation — keyless auth for the Claude API (covered in the Top 5).
  • Ollama × Claude Desktop integration — one-command launch (covered in Honorable Mentions).
  • OpenRouter's GPT-5.5 cost analysis — 2× sticker price → 49–92% real cost increase, partially offset on long prompts (covered in Honorable Mentions).
  • ProximalHQ benchmarked DeepSeek V4 Pro as the top open-source model on FrontierSWE (closely followed by Kimi K2.6) with noticeably fewer reward-hacking attempts; best@5 performance matched Gemini 3.1 Pro.
  • hfviewer — Hugging Face model architecture visualizer (covered in Treats).
  • AllBSides — Roland indexed every BSides conference talk uploaded to YouTube (8,643 talks from 5,927 speakers across 227 chapters in 68 countries; ~280 days combined runtime, ~60M words of transcripts) (Show HN).
  • Light Crime audio player — chrisallick built a Winamp-inspired native macOS audio player as a "cozier personal computing" alternative to Spotify and Apple Music (Show HN).
  • liteflow — kouhxp built a 1,000-line C program where an LLM can edit the running DAG mid-execution via only a four-verb grammar, with every mutation logged as an auditable event (Show HN).
  • Sebastian Raschka's LLM Architecture Gallery — visual reference collecting architecture diagrams + fact sheets (params, context length, attention type, KV cache size) for every major modern LLM with side-by-side comparisons (tweet).
  • Avi Chawla's AI Engineering Guidebook — 375+ pages, 10+ real-world projects, first-principles to production (tweet).
  • Hamzé argues Rust + category theory makes AI understandable by building tiny ML systems from first principles with typed transformations and composition (tutorial, 1,981 likes, 259 reposts).
  • Logan Thorneloe argues modern terminal UIs/TUIs remain inaccessible to roughly 15% of users with disabilities because they break screen readers; offers concrete fixes so terminal-based AI agents don't unintentionally exclude people.

🔬 AI Research & Models

Advertisement

🤖 Robotics & Embodied AI

🏛️ AI Policy, Governance & Safety

Advertisement

💡 Insights & Takes

  • Blackstone's Jon Gray sees AI creating a "huge boom" in blue-collar jobs as asset managers shell out billions on data centers and supporting infrastructure; QTS alone scaling from 10,000 to 40,000 job-site workers in one year is his concrete data point.
  • Yann LeCun's blunt advice for the AI age: ignore CEO hype/doom (vested interests), study long-shelf-life fields like physics or electrical engineering, dismiss 20% job-loss claims as "ridiculously stupid," recognize that AI will turn everyone into a "boss" managing agents, and remember doom narratives are harming teen mental health (covered above).
  • Nicolas Sauvage is betting on the boring parts of AI: inference hardware, energy storage, specialized robotics — investments he made since 2019 that VCs only got interested in over the last year as infrastructure bottlenecks finally became visible.
  • Sam Altman said he's "pretty excited for voice models to get great" and finds it interesting watching how people are starting to change the way they interface with AI.
  • Ethan Mollick argues that while the "jagged frontier" of individual AI use is now well-understood, multi-agent organizational workflows have unidentified jagged edges and failure modes; he suggests borrowing vocabulary and frameworks from management and organizational theory rather than just coding concepts.
  • John Coogan argues "neural computers" mark the shift to Software 3.0, where LLMs generate on-demand outputs and UIs directly from raw inputs (video, audio) without traditional software or dashboards in between.
  • wanye argues technology (especially dating apps and social media) is clearly making people less happy, and that pro-market/pro-tech advocates need to find deeper philosophical justifications beyond material outcomes or risk losing the argument to populists (1.7K likes).
  • Jack Morris notes it's still striking that despite all the recent progress, there are no production 1M-token context models actually in deployment.
  • The San Francisco Standard published a tourist's guide to AI in SF, walking through the places where AI is quietly remaking the city.
  • WBUR: The doctor is in — or is it AI? reports diagnostic AI tools are now routine in Massachusetts doctors' offices, from sharpened CT scans to generative chatbots analyzing medical data.
  • Brief commentary tweets (Guillermo Rauch) and additional brief commentary from @gabriel1, @testingcatalog, @typewriters, and @scottlincicome round out the day's takes (no major new threads or claims; included for completeness).

🛠️ AI Tools & Products (Niche)

📊 Fundraising & Deals Roundup

Advertisement

Previously, on The Neuron Daily ATH Digest (last week's reading):

That's a Wrap

Today was a "consequences of AI hallucinations" kind of day; both legal (the MacIsaac suit, the Chinese court ruling) and academic (the Nature retraction). On the product side, the agent-autonomy frontier kept moving: Adaptive Passport now signs your agent up for accounts, Microsoft's Webwright writes Playwright code on the fly, Vercel's deepsec runs security agents in parallel sandboxes, Google retired Vertex AI for the Gemini Enterprise Agent Platform, OpenAI shipped a free ChatGPT for Clinicians, and Anthropic killed long-lived API keys. Robotics demand crossed a threshold (Hyundai asking for "tens of thousands" of Boston Dynamics units; Roomba pioneer Colin Angle launching plush AI pets). In court, Brockman's diary did most of the talking. Tomorrow we'll see what the jury does with it.

Grant Harvey

Grant Harvey is the Lead Writer of The Neuron, where he continues to lead the publication's daily coverage of AI news, tools, and trends.

The Neuron Logo

Don't fall behind on AI. Get the AI trends & tools you need to know. Join 700,000+ professionals from top companies like Microsoft, Apple, Salesforce and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.