Everything That Happened in AI This Week May 11-15 | The Neuron

Around the Horn Week in Review: Everything That Happened in AI This Week (May 11-15, 2026)

Anthropic refused China access to Mythos while the Pentagon used the model family for cyber defense; Cerebras and Cowboy Space pushed AI compute into new territory; Recursive Superintelligence raised $650M; Google confirmed criminal AI-driven zero-day exploitation; plus the week's best new tools.

Written By
Grant Harvey
Grant Harvey
May 17, 2026
16 minute read

China asked for Anthropic's newest model, Anthropic said no, and the same model family was already showing up in U.S. government cyber defense.

Welcome to the Around the Horn Week in Review, the one page you need to sound dangerously informed at work tomorrow. This week had the full AI news bingo card: a geopolitical model-access fight, compute companies trying to leave Earth, recursive self-improvement turning into a funded startup category, live AI cyber incidents, coding agents multiplying across every developer surface, and enough tiny open-source tools to make your terminal look like a group project. At this point, the safest dependency is probably a printed manual and a candle. Let's get into it.

Lead theme: model access Secondary theme: compute scarcity Tool theme: agent workflows

Around the Horn - Week of May 11, 2026

The biggest story this week was the model-access line turning into an actual geopolitical line. China's representatives reportedly approached Anthropic at a Singapore meeting to demand access to the company's newest model, and Anthropic refused. POLITICO framed Mythos as a China-summit flashpoint, which is the kind of sentence that would have sounded ridiculous back when AI launches were mostly benchmark charts and founder podcasts.

Then the context made it bigger: Reuters reported the Pentagon was deploying Anthropic's Mythos cybersecurity model to find and patch vulnerabilities across U.S. government systems, even as the department races to transition away from Anthropic. Add Anthropic's own 2028 AI leadership essay, which argued democracies should preserve a commanding AI lead over China through compute controls and anti-distillation measures, and the shape of the week becomes pretty clear.

The real story was not just "China wants the model." It was that frontier models are now being treated like strategic infrastructure: useful enough for government cyber defense, sensitive enough to deny to a rival state, and powerful enough that model access itself is becoming a diplomacy problem.

🏆 TOP 5 NEWS

Advertisement

Honorable Mentions

🍪 TOP TREATS TO TRY

  • Claude Code Agent View gives you one place to manage parallel Claude Code sessions, unblock agents, and jump between running tasks - available on paid Claude plans.
  • OpenAI Daybreak helps security teams use GPT-5.5 and Codex Security to identify threats, generate patches, and verify fixes - paid only rn.
  • Kimi WebBridge connects Kimi's desktop agent to your browser so it can research, click, fill forms, compare options, and complete web tasks locally - free to try.
  • Velo 2.0 turns a raw screen recording into a polished video and written doc, then lets you edit both by chat - free options.
  • Statewright constrains AI coding agents with visual state-machine guardrails, so tools unlock only when the workflow reaches the right phase (HN thread) - free plan, then $29/mo.
  • Needle distills Gemini tool calling into a 26M-parameter function-call model you can fine-tune locally for tiny on-device agent workflows (HN thread) - free / open source.
  • Semble gives coding agents natural-language codebase search that returns the right snippets instead of stuffing full files into context (HN thread) - free / open source.
  • AI Engineer Coach analyzes your local AI coding logs to spot prompt anti-patterns, track generated code, and turn repeated workflows into reusable skills (HN thread) - free / open source.
  • sx acts like a private npm-style package registry for AI assets, helping teams share skills, MCP configs (Model Context Protocol, the standard that lets agents plug into outside tools), commands, hooks, and agent workflows across tools (HN thread) - free / open source.
  • TinyPPO Snake runs a live neural-net Snake training demo in your browser, turning PPO (a reinforcement-learning method) and WebGPU (browser graphics-compute tech) into something you can actually watch learn (HN thread) - free to try.
Advertisement

🏢 Big Tech & Major Companies

💼 AI Productivity, Labor & Economics

Advertisement

🤖 AI Agents & Infrastructure

  • Thinking Machines Lab introduced Interaction Models for real-time multimodal collaboration across audio, video, and text.
  • Hyper launched as a self-driving company brain that picks up decisions in Slack, docs, and email, then feeds them as context into tools.
  • Devin relaunched as an autonomous AI software engineer with web app, terminal CLI, Linear/Jira ticket integration, embedded IDE, shell, browser workspace, DeepWiki, and Slack integration.
  • multica-ai/multica is an open-source managed-agents platform for assigning coding-agent tasks, tracking blockers, routing work through Squads, and turning repeated fixes into reusable team skills.
  • stablyai/orca is a desktop/mobile IDE for running fleets of parallel coding agents with isolated worktrees, multi-tab terminals, built-in git, GitHub PRs, SSH, and notifications.
  • OpenSquilla launched as a self-hostable Python agent runtime focused on cutting token costs with on-device routing, adaptive reasoning, memory, skills, and sandboxing.
  • HoneyHive unified observability and evaluation for production agents, including replay, live evals, human annotations, regression tests, and root-cause analysis.
  • Hermes Agent Computer Use lets Hermes drive your Mac desktop with clicks, typing, scrolling, dragging, screenshot handling, and model-agnostic backends.
  • BrowserCode is an open-source browser-native agent framework positioned as a new open baseline for browser agents.
  • holaOS gives agents a shared browser, files, apps, memory, and state so recurring research, content, or client-delivery work can persist across runs.
  • Mobbin MCP connects AI agents to more than 621,500 shipped product screens through MCP (Model Context Protocol, the standard that lets agents plug into outside tools) so they can reference proven app flows instead of guessing.
  • Liquid AI's voice-assistant cookbook maps spoken Home Assistant commands straight to function calls with a small audio model, skipping a separate speech-to-text pipeline.
  • MagicPath 2.0 is a multiplayer canvas where humans and agents like Codex or Claude Code can design and build functional prototypes together.
  • Orchard is an open-source agentic modeling framework whose sandbox powers recipes for software engineering, GUI agents, and Claw-Eval-style tasks.
  • Prime Intellect open-sourced renderers, a Python library that keeps long multi-turn agent training conversations stable and parseable.
  • Modal explained truly serverless GPUs, using cloud buffers, lazy image loading, and checkpointing to make AI inference feel less like traditional cluster management.
  • Hermes-agentmemory adds pull-model episodic memory to Hermes Agent with real deletes, synchronous writes, and trace logs showing what entered the prompt (HN thread) - free / open source.

💻 AI Coding & Developer Tools

  • OpenAI's Codex plugin for Claude Code lets developers run Codex inside Claude Code for code reviews, adversarial reviews, and background tasks with one install command.
  • OpenAI previewed Codex Mobile, bringing agentic coding to iOS and Android with voice input, live screen context, terminal, git, and PR workflows.
  • Codex added in-app browser testing, letting developers test apps at different viewports, capture screenshots, hide animations, and speed up evals.
  • Cursor added cloud dev environments for agents, with codebase cloning, dependency installs, credentials, multi-codebase support, version history, rollback, and audit trails.
  • GitHub teased an agent-native development environment integrated with the GitHub graph for code and meta-work like issue triage and PRs.
  • OpenCode kept growing as an open-source Claude Code alternative that runs in terminal, IDE, or desktop and supports 75+ model providers.
  • Warp open-sourced its agentic dev environment, with cloud agents managed by Oz and a fast-growing repo.
  • Kilo Code v7 runs parallel coding subagents on git worktrees inside VS Code with inline diff review and side-by-side comparisons across 500+ models.
  • Mechanize launched GBA Eval, a benchmark where frontier coding agents get 24 hours to build a complete Game Boy Advance emulator in WebAssembly (browser-friendly compiled code) from scratch.
  • CJ Zafir released deepseek-hermes-reasoning-traces, a 240M-token fine-tuning dataset created with Codex 5.5 as orchestrator and DeepSeek V4 Pro as executor.
  • LlamaIndex open-sourced liteparse-server, a local document-parsing backend that extracts text and exact bounding boxes from PDFs, Office files, images, and spreadsheets.
  • OpenClaw OS is an OSS Claude Coworker-style workspace framed as one screen for agentic software work.
  • OpenGravity is a zero-install, vanilla-JS, bring-your-own-key clone of Google Antigravity with a live terminal, local file sync, and sidebar agent.
  • E2a is an authenticated email gateway for AI agents with email-auth checks, signed delivery, webhooks, live message fan-out, a command-line tool, and developer kits.
  • Grunden offers GLM 5.1 inference through an OpenAI-compatible API hosted on NVIDIA H200 hardware in Sweden for EU-sensitive data.
  • Burn, baby, burn is a beautifully cursed bash one-liner that deliberately burns Claude Code or Codex tokens, mostly as a joke about usage dashboards (HN thread) - free / open source.
  • Epiq is a terminal-native issue tracker that stores work as an immutable event log and syncs through Git for codebase-local project flow (HN thread) - no pricing details.
  • Hopper brings AI agents to mainframe operations, helping teams navigate TN3270 terminals, inspect z/OS datasets (IBM mainframe files), write JCL job scripts, debug jobs, and pause for approvals (HN thread) - free hobby plan, enterprise custom.
  • Gigacatalyst embeds an AI builder inside SaaS products so sales, customer success, and customers can generate governed custom workflows on top of existing APIs (HN thread) - custom pricing.
Advertisement

🔬 AI Research & Models

  • Log analysis is necessary for credible evaluation of AI agents argued outcome-only benchmarks hide shortcuts, scaffold limits, and dangerous actions, with an airline-agent case study showing repeated-attempt performance was undercounted by nearly 50%.
  • A Single Neuron Is Sufficient to Bypass Safety Alignment argued that manipulating a tiny internal feature can bypass refusal behavior in aligned LLMs.
  • Hallucinations Undermine Trust; Metacognition is a Way Forward reframed hallucinations as confident errors and argued models need faithful uncertainty, not just more knowledge.
  • G-Zero introduced verifier-free self-play for open-ended LLM generation from zero data, using a proposer/generator loop instead of external judges.
  • Jina Embeddings v5 Omni released multimodal embeddings across text, image, audio, and video.
  • EgoMemReason tested memory reasoning over week-long egocentric videos, with the best model reaching only 39.6% accuracy.
  • minimal JEPA gives researchers small PyTorch implementations for experimenting with Yann LeCun's self-supervised world-model approach.
  • S-FLM is a hyperspherical flow language model that rotates token embeddings on a sphere instead of adding random noise.
  • δ-mem introduced an efficient online memory mechanism for LLMs using compact helper matrices instead of rewriting the whole model.
  • AsymFlow offered a faster one-step image-generation method that compresses part of the generation path to cut work.
  • InclusionAI open-sourced Ring-2.6-1T, a trillion-parameter thinking model with two reasoning gears and strong agent-execution results.
  • Datadog released Toto 2.0, an open-weights time-series forecasting model family from 4M to 2.5B parameters.
  • Pixal3D generated 3D assets directly in the input view's coordinate system instead of canonical space.
  • RigidFormer learned multi-object rigid-body contact dynamics from point clouds and reportedly scaled to 200+ objects at 23.9 FPS.

🏛️ AI Policy, Governance & Safety

Advertisement

🛠️ AI Tools & Products

  • Printing Press turns an API spec, website, or community project into a Go command-line tool, Claude Code skill, OpenClaw skill, and MCP server (Model Context Protocol server, a connector that lets agents use outside tools).
  • AtomicChat gives you local chat with Qwen, Kimi, LLaMA, DeepSeek, and other models running privately on your machine.
  • Krea 2 gives creators an in-house image foundation model for expressive images with style and moodboard control.
  • Perceptron Mk1 gives users frontier video understanding and embodied reasoning at lower reported cost, with access through OpenRouter and Perceptron's demo.
  • TinyFish runs lightweight local agents that scrape, summarize, and act on webpages in one click with no setup.
  • TrueShort makes original phone-native movies and series using small creative teams built around a showrunner, AI filmmaker, and editor.
  • Voker monitors production AI agents by classifying user intents, corrections, and resolutions, then surfacing where agents fail before customers complain.
  • Ponder turns raw footage into a polished rough cut from prompts like "make a 60-second highlight reel," then exports to Premiere, Final Cut, or DaVinci.
  • DramaBox gives creators cinematic text-to-speech with emotion, laughs, sighs, breaths, scene transitions, and voice references.
  • Tavus Image-to-Replica turns one photo, AI portrait, illustration, or mascot into a usable AI human with real-time streaming performance.
  • Papel turns research discovery into a personalized, social paper feed with on-device paper chat, quizzes, and community discussion (HN thread) - free waitlist.
  • Mello is a 3D-printed Spotify speaker for kids that runs on Raspberry Pi and lets parents control the library from their phone.
  • Halupedia is an encyclopedia of a fictional universe that does not exist until you visit it.
  • html-anything is an agentic HTML editor where local CLIs like Claude Code, Cursor, Codex, Gemini, or Copilot write production-ready HTML across skills and surfaces.
  • open-design is a local-first open-source alternative to Claude Design for prototypes, slides, images, video, and exportable assets.

🤖 Robotics & Embodied AI

Advertisement

📊 Fundraising & Deals Roundup

  • Amp - $1.3B for an alternative AI compute grid.
  • Wispr - in funding talks at a possible $2B valuation for Wispr Flow voice dictation.
  • Exaforce - $125M Series B at a $725M valuation to catch and stop cyberattacks in real time.
  • Jensen Huang's foundation - $108M worth of CoreWeave compute purchased and donated to universities and nonprofits.
  • Consensus - $30M led by GreatPoint Ventures for an AI operating system for researchers.
  • Basata - $24.5M total for AI that reads faxed healthcare referrals and helps schedule patients.
  • Pit - $16M seed led by a16z for an AI product-team-as-a-service startup from Voi founders.

🎙️ Interviews, Panels & Podcasts

Advertisement

💡 Industry Commentary & Analysis

Previous Around the Horn Digests

Catch up on everything you missed:

  • Tuesday, May 12: Anthropic refused China access to its newest model, Isomorphic raised $2.1B, Google pushed Gemini deeper into Android, and supply-chain attackers hit Mistral and TanStack.
  • Monday, May 11: Cerebras upsized its IPO, Cowboy Space raised money for orbital data centers, and Google confirmed the first criminal AI-discovered zero-day.
  • Weekend, May 9-10: Weekend roundup of the AI stories that piled up while everyone pretended to log off.
  • Thursday, May 7: Anthropic shipped Natural Language Autoencoders, Google DeepMind detailed AlphaEvolve's science work, and Cloudflare cut 20% of its workforce.
  • Wednesday, May 6: Anthropic ran Code with Claude SF, shipped developer updates, and the federal safety net looked very unready for AI job displacement.
  • Tuesday, May 5: OpenAI governance lore resurfaced, legal AI adoption kept climbing, and Harvey's usage metrics turned heads.
  • Monday, May 4: The White House considered pre-release AI vetting, Anthropic and OpenAI both linked up with private equity, and Mayo Clinic's AI spotted pancreatic cancer early.
Advertisement

That's a Wrap

That's 100+ stories, launches, papers, deals, and strange little tools from one very crowded AI week. If you made it to the bottom, congratulations: you can now explain the AI stack from orbital data centers to JavaScript-package malware to why someone built a token-burning command-line joke.

For the daily version, make sure you're subscribed to The Neuron. We send six issues a week, and yes, we read all of this so you don't have to.

See you tomorrow.

P.S: Know someone who'd find this useful? Forward this to them and tell them to subscribe here.


Grant Harvey

Grant Harvey is the Lead Writer of The Neuron, where he continues to lead the publication's daily coverage of AI news, tools, and trends.

The Neuron Logo

Don't fall behind on AI. Get the AI trends & tools you need to know. Join 700,000+ professionals from top companies like Microsoft, Apple, Salesforce and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.