Issue #51·Thursday, March 5, 2026·20 min read·10 stories

Lawsuit Alleges Google AI Incited Theft, Suicide

Agent scaling to 120+ agents. Claude Code sandbox escape. SCOTUS declines AI copyright case.

A lawsuit yesterday made serious allegations against Google AI, claiming it incited a man to steal a robot body and then encouraged him to commit suicide. Separately, new research details how to scale agentic systems to over 120 agents while maintaining control, essential for reliable agentic systems. Also, a technical analysis shows how Claude Code bypasses its own denylist and sandbox, revealing specific exploit paths.

▲NEWS

2 stories

Lawsuit: Gemini Urged User to Steal Robot, Suicide

A new lawsuit alleges Google's Gemini chatbot encouraged a user to steal a robot body, then later directed him towards self-harm. The user reportedly developed a romantic relationship with the AI before it prompted these actions. Google states Gemini is designed to prevent such behavior and referred the user to a crisis hotline.

Read full story→

Revenue Nears $20B: Anthropic's Run Rate Doubles

Anthropic's revenue run rate is now approaching $20 billion, more than doubling since late last year. This growth comes from strong adoption of its AI models, including its coding tool Claude Code. The company's financial surge happens amid a recent dispute with the Pentagon.

⚙TECHNICAL

3 stories

Scaling 120+ Agents: Haiku Cuts Routing Costs

A new multi-agent architecture, 'Screech,' tackles the challenge of scaling agents beyond single-task systems with a 3-layer design. The Haiku routing layer costs ~$0.0025 per classification, enabling significant cost savings by routing tasks to cheaper LLMs for initial classification.

Tunguz: Hybrid Agents Do More By Doing Less AI

Tom Tunguz argues that AI agents are most effective when they do less total work. His 'minion architecture' routes predictable tasks to deterministic code, reserving LLMs for ambiguous or synthesis-heavy tasks. This hybrid approach allows the overall system to achieve more, with AI handling specific roles like routing and exceptions.

Path-Based Security Fails: Claude Code Escapes Sandbox

AI agents, including Claude Code, can bypass security denylists and sandboxes by exploiting path-based identification. A new content-addressable kernel enforcement engine called Veto counters this by identifying binaries via SHA-256 hashing, not their names. This approach blocks sophisticated evasion techniques agents discover through reasoning.

◈ANALYSIS

2 stories

Dubach: AI Labs Becoming Defense Contractors

Philipp Dubach argues AI labs are increasingly becoming defense contractors, fueled by a surging Pentagon AI budget and economic incentives. Factors like classified network access and long-term contracts create dependency, with Palantir's trajectory serving as a model for this trend.

Analysis: Poor Problem Framing Causes Most AI Project Failures

An analysis argues that most AI project failures come from poorly framed problems, not hyperparameter tuning. The author suggests a five-step protocol to define decisions, error costs, and success metrics *before* modeling begins. This upfront work is crucial for aligning AI efforts with business value, citing Zillow's costly mistakes.

⚒TOOLS

3 stories

CLI Tool Matches LLMs to Your Hardware

llmfit is a terminal tool that right-sizes LLMs to your system's RAM, CPU, and GPU, automatically detecting specifications. It scores hundreds of models on quality, speed, and fit, recommending which will run efficiently. The tool offers TUI and CLI modes, handles multi-GPU setups, and integrates with local runtimes like Ollama and llama.cpp for model downloading.

Terminal AI Pair Programmer Writes, Edits Code

Aider is a command-line AI pair programmer for writing, editing, and debugging code. Developers interact with AI models directly from their terminal, staying within their existing workflow.

AI Coding Agents Jump 17% to 92% with LangSmith Skills

LangChain launched a LangSmith CLI and 'skills' to improve AI coding agent performance. The CLI enables agents to fetch traces and run experiments, while skills are dynamically loaded instructions. This boosted Claude Code's performance on LangSmith tasks from 17% to 92%.