Back to archive
Issue #24··20 min read·10 stories

Microsoft-OpenAI Files Reveal Alliance Realities

AI agent wrote Linux malware. Plus, a new JS agent framework & when to skip vector DBs.

Internal documents yesterday revealed the realities of the Microsoft-OpenAI alliance, detailing their joint operations. Separately, an AI agent wrote the VoidLink cloud malware, demonstrating direct code generation for malicious ends. Builders also got a new open-source JavaScript agent framework, and many RAG implementations don't need a vector database.

NEWS
4 stories

DeepMind AI Teaches Atlas Humanoids Factory Tasks

Google DeepMind is integrating its Gemini Robotics AI with Boston Dynamics' Atlas humanoid robots. The goal is for Atlas to learn and generalize tasks, moving beyond rigid, task-specific programming. Hyundai's Savannah factory will deploy prototypes by early 2026, with full production slated for 2028.

Read full story
2

AI Agent Wrote VoidLink Cloud Malware in Under a Week

VoidLink, a Linux malware targeting cloud environments, was almost entirely generated by an AI agent. Check Point Research found a single individual used an AI assistant, Trae Solo, to develop 37 plugins and advanced operational-security features in under a week. This rapid development suggests AI can accelerate advanced offensive security tool creation, potentially by manipulating guardrails.

3

OpenAI's Compute Strategy Revealed: $250B Azure, New AWS Deals

Internal documents from the Microsoft-OpenAI alliance reveal OpenAI's commitment to $250 billion in Azure purchases, alongside new contracts with Amazon. The files detail Elon Musk's role in shifting OpenAI from AWS to Azure and the 'Project Watershed' that led to a $500 billion valuation. This reveals how a major AI platform diversifies its compute infrastructure, offering context for future cloud market dynamics and compute availability for builders.

4

Context Management, Plan Agent, TDD: Lessons for Building with Copilot

This article shares practical methods for using GitHub Copilot, including managing context windows to maintain focus and using the Plan agent for detailed requirement discovery. It also covers implementing test-driven development (TDD) with Copilot to improve code quality and the role of custom agents for specialized tasks.

TECHNICAL
2 stories
1

Dynamic Planning, File Buffering Tackle Agent Context Limits

Hightouch built a long-running AI agent harness for marketing automation, to overcome LLM context window and rigid framework limits. Their approach includes dynamic planning with "system tool calls" for self-correction, agent-controlled file buffering for context, and dynamic subagents for isolated problem-solving. They also fan out to smaller LLMs for cost-effective, reliable unstructured data classification, finding it more effective than embeddings for certain tasks.

2

Maintain AI-Generated Code with Structure, Type Hints

While AI generates Python code fast, it often lacks maintainability. To address this, establish foundational project structure, enforce strict type hints, and create specific AI guidelines in an `AGENTS.md` file. The article recommends detailed prompts, a planning step before generation, and comprehensive tests to ensure quality.

ANALYSIS
3 stories
1

Opinion: Why AI Needs Design

An article argues that human-centered UX design is an overlooked aspect of current AI development. While AI can quickly generate functional but mediocre outputs, it struggles to understand human needs and create distinctive products. Design expertise is crucial for translating AI capabilities into intuitive interfaces and for identifying the right products to build, a challenge the author highlights as more critical than merely knowing how to build them. The author advocates integrating designers into product strategy and exploring new interaction patterns beyond chatbots.

2

Analysis: World Models Train Agents via Simulated Experience

An analysis piece defines "world models" as an AI paradigm distinct from LLMs, focused on simulating environments for agents to learn causal dynamics. These models allow agents to predict future outcomes and experiment with cause and effect. They show promise for robotics and gaming, but also pose ethical and safety challenges like the "digital Plato's Cave" problem and reward hacking.

3

FT: ASML's EUV Monopoly Remains Unchallenged

An FT analysis reports ASML maintains an unbreakable monopoly on EUV lithography, the critical technology for advanced AI chip production. This dominance comes from unmatched technical complexity, including creating plasma hotter than the sun and atomic-precision optics, plus economic barriers like impossible R&D recovery and decades of required reliability data. The supply chain for cutting-edge AI hardware relies on a single, indispensable vendor.

TOOLS
2 stories
1

RAG: In-Memory Search with NumPy/SciKit-Learn Handles Millions of Docs

For small-to-medium RAG applications, dedicated vector databases are often overkill, argues this article. It demonstrates building an in-memory vector store with NumPy and SciKit-Learn, capable of querying millions of documents in milliseconds. The article includes runnable Python code, demonstrating how builders can achieve low-latency, cost-effective RAG retrieval for these use cases.

2

JS Agent Framework Routes 40 Models, Builds Graph Workflows

Mastra is an open-source TypeScript agent framework, offering model routing to over 40 providers, autonomous agents, and graph-based workflows. It includes built-in evals and observability, integrating with React, Next.js, and Node.js for production-ready AI applications.