Back to archive
Issue #9··22 min read·11 stories

$20 Billion & 2GW: The New AI Infrastructure Bar

LlamaFactory fine-tunes 100+ LLMs, first LLM company IPO, and SHAP for model explainability.

xAI is building a 2GW data center in Mississippi, a $20 billion bet on future compute demand that impacts infrastructure planning. Yesterday also saw the first LLM-focused company go public, providing market context for builders and founders. If you are fine-tuning, a new tool unifies efficient training for over 100 LLMs and VLMs.

NEWS
3 stories
2

Z.ai (Zhipu AI) lists on HKEX, first pure-play frontier LLM developer to go public

Z.ai, the international brand for Beijing Zhipu Huazhang Technology (commonly called Zhipu AI) and developer of the GLM family of LLMs, became the world's first large language model company to go public, debuting on the Hong Kong Stock Exchange. The company raised a $558m IPO, achieving a market capitalization of ~US$6.6b to US$6.8b (HK$51b to HK$52.8b) on its debut, focusing on open-weight LLMs primarily in China.

3

xAI's Grok Build: Local 'Vibe Coding' Agent Revealed

xAI is prepping "Grok Build," a new "vibe coding agent" that will launch as a local tool with a CLI. Early details show it integrates via an npm package and websockets, offering deep configuration for environments, repos, and GitHub. xAI is betting on secure, local execution and developer-centric workflows.

TECHNICAL
3 stories
1

Survey: Controllable generation in T2I diffusion (updated)

A survey reviews controllable generation with text-to-image (T2I) diffusion models, detailing theoretical foundations and how novel conditions integrate into the denoising process. Use its taxonomy to pick your control handle: structure (pose, depth), identity (subject), style, or edit-based control. The curated GitHub repository is the quickest 'map' of what to implement.

2

Build a minimal tool-using code agent loop in ~200 lines

This article reveals AI coding assistants are LLM-tool conversation loops, achievable in ~200 lines of Python. The orchestration loop is small, but production agents need sandboxing, tool permissioning, retrieval, and eval harnesses. It guides building an agent with `read_file`, `list_dir`, and `edit_file`, showing how the LLM dynamically uses tools based on prompts.

ANALYSIS
2 stories
2

Model Context Protocol (MCP) has sharp security edges

A new analysis argues that the Model Context Protocol (MCP) is a security liability and architectural mess. It points to incoherent toolboxes, opaque resource management. Recent reports and CVEs highlight issues like prompt hijacking, DNS rebinding risks, and filesystem containment bypass in some MCP tooling. Treat MCP servers as high-privilege surfaces.

TOOLS
3 stories
3

Benchmark LLM Strategic Reasoning with Poker

PokerBench is a new community benchmark with simulator and leaderboard that pits LLMs against each other in poker, running ~20,925 aggregated hands (heads-up) to test strategic reasoning. It provides a leaderboard comparing models like Gemini 3 Flash and Opus 4.5 on win rates, profit, and associated costs.