Reports
AI-generated structured vendor updates
AMD MEXT Acquisition Turns NAND Flash into DRAM-Class Memory, Halving AI Inference Cost
AMD acquires MEXT, whose technology makes cheap NAND flash behave like expensive DRAM, doubling to quadrupling usable memory capacity while halving costs. This targets inference and agentic AI memory bottlenecks. AMD also signs a 30MW AI compute deployment deal with Rackspace, rolling out from 2026 to 2028.
Apple Bets on Intel 18A: Foundry Ecosystem Restructuring and Geopolitical Hedge
Apple partners with Intel for domestic chip production using Intel's **18A-P** (risk production) and future **14A** nodes. This is the strongest endorsement yet for Intel's foundry, as Apple diversifies away from TSMC amid capacity squeeze (Nvidia booking 60% of CoWoS) and Taiwan geopolitical risk.
ASML CEO's EUV Supply Warning Signals a Physical Ceiling on AI Chip Expansion
ASML CEO Fouquet confirms talks with Musk on Terafab but stresses supply constraints. EUV lithography, the sole tool for advanced AI chips, cannot scale quickly. With TSMC, Samsung, Intel, and Musk all vying for limited machines, AI chip capacity allocation becomes a zero-sum game, capping the entire AI infrastructure buildout.
Intel foundry获得Google超过300万颗TPU订单,2028年生产目标
...
Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI
Microsoft launches Agent 365 as a unified control plane for AI agents, integrating Entra, Defender, Purview, Intune, and cost management, alongside the Microsoft IQ semantic platform. While claiming model diversity and openness, this effectively locks enterprise AI assets into Microsoft's management toolchain, shifting control from model layer to infrastructure layer.
ASML, TSMC, imec Demo 300mm 2D-Material Transistors at 50nm Pitch
imec, ASML, and TSMC demonstrate the first 300mm wafer integration of MoS2/WS2/WSe2-based n and pFETs with 50nm contacted poly pitch (CPP) using single-patterning EUV lithography, achieving 94% operational yield. This lab-to-fab breakthrough paves the way for 2D channel materials to extend Moore's Law.
NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones
NVIDIA's blog introduces World-Action Models (WAMs) as a paradigm shift from VLM-based VLAs. WAMs leverage pretrained video/world-model backbones to jointly predict future states and robot actions, aiming to bridge the language-to-action grounding gap. This could redefine robot foundation model training but raises concerns about inference cost and latency.
Google Awards 3M+ TPU Packaging Orders to Intel Foundry, Breaking TSMC's CoWoS Monopoly
Google has awarded Intel Foundry over 3 million units of next-gen TPU advanced packaging orders, leveraging Intel's EMIB technology with production starting in 2028. This marks Intel Foundry's largest external customer win and a pivotal shift in AI chip packaging away from TSMC's CoWoS monopoly.
Microsoft Locks Enterprise AI Agent Control Plane via KPMG's Global Agent 365 Rollout
KPMG globally adopts Microsoft Agent 365 to govern AI agents and expands Copilot deployment. Agent 365 becomes the central orchestration layer within KPMG Workbench, coordinating agents across systems, data, and business processes. This embeds Microsoft's AI management plane into the world's largest consulting delivery network, creating vendor lock-in for enterprise AI agent lifecycle control.
NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration
NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.
Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure
At Build 2026, Microsoft announced mass production of Maia 200 AI inference chips, preview of Cobalt 200 ARM processors, and the MAI-Thinking-1 reasoning model (35B params). This signals a full-stack vertical integration to reduce NVIDIA dependency and lock Azure AI workloads.
Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud
At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.
Intel and SambaNova Rackscale AI: CPU Regains Inference Control Plane
At Computex 2026, Intel unveiled rack-scale AI infrastructure combining Xeon 6+ with SambaNova SN-50 RDUs, plus a fully disaggregated inference cloud (prefill on NVIDIA Blackwell, decode on RDUs) by Vector Core Compute. This aims to reposition the CPU as the central orchestrator for inference, challenging GPU dominance.
Microsoft Integrates GPT-5.5 Instant into M365 Copilot: Model Choice Becomes the New AI Control Plane
Microsoft integrates GPT-5.5 Instant into M365 Copilot, Copilot Studio, and Foundry, offering model choice between OpenAI and Anthropic Claude. This marks a shift from single-model lock-in to platform-level model orchestration and governance, moving the control point from model capability to routing and policy layers.
Microsoft Defines ‘Agentic Computing Era’, Positions AI Infrastructure and Agent Platform as Core Strategy
Microsoft's CEO, post-earnings, explicitly identifies the shift from end-user-driven workloads to those driven by both end-users and agents as a platform shift that will change the entire tech stack. The company's strategy is focused on building leading AI infrastructure and an agent platform, having already grown its AI business to a $37 billion annual run rate.
Microsoft Platforms AI Capabilities with IQ and Agent 365 to Drive 'Frontier' Enterprise Transformation
Microsoft CEO Judson Althoff outlines its 'Frontier Firm' vision, centered on platformizing AI with 'Microsoft IQ' for contextual intelligence and 'Agent 365' for agent observability and governance. Multiple large-scale customer cases demonstrate the evolution from mass Copilot deployment to autonomous AI agent development, emphasizing business growth through an open, model-diverse platform.
Microsoft Unveils Foundry Platform, Defining New Paradigm for Durable, Stateful AI Agents
Microsoft CEO Satya Nadella demonstrated durable, stateful AI agents built on the Foundry platform. The platform enables agents to run across time boundaries, orchestrate tools and models, and close the loop with evaluation and improvement over long-running workflows, marking a key evolution from conversational assistants to autonomous execution systems.
Microsoft Integrates GPT-5.5 into Enterprise Copilots, Advancing Multi-Model Workflow Orchestration
Microsoft announced the deployment of the GPT-5.5 model across GitHub Copilot, Microsoft 365 Copilot, Copilot Studio, and Foundry. The update emphasizes multi-model orchestration, enabling users to select different models for tasks (e.g., fast scaffolding, deep reasoning, execution, review) and introduces a 'Rubber Duck' agent for multi-model reflection loops.
Microsoft Launches Hosted AI Agent Infrastructure, Treating Agents as Independent Compute Entities
Microsoft introduces "Hosted agents" in its Foundry platform, providing each AI agent with an isolated, enterprise-grade sandbox featuring durable state, built-in identity, and governance. This move aims to standardize the runtime infrastructure for AI agents, lowering the barrier to enterprise deployment, though comments note it shifts the control point from the application layer to the infrastructure layer.
Microsoft Showcases AI Agent Application in Engineering via Azure Foundry and BEYON Platform
Microsoft's CEO showcased Beca's use of Azure, Foundry, and its BEYON platform to build an AI agent for the New Zealand Geotechnical Database. This allows engineers to query data via natural language, reducing data access time by 40%.