NVLink - AI Infrastructure Intelligence Search

NVIDIA Other 2026-07-06

NVIDIA Kyber NVL144 Delayed to 2028: Midplane PCB Manufacturing Becomes AI Scaling Bottleneck

SemiAnalysis reveals NVIDIA's Kyber NVL144 delayed beyond 12 months to 2028 due to 78-layer Orthogonal Backplane manufacturing challenges. The interim NVL72x2 solution is cancelled due to operational burdens, and the 4-die Rubin Ultra is also scrapped, leaving a product gap in NVIDIA's scaling roadmap.

Anthropic Other 2026-06-30

Anthropic Claude Goes Exclusive on Azure, Microsoft Locks AI Model Distribution via GB300

Anthropic's Claude models are now generally available on Azure Foundry, powered by NVIDIA GB300 NVL72 clusters with over 4600 Blackwell Ultra GPUs. Initial models include Opus 4.8 and Haiku 4.5 with prompt caching and extended thinking. Microsoft gains exclusive enterprise distribution, strengthening its competitive position against AWS and Google Cloud.

NVIDIA Other 2026-06-23

NVIDIA Vera Rubin NVL4: CPU-GPU Fusion Locks Supercomputing Architecture

NVIDIA announces the Vera Rubin NVL4 supercomputing platform, integrating the Rubin GPU and Vera CPU via NVLink and InfiniBand for end-to-end acceleration, delivering over 7 exaflops of AI compute. The ARM-based Vera CPU marks a strategic deepening in data center CPUs, with availability expected in Q4 2026.

NVIDIA Other 2026-06-23

NVIDIA Vera Rubin NVL4: Custom ARM CPU and NVLink Converge to Dominate HPC+AI

NVIDIA unveils the Vera Rubin platform, integrating a custom Vera CPU (ARM) and Rubin GPU via NVLink and liquid cooling, delivering >7 exaflops AI and ~5 PF FP64. Targeting HPC+AI convergence at 144 GPUs per rack, it redefines the compute density standard, shipping Q4 2026.

Google Cloud Other 2026-06-17

ASUS Launches NVIDIA GB300 Deskside AI Supercomputer, Shifting Control from Cloud to On-Prem

ASUS launches the ExpertCenter Pro ET900N G3, powered by NVIDIA's GB300 Grace Blackwell Ultra Desktop Superchip, delivering 20 PFLOPS and 748GB of coherent memory for near-trillion parameter models. Concurrently, Coherent expands InP fab in Texas for optical interconnects, and NVIDIA plans a $20-25B debt offering, signaling a systemic shift of AI control from cloud to localized enterprise hardware.

NVIDIA Other 2026-06-17

NVIDIA & Coherent Expand 6-Inch InP Fab, Locking AI Optical Interconnect Supply Chain

Coherent breaks ground on the world's first 6-inch indium phosphide fab in Texas, backed by $2B from NVIDIA and multi-billion purchase commitments. The facility produces lasers, transceivers, and pluggable optics for silicon photonics interconnects, enabling NVIDIA's Vera Rubin Ultra NVL576 576-GPU clusters and signaling a mass shift from copper to optical backbones in AI data centers.

NVIDIA Other 2026-06-17

NVIDIA and Coherent Scale 6-Inch InP Fab, Optical Interconnect Becomes AI Infrastructure's New Bottleneck Breaker

NVIDIA invests $2B and commits multi-billion purchases to Coherent's expanded 6-inch indium phosphide fab in Texas, scaling production of lasers and optical modules for AI interconnects. This addresses copper's distance and power limitations in large GPU clusters (e.g., Vera Rubin Ultra NVL576), pushing co-packaged optics into volume manufacturing.

NVIDIA Other 2026-06-16

NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics

NVIDIA Blackwell dominates MLPerf Training 6.0, submitting across all seven benchmarks including MoE workloads. GB300 NVL72 delivers up to 1.6x faster training than GB200, with fifth-gen NVLink unifying 72 GPUs as one giant GPU. NVFP4 low-precision training and massive scale (8,192 GPUs) set new industry standards.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches ExpertCenter Pro ET900N G3, built on NVIDIA DGX Station GB300 architecture with GB300 Grace Blackwell Ultra chip, 748GB coherent memory, and 20 PFLOPS AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack including NemoClaw.

NVIDIA Other 2026-06-13

NVIDIA GB300 NVL72 Delivers 20x Agentic Coding Efficiency, Setting New Inference Benchmark

NVIDIA's GB300 NVL72 achieves 20x more concurrent coding agents per megawatt than H200 on the new AA-AgentPerf benchmark, leveraging 72-GPU NVLink fabric, MXFP4 kernels, and MoE optimizations. This first standardized agentic inference benchmark redefines data center capacity planning for AI agents.

NVIDIA Other 2026-06-01

NVIDIA Vera 88-Core Arm CPU: Control Plane Shifts from x86 to NVIDIA for AI Agent Workloads

NVIDIA unveils Vera, its first standalone datacenter CPU with 88 custom Arm Olympus cores, monolithic mesh, 1.2TB/s LPDDR5X bandwidth, achieving 1.8x x86 performance in agent workloads. Tightly coupled with GPUs via NVLink-C2C, Vera shifts the control plane from Intel/AMD to NVIDIA. First customers: OpenAI, Anthropic. Production Q3 2026.

NVIDIA Other 2026-06-01

NVIDIA FOX Blueprint Shifts Factory Control from PLCs to AI Agents on DGX

NVIDIA unveiled the Factory Operations Blueprint (FOX), a reference design for autonomous factory manager agents using NemoClaw, AI-Q Blueprint, and DGX Station (GB300 with 20 PFLOPS FP4, 748GB coherent memory). It unifies live machine signals, quality systems, and robot fleets under an AI decision layer. Foxconn, Pegatron, Advantech, and Wistron are early adopters, projecting 80% faster root cause analysis and 15% labor productivity gains.

NVIDIA Other 2026-06-01

NVIDIA RTX Spark: SoC Seizes PC Control, AI Compute Revolution with Ecosystem Lock-in

NVIDIA launches RTX Spark SoC, integrating Blackwell GPU with 20-core Grace CPU (MediaTek co-designed), NVLink-C2C at 600GB/s, up to 128GB unified memory, 1 petaflop FP4 AI, and local 120B-parameter LLM support. This marks a shift from GPU vendor to platform provider, directly challenging Apple M, Qualcomm, and x86 incumbents.

NVIDIA Product Launch 2026-05-29

NVIDIA Blackwell Ultra GB300 NVL72: 1.44 EFLOPS FP4, 50x AI Factory Boost

NVIDIA launches Blackwell Ultra GB300 NVL72 rack system with 72 Blackwell Ultra GPUs and 36 Grace CPUs, delivering 1,440 PFLOPS FP4 sparse, 20TB HBM3e, 130TB/s NVLink. Claims 50x AI factory output over Hopper. Available now.

NVIDIA Product Launch 2026-05-29

NVIDIA's Triple Play: Vera CPU, N1X Laptop Chip, and $6.5B Silicon Photonics Reshape AI Infra Control

NVIDIA delivers first agent-specific Vera CPU (88 Arm v9.2 cores, 1.2TB/s memory bandwidth), teases consumer N1X laptop chip, and invests $6.5B in silicon photonics. This shifts AI orchestration control from x86 to NVIDIA's Arm ecosystem, while CPO addresses memory wall, but volume production remains challenging until post-2028.

NVIDIA Other 2026-05-05

NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection

NVIDIA unveils an extreme co-design stack for agentic systems, featuring Vera Rubin NVL72, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X. By disaggregating inference, optimizing KV cache management, and deploying low-latency fabrics, it aims to break the throughput-interactivity tradeoff, making high-context token processing economically viable.

NVIDIA Partnership High Signal 2026-05-02

NVIDIA and Intel Announce $5 Billion Strategic Partnership: New AI Chip Supply Chain Landscape

NVIDIA and Intel announced a $5 billion strategic partnership on September 18, 2025: NVIDIA invests $5 billion for ~4% Intel stake, while Intel customizes x86 CPUs for NVIDIA AI infrastructure and x86 SoCs integrating RTX GPU chiplets for PC products. Through NVLink, the two companies form a coalition of 'AI Computing + NVIDIA CUDA + x86 Ecosystem'. This reshapes the AI chip supply chain landscape with far-reaching implications for AMD and independent chip designers.

NVIDIA Other High Signal 2026-04-30

NVIDIA Releases Enterprise AI Factory Reference Architectures, Standardizing On-Premises AI Infrastructure

NVIDIA has released Enterprise AI Factory Reference Architectures, offering three standardized configurations from RTX PRO to NVL72 for on-premises deployments. This architecture integrates compute, networking, storage, and software, aiming to transform AI infrastructure from experimental setups into predictable, scalable industrial operational platforms.

NVIDIA Other High Signal 2026-04-22

NVIDIA and Google Cloud Deepen Collaboration to Build Cloud Infrastructure for AI Factories and Physical AI

NVIDIA and Google Cloud have announced an expanded collaboration, introducing new Vera Rubin and Blackwell GPU-powered instances to build "AI factories" scaling to nearly a million GPUs. The integration of Gemini, Nemotron, and other platforms aims to accelerate production deployment of agentic and physical AI, such as robotics and digital twins.

NVIDIA Other Medium Signal 2026-03-31

NVIDIA Expands AI Ecosystem via NVLink Fusion

NVIDIA announces Marvell joining its AI ecosystem through NVLink Fusion technology, enabling more efficient AI computing interconnects. This collaboration enhances data transfer efficiency in large-scale AI training and inference scenarios.

Reports

Filter

NVIDIA Kyber NVL144 Delayed to 2028: Midplane PCB Manufacturing Becomes AI Scaling Bottleneck

Anthropic Claude Goes Exclusive on Azure, Microsoft Locks AI Model Distribution via GB300

NVIDIA Vera Rubin NVL4: CPU-GPU Fusion Locks Supercomputing Architecture

NVIDIA Vera Rubin NVL4: Custom ARM CPU and NVLink Converge to Dominate HPC+AI

ASUS Launches NVIDIA GB300 Deskside AI Supercomputer, Shifting Control from Cloud to On-Prem

NVIDIA & Coherent Expand 6-Inch InP Fab, Locking AI Optical Interconnect Supply Chain

NVIDIA and Coherent Scale 6-Inch InP Fab, Optical Interconnect Becomes AI Infrastructure's New Bottleneck Breaker

NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

NVIDIA GB300 NVL72 Delivers 20x Agentic Coding Efficiency, Setting New Inference Benchmark

NVIDIA Vera 88-Core Arm CPU: Control Plane Shifts from x86 to NVIDIA for AI Agent Workloads

NVIDIA FOX Blueprint Shifts Factory Control from PLCs to AI Agents on DGX

NVIDIA RTX Spark: SoC Seizes PC Control, AI Compute Revolution with Ecosystem Lock-in

NVIDIA Blackwell Ultra GB300 NVL72: 1.44 EFLOPS FP4, 50x AI Factory Boost

NVIDIA's Triple Play: Vera CPU, N1X Laptop Chip, and $6.5B Silicon Photonics Reshape AI Infra Control

NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection

NVIDIA and Intel Announce $5 Billion Strategic Partnership: New AI Chip Supply Chain Landscape

NVIDIA Releases Enterprise AI Factory Reference Architectures, Standardizing On-Premises AI Infrastructure

NVIDIA and Google Cloud Deepen Collaboration to Build Cloud Infrastructure for AI Factories and Physical AI

NVIDIA Expands AI Ecosystem via NVLink Fusion