NVIDIA - AI Infrastructure Intelligence Search

Other Other 2026-07-12

WhiteFiber and DriveNets Achieve 111.2 Tbps Cross-DC AI Fabric, Breaking Power Constraints

WhiteFiber announces Project Redwood, which combines DriveNets' Ethernet AI fabric (FSE, VOQ, deep buffers), WEKA storage, and NVIDIA H200 GPUs to achieve 111.2 Tbps of bandwidth and 0.9 ms latency over 83 km of dark fiber, so that two geographically separated GPU clusters operate as a single logical supercluster. Commercialization is planned for Q3 2026.
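As a rough plausibility check on the reported figures, the sketch below estimates the propagation delay over 83 km of fiber and the amount of data in flight at 111.2 Tbps. It rests on assumptions not stated in the summary: a typical single-mode fiber group index of about 1.468, and the 0.9 ms figure being treated as the delay used for the bandwidth-delay product. The function and constant names are illustrative only.

```python
# Back-of-the-envelope check of the cross-DC link figures.
C_KM_PER_S = 299_792.458   # speed of light in vacuum, km/s
FIBER_INDEX = 1.468        # ASSUMPTION: typical single-mode fiber group index


def fiber_delay_ms(distance_km: float, index: float = FIBER_INDEX) -> float:
    """One-way propagation delay through fiber, in milliseconds."""
    return distance_km * index / C_KM_PER_S * 1e3


def in_flight_gb(bandwidth_bps: float, delay_s: float) -> float:
    """Bandwidth-delay product in gigabytes (10^9 bytes)."""
    return bandwidth_bps * delay_s / 8 / 1e9


one_way_ms = fiber_delay_ms(83)        # propagation only, one direction
round_trip_ms = 2 * one_way_ms         # propagation only, both directions
bdp_gb = in_flight_gb(111.2e12, 0.9e-3)

print(f"one-way propagation:  {one_way_ms:.3f} ms")
print(f"round-trip propagation: {round_trip_ms:.3f} ms")
print(f"data in flight at 111.2 Tbps over 0.9 ms: {bdp_gb:.2f} GB")
```

Under these assumptions, propagation alone accounts for most of a 0.9 ms round trip, and the volume of data in flight is in the multi-gigabyte range, which is consistent with the summary's emphasis on deep buffers.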

Microsoft Other 2026-07-12

Microsoft Takes Over OpenAI's Arctic Data Center, Seizing AI Compute Control

Microsoft leases a data center in Norway's Arctic Circle from Nscale, deploying 30,000 NVIDIA Vera Rubin GPUs, filling the gap left by OpenAI's retreat. OpenAI slashes its 2030 infrastructure budget from $140B to $60B. Microsoft surpasses OpenAI in AI compute capacity and gains geographical redundancy.

Meta Other 2026-07-12

Meta Invests $9.17B in Canada AI Data Center, Iris AI Chip Mass Production Begins MTIA Roadmap

Meta announced a $9.17B AI data center in Canada with 1GW capacity, and its first in-house AI chip, Iris, will enter mass production in September, kicking off the four-generation MTIA roadmap. Meta targets 14GW of compute by 2027, using 6-month chip iterations to challenge NVIDIA's annual cadence and reduce GPU dependency.

Apple Other 2026-07-10

PrismML's 1-bit Compression: 27B Qwen Model Runs Fully on iPhone 17 Pro in 4GB

PrismML compressed a 27B-parameter dense LLM (Qwen 3.6) to 4GB, running fully on iPhone 17 Pro. Using native 1-bit quantization (weights as {-1, +1}), it achieves >92% compression, 8x faster inference, and 75-80% energy reduction. This challenges Apple's sparse architecture, potentially shifting edge AI from cloud-reliant to device-native.
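The size figures above can be sanity-checked with a little arithmetic. The sketch below is illustrative only: the FP16 baseline is an assumption (the summary does not say what the >92% figure is measured against), and `model_size_gb` and `pack_signs` are hypothetical helper names, not PrismML's code.

```python
# Rough storage arithmetic for 1-bit weights, plus a toy sign-packing routine.
PARAMS = 27e9  # parameter count from the summary


def model_size_gb(params: float, bits_per_weight: float) -> float:
    """Raw weight storage in GB (10^9 bytes), ignoring scales and metadata."""
    return params * bits_per_weight / 8 / 1e9


def pack_signs(weights) -> bytes:
    """Pack {-1, +1} weights into bytes: bit set for +1, clear for -1."""
    out = bytearray((len(weights) + 7) // 8)
    for i, w in enumerate(weights):
        if w > 0:
            out[i // 8] |= 1 << (i % 8)
    return bytes(out)


fp16_gb = model_size_gb(PARAMS, 16)    # ASSUMED baseline: 16-bit weights
one_bit_gb = model_size_gb(PARAMS, 1)  # pure 1-bit weights, no overhead
reported_gb = 4.0                      # footprint quoted in the summary
compression = 1 - reported_gb / fp16_gb

print(f"FP16 baseline: {fp16_gb:.1f} GB")
print(f"1-bit weights: {one_bit_gb:.3f} GB")
print(f"compression vs FP16 at 4 GB: {compression:.1%}")
print(f"8 packed weights occupy {len(pack_signs([1, -1, 1, 1, -1, -1, -1, 1]))} byte")
```

If the baseline really is FP16, the raw 1-bit weights come in somewhat under the quoted 4GB, leaving room for scale factors and any layers kept at higher precision, and the 4GB footprint corresponds to a compression ratio just above 92%.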

Amazon Other 2026-07-10

AWS Sells Trainium 3 Externally, Challenging NVIDIA's AI Training Chip Dominance

AWS begins external sales of its Trainium 3 AI training chip, fabricated on TSMC 3nm process, delivering 2.52 PFLOPS per chip. Early customers include Anthropic and Uber. This move directly challenges NVIDIA's dominance and marks AWS's strategic shift from cloud provider to chip vendor.

AMD Other 2026-07-10

Towards Feature Complete Triton Support in JAX-Triton - ROCm Blogs

...

NVIDIA Other 2026-07-09

SambaNova Closes $1.1B Funding Round at $11B Valuation: A New Inference Chip Landscape Takes Shape

...

NVIDIA Other 2026-07-08

NVIDIA Rigel Core: Single-Threaded CPU as the New Control Plane for Agentic AI

NVIDIA unveils Rosa CPU architecture with custom Rigel core (Arm v9.2), targeting single-threaded performance for Agentic AI workloads, paired with Feynman GPU (1.6nm, 50 PFLOPS) in 2028. This shifts CPU design from core-count scaling to serial-latency optimization, directly challenging AMD EPYC and Intel Xeon dominance.

NVIDIA Other 2026-07-07

NVIDIA Vera CPU Adopted by Perplexity, OpenAI, Anthropic, and Oracle; AI Agent Benchmarks Confirm 1.5-1.9x Speedup

...

NVIDIA Other 2026-07-07

NVIDIA Vera CPU: Max Single-Threaded Performance at Scale for Agentic AI

NVIDIA launches Vera CPU, designed for maximum single-threaded performance at scale in agentic AI workloads. With Olympus cores delivering 1.8x the sustained per-core performance of x86, 1.2TB/s of LPDDR5X bandwidth, and 3.4TB/s of core-to-core bandwidth, Vera integrates into NVIDIA's unified AI factory architecture, aiming to lock users into its ecosystem.

NVIDIA Other 2026-07-07

AI Innovators Adopt NVIDIA Vera — Why Max Single-Threaded CPU at Scale Matters

...

Anthropic Other 2026-07-07

Anthropic Overtakes OpenAI in Enterprise AI Adoption for the First Time; 30 Billion Annualized Revenue Run Rate Confirmed

...

NVIDIA Other 2026-07-07

NVIDIA Denies Kyber NVL144 Delay, But 78-Layer PCB Bottleneck Exposes AI Hardware Physics Limit

NVIDIA officially denies reports of Kyber NVL144 rack delay to 2028, but SemiAnalysis revelations about a 78-layer ultra-high-density PCB midplane bottleneck and Rubin Ultra cancellation expose hard physical limits in signal integrity and manufacturing, opening a strategic window for AMD and Google.

Amazon Other 2026-07-06

AWS boosts Trainium 3 shipments, accelerating ASIC substitution for NVIDIA GPUs

Supply chain sources indicate that AWS has instructed vendors to increase Trainium 3 shipments for Q3 2026 by 20-30%. This signals strong confidence in its custom ASIC strategy to reduce dependence on NVIDIA GPUs, leveraging superior cost and power efficiency for cloud AI training.

NVIDIA Other 2026-07-06

NVIDIA Kyber NVL144 Delayed to 2028: Midplane PCB Manufacturing Becomes AI Scaling Bottleneck

SemiAnalysis reveals that NVIDIA's Kyber NVL144 has been delayed by more than 12 months, to 2028, due to manufacturing challenges with its 78-layer Orthogonal Backplane. The interim NVL72x2 solution has been cancelled due to operational burdens, and the 4-die Rubin Ultra has also been scrapped, leaving a product gap in NVIDIA's scaling roadmap.

Anthropic Other 2026-07-06

Anthropic Starts Custom AI Chip Development, Talks Samsung 2nm, Aims for Compute Independence

Anthropic has initiated its own AI chip development and is in talks with Samsung for 2nm foundry services. The move aims to reduce reliance on NVIDIA GPUs, optimize inference costs, and strengthen its technology moat ahead of a potential IPO. It joins OpenAI, Google, and others in the custom ASIC race, signaling a shift from software to hardware competition.

AMD Other 2026-07-06

AMD Unveils Zen 6/7 CPU and MI400/500 GPU Roadmap, Targets NVIDIA Rubin with HBM4 and 2nm

AMD unveiled its Zen 6/7 CPU and MI400/500 GPU roadmap at its 2026 Financial Analyst Day, featuring TSMC 2nm process and HBM4 memory. The MI400 series boasts 432GB memory, 19.6TB/s bandwidth, and 40 PFLOPs FP4 performance, directly targeting NVIDIA's Vera Rubin architecture with an annual cadence to disrupt the AI hardware monopoly.

Anthropic Other 2026-07-05

Anthropic Launches Custom AI Chip: Vertical Integration to Control Inference Cost and Supply

Anthropic launched Claude Sonnet 5 and revealed a custom AI chip initiative, using Samsung foundry. This move aims to reduce dependency on NVIDIA, control long-term inference costs, and marks Anthropic's shift from a pure software company to a vertically integrated infrastructure firm.

OpenAI Other 2026-07-05

OpenAI Ends Azure Exclusivity: Model Delivery Control Shifts from Microsoft to Multi-Cloud

OpenAI and Microsoft restructured their partnership in April 2026, ending exclusive Azure licensing and capacity commitments. OpenAI can now serve customers on any cloud; Microsoft retains a right of first refusal and a revenue share only on its own platform. The restructuring was driven by GPT-5.1's ~3 exaflops inference demand and FTC antitrust scrutiny.

NVIDIA Other 2026-07-04

NVIDIA Vera Rubin AI Platform Slated for July 2026 Shipments, Iterative Compute Upgrade

NVIDIA confirms its next-gen AI compute platform, Vera Rubin, will start shipping in July 2026 to major cloud providers like Microsoft and Google. The platform uses an advanced process node to boost AI training and inference performance, representing an iterative upgrade over Hopper and Blackwell without a fundamental architectural shift.
