GPU - AI Infrastructure Intelligence Search

Apple Other 2026-06-22

Apple Expands Private Cloud Compute to Google Cloud with NVIDIA Confidential GPUs

Apple at WWDC 2026 expands Private Cloud Compute (PCC) to Google Cloud, leveraging NVIDIA GPU Confidential Computing for secure AI inference. This marks a strategic shift from Apple-owned data centers to third-party cloud, alongside M6 Neural Engine performance gains.

NVIDIA Other 2026-06-22

NVIDIA Launches Arm CPU: RTX Spark and Vera Shift AI Compute Control from x86

NVIDIA unveils RTX Spark Superchip for Windows PC (20 Arm cores, 6144 CUDA, 128GB LPDDR5X) and Vera data center CPU in million-volume production. Vera delivers 1.8x AI workload acceleration over x86. This marks NVIDIA's strategic entry into CPU market, consolidating control via unified Arm+GPU architecture.

Intel Other 2026-06-22

Intel Launches Xeon 6+ with 288 Cores, Reclaims AI Control Plane

Intel unveils Xeon 6+ (288 E-cores, 576MB L3, 18A process), Ethernet 800 E835 controller (200GbE), and next-gen GPU Crescent Island at Computex 2026. Partnerships with SambaNova and Foxconn for rack-scale AI. Strategy: Xeon as the control plane for Agentic AI.

NVIDIA Other 2026-06-22

NVIDIA Rubin 100% Liquid Cooling at 45°C Slashes Cooling Energy 40%

NVIDIA Rubin generation achieves 100% liquid cooling with coolant up to 45°C, eliminating fans and cold aisles. The DSX reference design uses closed-loop dry coolers, reducing cooling energy ~40% and water consumption to near zero. Rack density triples, marking a fundamental shift in AI factory cooling.

Microsoft Azure Other 2026-06-21

Microsoft Azure Debuts Blackwell Ultra AI Supercomputer, Training-as-a-Service Reshapes Ecosystem

Microsoft Azure launched an AI supercomputer cluster powered by NVIDIA Blackwell Ultra GPUs, delivering over 200 exaflops of AI compute. It introduced AI Training as a Service for on-demand model training and partnered with OpenAI to deploy GPT-6 training clusters by 2027. Liquid cooling achieves a PUE of 1.08, positioning Azure as the premier cloud for trillion-parameter models.

Samsung Electronics Other 2026-06-21

Samsung 3nm GAA Yield Hits 80%, Lands Nvidia Order: TSMC Monopoly Challenged

Samsung Electronics announced its 3nm GAA process yield has exceeded 80%, securing orders from Nvidia for mid-range GPUs. This milestone marks the commercialization of Samsung's SF3 technology, aiming to reduce Nvidia's reliance on TSMC.

Fortinet Other 2026-06-19

Fortinet FortiAIGate with NVIDIA Shifts AI Security Control to GPU-Accelerated Inline

Fortinet launches FortiAIGate integrating NVIDIA Blackwell GPU and Dynamo inference framework for inline AI workload protection across data center, cloud, and edge. Promises ultra-low latency, multi-tenancy, and data sovereignty compliance.

Cisco Other 2026-06-18

Cisco Leverages NVIDIA Spectrum Silicon and Nexus One to Reshape AI Network Control Plane

Cisco launches N9100 switches with NVIDIA Spectrum-6/4 silicon, delivering 102.4T throughput. It also introduces Nexus One unified management plane spanning NX-OS and SONiC, and extends Hybrid Mesh Firewall to BlueField DPUs for AI workload security offload, aiming for a turnkey AI fabric control plane.

AMD Other 2026-06-18

AMD MEXT Acquisition Turns NAND Flash into DRAM-Class Memory, Halving AI Inference Cost

AMD acquires MEXT, whose technology makes cheap NAND flash behave like expensive DRAM, doubling to quadrupling usable memory capacity while halving costs. This targets inference and agentic AI memory bottlenecks. AMD also signs a 30MW AI compute deployment deal with Rackspace, rolling out from 2026 to 2028.

NVIDIA Other 2026-06-18

NVIDIA Acquires Kumo AI for $400M: Expanding from GPU Compute to Structured Data Prediction

NVIDIA acquires Kumo AI for over $400M, adding graph neural network and time series analysis for enterprise predictions like churn and inventory optimization. This extends NVIDIA from GPU compute into enterprise data intelligence, complementing HPE partnerships for AI factory solutions, Vera CPU architecture, and agentic AI validated designs.

Qualcomm Other 2026-06-18

Qualcomm Snapdragon Reality Elite: 160% NPU Boost, On-Device AI Redefines XR Chips

At AWE 2026, Qualcomm unveiled Snapdragon Reality Elite, its flagship XR chip with 60% GPU uplift and 160% NPU boost to 48 TOPS, enabling on-device LLM/VLM inference. The EVA vision engine reduces video pass-through latency by 10% and power by 33%. First device Xreal Aura runs Android XR, marking a new naming strategy and premium positioning.

AMD Other 2026-06-18

AMD Silently Drops TSME from Consumer Ryzen: Security Segmentation Locks Enterprise Users

AMD quietly removed Transparent Secure Memory Encryption (TSME) from consumer Zen 5 Ryzen CPUs, reserving it exclusively for Ryzen PRO series. The change, effective from AGESA 1.2.7.0, is hard to detect on Windows but visible on Linux. This security feature segmentation pushes enterprise buyers toward higher-priced PRO SKUs.

NVIDIA Other 2026-06-18

Nvidia ENPIRE: AI Agents Autonomously Train Robots to Install GPUs at 99% Success

Nvidia's ENPIRE framework enables AI coding agents (Codex, Claude Code) to autonomously write, test, and refine robot training code, achieving 99% pass@8 on GPU insertion and other contact-rich tasks. The system uses Git for collaboration, but token consumption scales faster than fleet size, and simulation-to-reality transfer remains imperfect.

AMD Other 2026-06-17

AMD Mustang Peak Threadripper: 144 cores, PCIe 6.0, TR6 socket – Power and memory challenges loom

AMD's Zen 6 Threadripper 'Mustang Peak' is confirmed with 2nm TSMC process, DDR5, PCIe 6.0, and a new TR6 socket. Using Powderhorn CCDs, it scales to 144 cores (288 threads) with clocks above 6 GHz. However, massive power draw and memory bandwidth demands (possibly requiring MRDIMM) raise platform cost concerns.

Amazon Other 2026-06-17

AWS Trainium Hits 80% MFU on World Models, Reshaping AI Training Economics

AWS claims its Trainium chip achieves 80% Model FLOP Utilization (MFU) on world model training, nearly double the industry average. With a general-purpose instruction set and sustained thermal performance, Trainium is attracting startups like Odyssey and DeCart AI, challenging Nvidia's dominance in AI training infrastructure.

NVIDIA Other 2026-06-17

NVIDIA RTX Remix 1.5: RTX IO Shrinks Game Sizes, AI Agents Reshape Modding

NVIDIA releases RTX Remix 1.5, featuring RTX IO compression that slashes Half-Life 2 RTX from 80GB to 50GB and reduces CPU overhead. The update also introduces AI agent integration via 'RTX Remix Skills,' allowing AI coding agents to automate complex modding tasks, lowering the barrier for non-programmers.

Google Cloud Other 2026-06-17

ASUS Launches NVIDIA GB300 Deskside AI Supercomputer, Shifting Control from Cloud to On-Prem

ASUS launches the ExpertCenter Pro ET900N G3, powered by NVIDIA's GB300 Grace Blackwell Ultra Desktop Superchip, delivering 20 PFLOPS and 748GB of coherent memory for near-trillion parameter models. Concurrently, Coherent expands InP fab in Texas for optical interconnects, and NVIDIA plans a $20-25B debt offering, signaling a systemic shift of AI control from cloud to localized enterprise hardware.

Google Cloud Other 2026-06-17

Google Cloud Embeds Legal Verifiability into AI Agents via SPIFFE and Kakunin

Google Cloud introduces SPIFFE-based Agent Identity for Gemini Enterprise and Vertex AI, then overlays Kakunin's compliance layer to map internal SPIFFE identifiers to X.509 certificates generated in AWS KMS, with all state changes committed to WORM audit logs. This converts secure cloud workloads into legally auditable market participants to meet EU AI Act and MiCA accountability mandates.

NVIDIA Other 2026-06-17

NVIDIA & Coherent Expand 6-Inch InP Fab, Locking AI Optical Interconnect Supply Chain

Coherent breaks ground on the world's first 6-inch indium phosphide fab in Texas, backed by $2B from NVIDIA and multi-billion purchase commitments. The facility produces lasers, transceivers, and pluggable optics for silicon photonics interconnects, enabling NVIDIA's Vera Rubin Ultra NVL576 576-GPU clusters and signaling a mass shift from copper to optical backbones in AI data centers.

Intel Other 2026-06-17

Intel Foundry Lands Google TPU Packaging Deal: EMIB-T Shakes TSMC's AI Chip Monopoly

Intel secures a multi-billion-dollar deal to package over 3 million Google TPUs using its advanced EMIB-T 2.5D packaging, while the chips themselves remain fabricated at TSMC. This marks Intel's strategic shift from CPU vendor to second-source AI packaging partner, targeting 2028 production. Intel's 18A node yields exceed expectations, but analysts caution the scope is limited to packaging.

Reports

Filter

Apple Expands Private Cloud Compute to Google Cloud with NVIDIA Confidential GPUs

NVIDIA Launches Arm CPU: RTX Spark and Vera Shift AI Compute Control from x86

Intel Launches Xeon 6+ with 288 Cores, Reclaims AI Control Plane

NVIDIA Rubin 100% Liquid Cooling at 45°C Slashes Cooling Energy 40%

Microsoft Azure Debuts Blackwell Ultra AI Supercomputer, Training-as-a-Service Reshapes Ecosystem

Samsung 3nm GAA Yield Hits 80%, Lands Nvidia Order: TSMC Monopoly Challenged

Fortinet FortiAIGate with NVIDIA Shifts AI Security Control to GPU-Accelerated Inline

Cisco Leverages NVIDIA Spectrum Silicon and Nexus One to Reshape AI Network Control Plane

AMD MEXT Acquisition Turns NAND Flash into DRAM-Class Memory, Halving AI Inference Cost

NVIDIA Acquires Kumo AI for $400M: Expanding from GPU Compute to Structured Data Prediction

Qualcomm Snapdragon Reality Elite: 160% NPU Boost, On-Device AI Redefines XR Chips

AMD Silently Drops TSME from Consumer Ryzen: Security Segmentation Locks Enterprise Users

Nvidia ENPIRE: AI Agents Autonomously Train Robots to Install GPUs at 99% Success

AMD Mustang Peak Threadripper: 144 cores, PCIe 6.0, TR6 socket – Power and memory challenges loom

AWS Trainium Hits 80% MFU on World Models, Reshaping AI Training Economics

NVIDIA RTX Remix 1.5: RTX IO Shrinks Game Sizes, AI Agents Reshape Modding

ASUS Launches NVIDIA GB300 Deskside AI Supercomputer, Shifting Control from Cloud to On-Prem

Google Cloud Embeds Legal Verifiability into AI Agents via SPIFFE and Kakunin

NVIDIA & Coherent Expand 6-Inch InP Fab, Locking AI Optical Interconnect Supply Chain

Intel Foundry Lands Google TPU Packaging Deal: EMIB-T Shakes TSMC's AI Chip Monopoly