Compute - AI Infrastructure Intelligence Search

OpenAI Other 2026-06-25

OpenAI and Broadcom Unveil Jalapeno Inference ASIC, Reshaping AI Hardware Landscape

OpenAI, in collaboration with Broadcom, has developed Jalapeno, a custom LLM inference accelerator. The chip uses a multi-chip module with HBM3E memory and achieved tape-out in just nine months. Designed for OpenAI's model stack, it aims to reduce inference costs and dependency on NVIDIA GPUs, with initial deployment planned for late 2026.

NVIDIA Other 2026-06-24

NVIDIA and AWS Default GPU Vector Search with cuVS, G7 Instances Deliver 4.6x Inference

NVIDIA and AWS collaborate to embed cuVS as default GPU-accelerated vector search in OpenSearch Serverless, delivering 10x faster indexing at 1/4 cost. New EC2 G7 instances with RTX PRO 4500 Blackwell GPUs achieve up to 4.6x inference performance. AWS achieves GB300 Exemplar Cloud status for training.

ARM Other 2026-06-24

China's LineShine Tops TOP500: CPU-Only 2.2 ExaFLOPS with ARMv9 and HBM Memory

LineShine supercomputer achieves 2.198 ExaFLOPS FP64 sustained using 13.79 million ARMv9 cores across 20,480 nodes, making it the first system to exceed 2 ExaFLOPS without GPUs. Each node has dual LX2 CPUs (304 cores) with 32GB HBM, demonstrating a CPU+HBM architecture breakthrough for HPC.

Nokia Other 2026-06-24

Nokia, Amazon Web Services expand collaboration to deliver autonomous networks built for the AI era

...

NVIDIA Other 2026-06-23

NVIDIA Unveils 45°C Liquid Cooling for Rubin Chips, Slashes Water Use 100%

NVIDIA announces a liquid cooling system for its Rubin GPUs running 45°C coolant (hotter than a hot tub), using dry coolers in a closed loop to cut electricity and eliminate water evaporation (100% reduction). However, chillers may still be needed in hot climates, and chip longevity impacts remain unaddressed.

NVIDIA Other 2026-06-23

NVIDIA Launches Agent Toolkit: Nemotron Models, OpenShell Runtime for Specialized AI Agents

NVIDIA unveils Agent Toolkit, an open modular foundation with Nemotron models, NemoClaw blueprints, and OpenShell runtime, enabling enterprises to build secure, specialized AI agents. It targets life sciences, cybersecurity, and industrial workflows, aiming to turn frontier models into domain-specific digital coworkers.

NVIDIA Other 2026-06-23

NVIDIA Dominates TOP500 with Full-Stack Lock-in: Grace CPU, InfiniBand, and GPU Integration

NVIDIA powers 81% of TOP500 supercomputers, with Grace CPU adoption rising to 26 systems and Quantum InfiniBand connecting 376. The full-stack strategy (GPU+CPU+networking) shifts procurement from open components to single-vendor lock-in; top 8 Green500 systems use NVIDIA GPUs.

AMD Other 2026-06-23

AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline

AMD powers 4 of top 10 TOP500 supercomputers and previews MI430X GPU with >200 TFLOPS native FP64. This targets AI-for-science workloads, making double-precision compute a key metric for converged HPC-AI infrastructure, directly challenging NVIDIA and Intel.

Amazon Other 2026-06-23

AWS Lambda MicroVMs: Stateful Isolated Sandboxes via Firecracker Snapshots

AWS launches Lambda MicroVMs, leveraging Firecracker for VM-level isolation, near-instant launch/resume, and stateful execution. Users build images from Dockerfiles in S3, launch from pre-initialized snapshots, and suspend/resume automatically, enabling multi-tenant AI code sandboxes and interactive analytics.

ARM Other 2026-06-23

Arm servers capture >45% data center revenue, x86 ecosystem under AI-driven assault

IDC reports Q1 2026 global server revenue hit a record $122.6B, with Arm-based servers capturing >45% share (x86 at 52%). Accelerated servers (GPU/ASIC/FPGA) generated >70% revenue. Nvidia's Grace CPU (NVL72) and hyperscaler custom Arm chips drive the shift; x86 still leads in unit volume but faces supply constraints.

ASML Other 2026-06-23

ASML CEO Validates Musk's Terafab, Reshaping AI Chip Supply Chain

ASML's CEO publicly acknowledges tracking Elon Musk's planned terawatt-scale AI supercomputer Terafab, comparing it to Korean DRAM megaprojects. This signals that the sole EUV lithography supplier is allocating capacity, potentially transforming AI chip supply chain and vertical integration.

NVIDIA Other 2026-06-23

Nvidia Vera Rubin CPU: 10-Wide Core Redefines CPU for Agentic Computing

At GTC Taipei 2026, Nvidia unveiled the Vera Rubin CPU with a custom 10-wide fetch/decode/execute pipeline, claiming world-leading IPC and bandwidth. Designed for agentic computing, it complements Nvidia GPUs. Nvidia also announced a partnership with Microsoft to reinvent the PC as a Personal AI and committed to returning 50% of free cash flow to shareholders.

Intel Other 2026-06-23

Intel at Computex 2026: CPU as Agentic AI Orchestrator, x86 Reclaims Inference Control

At Computex 2026, Intel unveiled the 288-core Xeon 6+ (Intel 18A) and 3rd-gen Core Ultra, claiming Agentic AI shifts CPU:GPU ratio from 1:8 to 1:1. Partnering with SambaNova and Foxconn for rack-scale inference systems, Intel repositions the CPU as the orchestrator for multi-step AI reasoning, aiming to reclaim control from GPU-centric architectures.

Anthropic Other 2026-06-22

Micron-Anthropic Deal: Memory Co-Architecture Locks in AI Supply Chain

Micron and Anthropic sign a strategic agreement covering joint memory/storage architecture design, multi-year supply, Claude adoption, and investment. This ties frontier AI model demands directly to infrastructure design, aiming to optimize token economics and power efficiency, but essentially locks in supply and restructures the ecosystem.

NVIDIA Other 2026-06-22

Dell PowerEdge XE8812: Liquid-Cooled Density Trap with NVIDIA Vera Rubin NVL4

Dell launches PowerEdge XE8812 with NVIDIA Vera Rubin NVL4, delivering 144 GPUs per rack, 300kW+ power, and 100% direct liquid cooling. It offers a generational leap in memory and compute density for HPC and AI, but deeply locks users into Dell's PowerRack, iDRAC, and ORv3 ecosystem from chip to rack.

NVIDIA Other 2026-06-22

NVIDIA JUPITER Validates Grace Hopper: Exascale Science Goes Production

Europe's first exascale supercomputer JUPITER, powered by NVIDIA Grace Hopper Superchips and Quantum-X800 InfiniBand, achieves breakthroughs in brain mapping at cellular scale, 1km-resolution climate simulation, 6G AI, and 50-qubit quantum simulation, proving exascale is production-ready.

Hewlett Packard Enterprise Other 2026-06-22

HPE ProLiant DL394 Gen12 with NVIDIA Vera CPU: ARM Takes on x86 in AI

HPE unveils ProLiant DL394 Gen12 server powered by NVIDIA Vera CPU at Computex 2026, shipping fall 2026. Vera is NVIDIA's first datacenter CPU, in mass production, delivering 1.8x AI workload performance over x86. Early customers include OpenAI, Anthropic, xAI, and others. HPE continues GreenLake as-a-service while also offering Intel Xeon 6+ options.

Meta Other 2026-06-22

Arm's Self-Designed AGI CPU with Meta: Ecosystem Shift from Licensor to Silicon Vendor

Arm unveils its first self-designed data center CPU, the AGI CPU, with 136 cores on 3nm, purpose-built for agentic AI inference. Co-developed with Meta, which will deploy it across its data centers. Claims 2x rack performance over x86, reducing AI capex by $100B per gigawatt. Signals Arm's shift from IP licensing to direct silicon sales, reshaping ecosystem dynamics.

Apple Other 2026-06-22

Apple Expands Private Cloud Compute to Google Cloud with NVIDIA Confidential GPUs

Apple at WWDC 2026 expands Private Cloud Compute (PCC) to Google Cloud, leveraging NVIDIA GPU Confidential Computing for secure AI inference. This marks a strategic shift from Apple-owned data centers to third-party cloud, alongside M6 Neural Engine performance gains.

NVIDIA Other 2026-06-22

NVIDIA Launches Arm CPU: RTX Spark and Vera Shift AI Compute Control from x86

NVIDIA unveils RTX Spark Superchip for Windows PC (20 Arm cores, 6144 CUDA, 128GB LPDDR5X) and Vera data center CPU in million-volume production. Vera delivers 1.8x AI workload acceleration over x86. This marks NVIDIA's strategic entry into CPU market, consolidating control via unified Arm+GPU architecture.

Reports

Filter

OpenAI and Broadcom Unveil Jalapeno Inference ASIC, Reshaping AI Hardware Landscape

NVIDIA and AWS Default GPU Vector Search with cuVS, G7 Instances Deliver 4.6x Inference

China's LineShine Tops TOP500: CPU-Only 2.2 ExaFLOPS with ARMv9 and HBM Memory

Nokia, Amazon Web Services expand collaboration to deliver autonomous networks built for the AI era

NVIDIA Unveils 45°C Liquid Cooling for Rubin Chips, Slashes Water Use 100%

NVIDIA Launches Agent Toolkit: Nemotron Models, OpenShell Runtime for Specialized AI Agents

NVIDIA Dominates TOP500 with Full-Stack Lock-in: Grace CPU, InfiniBand, and GPU Integration

AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline

AWS Lambda MicroVMs: Stateful Isolated Sandboxes via Firecracker Snapshots

Arm servers capture >45% data center revenue, x86 ecosystem under AI-driven assault

ASML CEO Validates Musk's Terafab, Reshaping AI Chip Supply Chain

Nvidia Vera Rubin CPU: 10-Wide Core Redefines CPU for Agentic Computing

Intel at Computex 2026: CPU as Agentic AI Orchestrator, x86 Reclaims Inference Control

Micron-Anthropic Deal: Memory Co-Architecture Locks in AI Supply Chain

Dell PowerEdge XE8812: Liquid-Cooled Density Trap with NVIDIA Vera Rubin NVL4

NVIDIA JUPITER Validates Grace Hopper: Exascale Science Goes Production

HPE ProLiant DL394 Gen12 with NVIDIA Vera CPU: ARM Takes on x86 in AI

Arm's Self-Designed AGI CPU with Meta: Ecosystem Shift from Licensor to Silicon Vendor

Apple Expands Private Cloud Compute to Google Cloud with NVIDIA Confidential GPUs

NVIDIA Launches Arm CPU: RTX Spark and Vera Shift AI Compute Control from x86