infrastructure - AI Infrastructure Intelligence

Google Cloud Other 2026-06-16

Apple Rebuilds Siri with Google Gemini, Cuts Legacy Hardware Support

Apple rebuilds Siri using Google Gemini-derived capabilities, introducing five new AFM 3 foundation models (including a 20B-parameter multimodal on-device model). The move is paired with the sharpest hardware support cut, in watchOS 27, which limits support to S9/S10 chips. Together these signal a strategic shift from vertical integration to hybrid AI partnerships and accelerated hardware refresh cycles.

Google Other 2026-06-16

Google Open-Sources Brazos: Plug-and-Play Liquid Cooling for Air-Cooled DCs

Google introduces Brazos, a rack-mounted closed-loop liquid-to-air cooling system for existing air-cooled data centers. It supports 60 kW per rack and is open-sourced via OCP, enabling high-density AI/HPC deployments without facility retrofits.

CrowdStrike Other 2026-06-16

CrowdStrike's Continuous Identity for AI Agents: Real-Time Risk Engine Replaces Static Policies

CrowdStrike launches Continuous Identity for AI Agents, assigning cryptographically verifiable identities via SPIFFE and authorizing every agent action based on owner, caller, and device risk in real time. It eliminates standing privileges, integrates with Falcon AIDR for permission misuse detection, and extends the identity security control plane across human, non-human, and AI identities.
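The per-action authorization described above can be pictured with a minimal sketch. It assumes only the public SPIFFE ID format (`spiffe://<trust-domain>/<path>`); the `RiskContext` fields, the `authorize` function, the 0.5 threshold, and the "worst signal wins" rule are hypothetical illustrations, not CrowdStrike's actual engine or API.

```python
from dataclasses import dataclass
from urllib.parse import urlsplit


@dataclass(frozen=True)
class RiskContext:
    # Hypothetical risk scores in [0.0, 1.0]; higher means riskier.
    owner_risk: float
    caller_risk: float
    device_risk: float


def parse_spiffe_id(spiffe_id: str) -> tuple[str, str]:
    """Split a SPIFFE ID (spiffe://<trust-domain>/<path>) into its parts."""
    parts = urlsplit(spiffe_id)
    if parts.scheme != "spiffe" or not parts.netloc:
        raise ValueError(f"not a SPIFFE ID: {spiffe_id!r}")
    return parts.netloc, parts.path


def authorize(spiffe_id: str, ctx: RiskContext, max_risk: float = 0.5) -> bool:
    """Allow an action only if the worst of the three risk signals is acceptable.

    Assumption: a simple max() over the signals; a real engine would likely
    weigh them differently and re-evaluate on every action.
    """
    parse_spiffe_id(spiffe_id)  # raises ValueError if the identity is malformed
    worst = max(ctx.owner_risk, ctx.caller_risk, ctx.device_risk)
    return worst <= max_risk
```

Because the decision is recomputed per call from current risk signals, nothing in this sketch grants a standing privilege: the same agent identity can be allowed on one action and denied on the next.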

Cisco Other 2026-06-16

Cisco Security Portfolio Moves to AWS Marketplace: Ecosystem Lock-in Accelerates, Multi-Cloud Neutrality Questioned

Cisco announces availability of its full SaaS security portfolio (Duo, Secure Access, Identity Intelligence, Hybrid Mesh Firewall) on AWS Marketplace, with deep integration with Amazon Bedrock and SageMaker for AI security and zero-trust agent management. This move simplifies procurement and accelerates deployment but deepens AWS dependency, potentially sacrificing multi-cloud flexibility.

Cloudflare Other 2026-06-15

Cloudflare Announces Scheduled Maintenance and Global Infrastructure Expansion

...

Cisco Other 2026-06-15

Cisco G300: A Lock-in Play for AI Network Control Plane Dominance

Cisco launches the Silicon One G300 programmable AI networking chip for AI data centers and ML clusters. It extends Cisco's unified routing, switching, and AI acceleration architecture, but fundamentally aims to lock users into a proprietary control plane, countering open ecosystems from Broadcom and Nvidia.

AMD Other 2026-06-15

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

AMD acquires MEXT, an AI-driven memory optimization startup. MEXT's predictive technology makes NAND Flash behave like DRAM, expanding effective memory capacity for AI workloads and lowering TCO. The tech will be integrated across AMD's data center portfolio (EPYC, Instinct) to address memory bottlenecks in large models.

AMD Other 2026-06-15

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

AMD launches a suite of open-source, modular enterprise AI software components on Vultr Marketplace, including AMD Inference Microservices (AIMs), AI Workbench, Resource Manager, and Solution Blueprints. This aims to provide production-grade AI infrastructure without vendor lock-in, directly challenging NVIDIA's CUDA ecosystem.

NVIDIA Other 2026-06-15

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA's blog introduces World-Action Models (WAMs) as a paradigm shift from VLM-based VLAs. WAMs leverage pretrained video/world-model backbones to jointly predict future states and robot actions, aiming to bridge the language-to-action grounding gap. This could redefine robot foundation model training but raises concerns about inference cost and latency.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches the ExpertCenter Pro ET900N G3, built on the NVIDIA DGX Station GB300 architecture with the GB300 Grace Blackwell Ultra chip, 748GB of coherent memory, and 20 PFLOPS of AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack, including NemoClaw.

Research Other 2026-06-15

Z.ai GLM-5.2 Ships Usable 1M-Token Context, No Benchmarks, Two Thinking Levels

Z.ai releases GLM-5.2 with a claim of usable 1M-token context and two thinking-effort levels. No standard benchmarks are provided, raising concerns about real-world performance. The model targets replacing chunking-based RAG with native long-context reasoning.

MediaTek Other 2026-06-15

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Carmen Li is building a GPU pricing index and spot marketplace via Silicon Data and Compute Exchange, aiming to launch compute futures. Backed by DRW, this initiative targets GPU price volatility by standardizing compute trading, potentially creating a trillion-dollar asset class and transforming AI compute procurement.

Cloudflare Other 2026-06-15

Cloudflare Absorbs Ensemble AI: Architectural Model Compression Reshapes Edge Inference Economics

Cloudflare integrates key Ensemble AI talent, bringing NdLinear and NdLinear-LoRA—architectural model compression techniques that preserve multidimensional activations to reduce parameters and compute. This aims to slash inference costs on Workers AI, boost GPU utilization, and accelerate global edge AI deployment.

NVIDIA Other 2026-06-14

NVIDIA Partners SK Telecom for Gigawatt-Scale AI Cloud, Pushes DSX as Sovereign AI Factory Blueprint

SK Telecom plans to build a gigawatt-scale AI cloud in Korea using NVIDIA's DSX platform, with the first AI factory coming online in 2027. The platform integrates NVIDIA accelerated computing, systems, and software to support sovereign, physical, and agentic AI services, targeting expansion across Asia.

NVIDIA Other 2026-06-14

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor

NVIDIA and SK hynix have announced a multiyear partnership to co-develop next-generation custom memory for NVIDIA's AI factory ecosystem, including Vera Rubin supercomputers, Vera CPUs, RTX Spark PCs, and Jetson Thor robotic platforms. SK hynix will also use NVIDIA CUDA-X libraries and Omniverse to accelerate semiconductor design and build fab digital twins.

NVIDIA Other 2026-06-14

NVIDIA Vera CPU: Seizing the AI Agent Control Plane from x86

NVIDIA unveils the Vera CPU, purpose-built for AI agents, featuring 88 Olympus cores and 1.2TB/s of LPDDR5X memory bandwidth. NVIDIA claims 1.8x faster task completion than x86 and targets agentic AI workloads. Customers include Anthropic, OpenAI, and Oracle Cloud Infrastructure, signaling a shift of the AI control plane to NVIDIA's ecosystem.

NVIDIA Other 2026-06-13

NVIDIA GB300 NVL72 Delivers 20x Agentic Coding Efficiency, Setting New Inference Benchmark

NVIDIA's GB300 NVL72 achieves 20x more concurrent coding agents per megawatt than H200 on the new AA-AgentPerf benchmark, leveraging 72-GPU NVLink fabric, MXFP4 kernels, and MoE optimizations. This first standardized agentic inference benchmark redefines data center capacity planning for AI agents.

NVIDIA Other 2026-06-13

NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper

NVIDIA and Artificial Analysis unveil AgentPerf, the first benchmark for agentic AI workloads. Results show the GB300 NVL72 platform delivers up to 20x more concurrent agents per megawatt than the HGX H200 when running DeepSeek V4 Pro, using real coding agent trajectories to measure throughput and responsiveness.
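The agents-per-megawatt metric is a power-normalized ratio. A minimal sketch of the arithmetic follows; the function names are hypothetical, and the numbers in the usage note are placeholders chosen for easy arithmetic, not measured benchmark results.

```python
def agents_per_megawatt(concurrent_agents: int, power_kw: float) -> float:
    """Concurrent agents sustained per megawatt of power drawn."""
    if power_kw <= 0:
        raise ValueError("power_kw must be positive")
    # 1 MW = 1000 kW, so scale the agent count up to a full megawatt.
    return concurrent_agents * 1000.0 / power_kw


def efficiency_ratio(candidate: float, baseline: float) -> float:
    """How many times more agents per megawatt the candidate sustains."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return candidate / baseline
```

With placeholder inputs, a baseline sustaining 100 agents at 50 kW works out to 2,000 agents per megawatt, and a candidate sustaining 4,000 agents at 100 kW works out to 40,000, so `efficiency_ratio(40000.0, 2000.0)` gives 20.0. Normalizing by power rather than by GPU count is what makes the metric relevant to data center capacity planning.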

NVIDIA Other 2026-06-11

NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving

NVIDIA introduces Halos OS, a full-stack safety system comprising the ASIL D-certified Halos Core, a standardized Halos SDK, AI guardrails in Halos Applications, and a cloud-based Safety Evaluation Framework. Built on DRIVE Hyperion, it aims to embed safety into L4 robotaxis from the ground up.

Cisco Other 2026-06-11

Cisco Cloud Control: The Control Plane Shift to AI-Native Unified Infrastructure and Observability

Cisco unveils Cisco Cloud Control, a new operating model integrating Splunk for AI-native observability and agentic operations. By unifying network infrastructure, data fabric, and AI trust, it aims to reduce MTTR and costs—but also tightens vendor lock-in on both networking and monitoring.
