NVIDIA - AI Infrastructure Intelligence Search

NVIDIA Other 2026-06-16

NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics

NVIDIA Blackwell dominates MLPerf Training 6.0, submitting across all seven benchmarks including MoE workloads. GB300 NVL72 delivers up to 1.6x faster training than GB200, with fifth-gen NVLink unifying 72 GPUs as one giant GPU. NVFP4 low-precision training and massive scale (8,192 GPUs) set new industry standards.

NVIDIA Other 2026-06-16

SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat

SiMa.ai launches open-source Palette Neat, an agentic development environment for Physical AI, paired with its sub-10W Modalix SoM. It uses natural language to abstract compute complexity, slashing dev cycles from months to days. Pin-compatible with NVIDIA SoM, it targets breaking the GPU ecosystem lock-in.

NVIDIA Other 2026-06-16

NVIDIA RTX Spark SoC Invades Windows PC: Arm CPU + GPU with 128GB Unified Memory Reshapes AI PC

At HPE Discover 2026, NVIDIA unveiled the RTX Spark SoC for Windows PCs, built on TSMC 3nm with a MediaTek-designed Arm CPU, 70B transistors, and up to 128GB unified memory. This marks NVIDIA's official entry into the PC SoC market, directly challenging Intel, AMD, and Qualcomm in the AI PC segment.

Microsoft Other 2026-06-16

Microsoft Work IQ Agent-First Platform Shifts Enterprise Integration Control from Developers to AI Runtime

Microsoft launched Work IQ, an agent-first enterprise platform replacing traditional app connections. AI agents dynamically discover data structures at runtime without manual coding. Alongside Copilot super app, Scout personal assistant, and Project Solara, Microsoft pivots to agent-centric architecture.

MediaTek Other 2026-06-16

MediaTek Doubles AI ASIC Target to $2B, Challenges Broadcom in Data Center Custom Silicon

MediaTek doubles its 2026 AI ASIC revenue target to $2B, leveraging Google hyperscaler deals and the NVIDIA RTX Spark chip (featuring MediaTek's N1X Arm CPU). It aims for 10-15% of the $70-80B custom AI chip market by 2027, directly challenging Broadcom's dominance.

NVIDIA Other 2026-06-16

HBM Bottleneck Reshapes AI Infrastructure: Asian Memory Makers Gain Leverage Over Nvidia

SK Hynix, Samsung, and Micron have crossed $1 trillion market cap as HBM becomes the hard limit in AI infrastructure. Asian suppliers now account for 90% of Nvidia's production costs, shifting the bottleneck from GPU compute to stacked memory and advanced packaging.

NVIDIA Other 2026-06-15

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA's blog introduces World-Action Models (WAMs) as a paradigm shift from VLM-based VLAs. WAMs leverage pretrained video/world-model backbones to jointly predict future states and robot actions, aiming to bridge the language-to-action grounding gap. This could redefine robot foundation model training but raises concerns about inference cost and latency.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches ExpertCenter Pro ET900N G3, built on NVIDIA DGX Station GB300 architecture with GB300 Grace Blackwell Ultra chip, 748GB coherent memory, and 20 PFLOPS AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack including NemoClaw.

MediaTek Other 2026-06-15

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Carmen Li is building a GPU pricing index and spot marketplace via Silicon Data and Compute Exchange, aiming to launch compute futures. Backed by DRW, this initiative targets GPU price volatility by standardizing compute trading, potentially creating a trillion-dollar asset class and transforming AI compute procurement.

NVIDIA Other 2026-06-14

NVIDIA Partners SK Telecom for Gigawatt-Scale AI Cloud, Pushes DSX as Sovereign AI Factory Blueprint

SK Telecom plans to build a gigawatt-scale AI cloud in Korea using NVIDIA's DSX platform, with first AI factory online in 2027. The platform integrates NVIDIA accelerated computing, systems, and software to support sovereign, physical, and agentic AI services, targeting expansion across Asia.

NVIDIA Other 2026-06-14

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor

NVIDIA and SK hynix have announced a multiyear partnership to co-develop next-generation custom memory for NVIDIA's AI factory ecosystem, including Vera Rubin supercomputers, Vera CPUs, RTX Spark PCs, and Jetson Thor robotic platforms. SK hynix will also use NVIDIA CUDA-X libraries and Omniverse to accelerate semiconductor design and build fab digital twins.

NVIDIA Other 2026-06-14

NVIDIA Vera CPU: Seizing the AI Agent Control Plane from x86

NVIDIA unveils Vera CPU, purpose-built for AI agents, featuring 88 Olympus cores and 1.2TB/s LPDDR5X memory. Claiming 1.8x faster task completion over x86, it targets agentic AI workloads. Customers include Anthropic, OpenAI, and Oracle Cloud Infrastructure, signaling a shift of the AI control plane to NVIDIA's ecosystem.

NVIDIA Other 2026-06-13

NVIDIA GB300 NVL72 Delivers 20x Agentic Coding Efficiency, Setting New Inference Benchmark

NVIDIA's GB300 NVL72 achieves 20x more concurrent coding agents per megawatt than H200 on the new AA-AgentPerf benchmark, leveraging 72-GPU NVLink fabric, MXFP4 kernels, and MoE optimizations. This first standardized agentic inference benchmark redefines data center capacity planning for AI agents.

NVIDIA Other 2026-06-13

NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper

NVIDIA and Artificial Analysis unveil AgentPerf, the first benchmark for agentic AI workloads. Results show the GB300 NVL72 platform delivers up to 20x more concurrent agents per megawatt than the HGX H200 when running DeepSeek V4 Pro, using real coding agent trajectories to measure throughput and responsiveness.

NVIDIA Other 2026-06-12

NVIDIA and SK Hynix Lock Down HBM4/5 Roadmap, Cementing Vera Rubin Supply Chain

NVIDIA and SK Hynix sign a multi-year agreement to co-define HBM4 production and HBM5 pre-research for Vera Rubin GPUs. Samsung also enters HBM4 supply as a second source. The deal elevates SK Hynix from vendor to co-developer, potentially creating a de facto memory standard barrier that marginalizes Micron and others.

Intel Other 2026-06-12

Google Awards 3M+ TPU Packaging Orders to Intel Foundry, Breaking TSMC's CoWoS Monopoly

Google has awarded Intel Foundry over 3 million units of next-gen TPU advanced packaging orders, leveraging Intel's EMIB technology with production starting in 2028. This marks Intel Foundry's largest external customer win and a pivotal shift in AI chip packaging away from TSMC's CoWoS monopoly.

AMD Other 2026-06-12

AMD Zen 6 Venice 256-Core EPYC Claims 3.3x Rack Performance Over NVIDIA Vera, But Estimates Raise Questions

AMD unveils first estimated performance of Zen 6 Venice EPYC (2nm, 256 cores), claiming 3.3x rack-level integer throughput over NVIDIA Vera at 100kW total power. A direct counter to NVIDIA's Arm push, but based on projected estimates, not silicon.

AMD Other 2026-06-12

AMD Backs All-Instinct GPU Cloud: TensorWave's $350M Series B Signals NVIDIA Ecosystem Breakout

TensorWave closes $350M Series B led by Magnetar and AMD Ventures at $1.55B valuation. The cloud is exclusively built on AMD Instinct GPUs (MI300X to MI455X), targeting memory-intensive AI workloads to offer a viable alternative to NVIDIA CUDA lock-in and validate ROCm software stack maturity in production.

Cisco Other 2026-06-11

Cisco AI Defense Policy Studio: Meta-Prompting Unwritten Policy into Auditable Guardrails

Cisco introduces AI Defense Policy Studio, an AI assistant that guides policy owners through authoring custom guardrails via a chat-and-review UI. It uses meta-prompting to translate informal guidance into human- and model-readable policy documents, directly deployable to Cisco AI Defense for runtime enforcement across models and applications.

NVIDIA Other 2026-06-11

NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving

NVIDIA introduces Halos OS, a full-stack safety system comprising ASIL D certified Halos Core, standardized Halos SDK, AI guardrails in Halos Applications, and cloud-based Safety Evaluation Framework. Built on DRIVE Hyperion, it aims to embed safety into L4 robotaxis from the ground up.

Reports

Filter

NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics

SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat

NVIDIA RTX Spark SoC Invades Windows PC: Arm CPU + GPU with 128GB Unified Memory Reshapes AI PC

Microsoft Work IQ Agent-First Platform Shifts Enterprise Integration Control from Developers to AI Runtime

MediaTek Doubles AI ASIC Target to $2B, Challenges Broadcom in Data Center Custom Silicon

HBM Bottleneck Reshapes AI Infrastructure: Asian Memory Makers Gain Leverage Over Nvidia

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

NVIDIA Partners SK Telecom for Gigawatt-Scale AI Cloud, Pushes DSX as Sovereign AI Factory Blueprint

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor

NVIDIA Vera CPU: Seizing the AI Agent Control Plane from x86

NVIDIA GB300 NVL72 Delivers 20x Agentic Coding Efficiency, Setting New Inference Benchmark

NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper

NVIDIA and SK Hynix Lock Down HBM4/5 Roadmap, Cementing Vera Rubin Supply Chain

Google Awards 3M+ TPU Packaging Orders to Intel Foundry, Breaking TSMC's CoWoS Monopoly

AMD Zen 6 Venice 256-Core EPYC Claims 3.3x Rack Performance Over NVIDIA Vera, But Estimates Raise Questions

AMD Backs All-Instinct GPU Cloud: TensorWave's $350M Series B Signals NVIDIA Ecosystem Breakout

Cisco AI Defense Policy Studio: Meta-Prompting Unwritten Policy into Auditable Guardrails

NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving