Filter

×
Active Filters Clear All
Keyword: AI Infra ×
81 Total Reports
3/5 Page
Cisco Other High Signal 2026-04-16

Cisco and NVIDIA Elevate Network to AI Media Processing Control Plane

Cisco and NVIDIA deepen collaboration with a validated design based on the open-standard Media Exchange Layer (MXL). This integration merges Cisco's IP media fabric with NVIDIA's Holoscan platform, transforming the network from a transport layer into an active processing layer that supports real-time AI inference, enabling low-latency, multilingual AI-driven live media production for broadcasters.

Microsoft Other High Signal 2026-04-16

Microsoft Activates Fairwater Hyperscale AI Datacenter Ahead of Schedule, Setting New Infrastructure Standard

Microsoft announced the early activation of its Fairwater datacenter in Wisconsin, positioned as the world's most powerful AI facility. It integrates hundreds of thousands of NVIDIA GB200 GPUs into a single seamless cluster via massive fiber interconnect, targeting unprecedented compute scale for next-generation AI training and inference workloads.

NVIDIA Other High Signal 2026-04-15

NVIDIA Shifts AI Infrastructure Metric from FLOPS to Cost Per Token

NVIDIA advocates for "cost per token" as the primary economic metric for AI infrastructure, replacing "FLOPS per dollar." This shift moves the focus from computational inputs to business outputs, requiring full-stack optimization across hardware, software, and networking to lower enterprise AI inference TCO.

Cisco Other High Signal 2026-04-14

Cisco Validates On-Premises AI Deployment Logic with Internal Case Study

Cisco's Customer Experience (CX) unit deployed on-premises AI infrastructure using UCS servers and Nexus switches to handle sensitive customer data, addressing cloud-related data sovereignty and unpredictable inferencing cost challenges. This move demonstrates an architectural shift from variable operational expenses to deterministic capital investment for AI workloads.

Intel Other High Signal 2026-04-09

Intel and Google Deepen Collaboration to Define Core of Heterogeneous AI Infrastructure

Intel and Google announced a multiyear collaboration to advance next-generation AI and cloud infrastructure. The core is reinforcing the central role of CPUs and custom IPUs in heterogeneous AI systems, optimizing performance and efficiency through multi-generational Xeon processors, and expanding co-development of ASIC-based IPUs to improve efficiency and predictable performance at hyperscale.

Intel Other High Signal 2026-04-09

Intel and Google Deepen Collaboration on CPU and IPU for Heterogeneous AI Infrastructure

Intel and Google announced a multi-year collaboration to advance next-generation AI and cloud infrastructure through aligned Xeon processor roadmaps and expanded co-development of custom ASIC-based IPUs. This reinforces the central role of CPUs in AI system orchestration and the critical value of IPUs in offloading infrastructure tasks to improve efficiency at hyperscale.

Cisco Other High Signal 2026-04-09

Cisco Demonstrates Unified S/NOC with Agentic AI for Autonomous Security Operations at MWC 2026

At MWC 2026, Cisco operated a unified Security and Network Operations Center (S/NOC), demonstrating seamless integration across its Security Cloud, XDR, and Splunk platforms. The core innovation was the use of a beta Agentic AI to generate "Instant Attack Storyboards" for triage and investigation, with automated workflows bridging incidents to Splunk Enterprise Security for deeper threat hunting.

Intel Other High Signal 2026-04-08

Intel and SambaNova Announce Heterogeneous Inference Architecture for Agentic AI

Intel and SambaNova have announced a collaborative blueprint for Agentic AI production workloads. The heterogeneous design combines GPUs, SambaNova RDUs, and Intel Xeon 6 processors to address performance, efficiency, and software compatibility issues, with availability expected in H2 2026.

Cisco Other Medium Signal 2026-04-08

Cisco Deepens Nutanix Partnership, Extending HCI to AI and Edge

Cisco announced multiple advancements in its partnership with Nutanix, focusing on integrating the Nutanix Cloud Platform into Cisco AI PODs, Cisco Unified Edge, and FlashStack. The goal is to provide a unified, validated blueprint and operational model for both AI and traditional workloads from core to edge.

ARM Other 2026-04-07

Arm Partners with Monash University Malaysia to Advance Semiconductor Talent for AI Era

Arm announced a collaboration with Monash University Malaysia's School of Engineering, donating IC design development boards and appointing an executive as a guest lecturer. The initiative aims to cultivate semiconductor talent with hands-on Arm architecture and modern system design experience for the AI era.

Microsoft Other High Signal 2026-04-06

Microsoft Partners with Domestic Operators to Build Sovereign AI Infrastructure in Japan

Microsoft announced a $10B investment in Japan over four years, with a key pillar being a collaboration with Sakura Internet and SoftBank. This partnership will offer GPU-based AI compute services through Azure, managed by domestic providers to ensure data residency within Japan. This addresses the demand for sovereign AI infrastructure for sensitive workloads.

Anthropic Other Medium Signal 2026-04-06

Anthropic Establishes Fourth APAC Office in Sydney, Explores Local Compute Capacity

Anthropic announced it will open its fourth Asia-Pacific office in Sydney, Australia, to serve the ANZ market. The company plans to deepen engagement with local institutions and explore expanding compute capacity in Australia via third-party partners to address enterprise data residency requirements.

Google Other High Signal 2026-04-03

Google Launches Gemma 4 Open Models, Targeting Edge Inference and AI Agent Architecture

Google introduces the Gemma 4 open model family, with four sizes from 2B to 31B parameters, emphasizing breakthrough intelligence-per-parameter and native support for agentic workflows, multimodality, and long context. The small models are engineered for edge devices, aiming to bring frontier reasoning to mobile and IoT scenarios.

Google Other Medium Signal 2026-04-03

Google Launches Gemma 4 Open Model Family

Google introduces Gemma 4 open model family with four size variants, optimized for edge and mobile devices. The series supports multimodal processing, long context windows and 140+ languages under Apache 2.0 license.

Cisco Other Medium Signal 2026-04-02

Cisco Launches Validated AI Infrastructure Solution

Cisco introduced validated AI infrastructure designs in collaboration with NVIDIA and Red Hat, offering pre-integrated AI POD solutions to address compatibility and security challenges in enterprise DIY AI infrastructure. The solution encompasses complete compute, networking, storage and AI software stacks with modular scalability.

AMD Other High Signal 2026-04-02

AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities

AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.

ARM Other High Signal 2026-04-01

ARM Launches AGI CPU Silicon, Extends AI Infrastructure Reach

ARM debuts its first self-designed AGI CPU silicon, moving beyond IP licensing to offer full-stack solutions from custom silicon to integrated platforms. This shift redefines control points in AI infrastructure supply chains, enabling enterprises to optimize AI workload deployment at hardware layer.

Intel Other Medium Signal 2026-04-01

Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference

Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly in handling large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can process 120B parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.

NVIDIA Other High Signal 2026-03-31

NVIDIA Collaborates with Energy Leaders to Position AI Factories as Smart Grid Assets

NVIDIA, in collaboration with Emerald AI, proposes treating large-scale AI data centers (AI factories) as flexible, intelligent grid assets rather than static power loads. This architecture integrates accelerated computing, power networking, and control to enhance grid reliability and optimize energy efficiency. Several major energy companies plan to collaborate on this architecture to support AI workloads and accelerate power connection.

NVIDIA Other High Signal 2026-03-31

NVIDIA Collaborates with Energy Leaders on AI Factory-Grid Integration Architecture

NVIDIA and Emerald AI introduced a new architecture treating AI factories as intelligent grid assets, combining accelerated computing, real-time energy orchestration and reference designs. The Vera Rubin DSX-based approach enables dynamic grid response and has gained support from multiple energy providers.