Filter

×
Active Filters Clear All
Keyword: Memory ×
80 Total Reports
4/4 Page
ARM Other High Signal 2026-03-27

Arm Expands into Silicon Products with First Self-Designed AGI CPU

Arm is expanding its compute platform into production silicon for the first time, launching the self-designed Arm AGI CPU for AI data centers and agentic workloads. It targets over 2x performance per rack versus x86 platforms and is backed by lead partner Meta, customers like OpenAI, and a broad OEM/ODM ecosystem.

NVIDIA Other High Signal 2026-03-26

NVIDIA Introduces Physical AI Data Factory Blueprint, Transforming Compute into Synthetic Data

At GTC, NVIDIA introduced the Physical AI Data Factory Blueprint, an open reference architecture designed to transform compute into large-scale, high-quality synthetic training data. Built on Cosmos world models and the OSMO operator, it addresses the bottleneck of scaling real-world data, aiming to serve as the data engine for next-gen autonomous systems and robots.

ARM Other High Signal 2026-03-25

ARM Launches AGI CPU for Agentic AI Infrastructure Era

ARM introduces the Arm AGI CPU, its first silicon product, designed for agentic AI infrastructure on Neoverse. Optimized for massively parallel workloads, it supports 272 cores per blade in a 1OU design, delivering 8160 cores per rack and over 2x performance vs. x86 systems.

ARM Other High Signal 2026-03-25

Arm Neoverse Reshapes Control Layer in AI Infrastructure

ARM introduces Neoverse infrastructure CPU cores optimized for cloud, AI, and HPC workloads, adopted by NVIDIA, AWS, Microsoft, and Google for their AI platforms, delivering performance gains and energy efficiency. This architecture enables high-density AI workload deployment in cloud and edge environments with enhanced multi-tenant security.

NVIDIA Other High Signal 2026-03-24

NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes Community

NVIDIA donated its GPU Dynamic Resource Allocation (DRA) driver to the CNCF, making it an upstream Kubernetes project. This move aims to shift the core control point of GPU orchestration from proprietary vendor layers to the open-source community, and drive standardization in collaboration with major cloud providers.

ARM Other High Signal 2026-03-24

ARM and NVIDIA Drive Localization Revolution in AI Workstations

ARM and NVIDIA jointly launch DGX Spark AI workstations based on GB10 Grace Blackwell chips, with eight major OEMs releasing products simultaneously. The solution features unified memory architecture supporting 200B parameter models locally, with third-party tests showing 41% faster rendering and 3.2x AI processing speed versus x86 alternatives, enabling seamless cloud-to-edge toolchain migration.

AMD Other High Signal 2026-03-18

AMD and NAVER Cloud Collaborate on Sovereign AI Infrastructure in Korea

AMD and NAVER Cloud announced a strategic collaboration to accelerate sovereign AI infrastructure in Korea. NAVER Cloud will expand deployment of AMD EPYC "Venice" CPUs and gain early access to next-gen Instinct MI455X GPUs, with joint optimization of AI services and software stacks on AMD platforms.

AMD Other High Signal 2026-03-18

AMD and Samsung Deepen Collaboration, Locking HBM4 Supply and Exploring Foundry Partnership

AMD and Samsung signed an MOU, designating Samsung as the primary HBM4 supplier for the next-gen Instinct MI455X GPU and collaborating on DDR5 memory optimized for 6th Gen EPYC CPUs. The companies will also explore opportunities for Samsung to provide foundry services for future AMD products.

NVIDIA Other 2026-03-17

Project Rheo: NVIDIA Shifts Robot Training Control from Real Hospitals to Simulation

NVIDIA unveils Project Rheo, a blueprint combining Isaac Sim, GR00T VLA models, and synthetic data generation for hospital robotics. Developers train Physical AI policies in digital twins—loco-manipulation (surgical tray pick-and-place) and precision bimanual tasks (trocar assembly)—with Cosmos Transfer 2.5 for cross-scene generalization.

NVIDIA Other 2026-03-13

NVIDIA Warp: Differentiable Physics Simulation for AI Training on GPU

NVIDIA Warp is a framework for GPU-accelerated, differentiable physics simulation. It enables writing high-performance kernels in Python, with automatic differentiation, and integrates with PyTorch/JAX. The 2D Navier-Stokes example demonstrates end-to-end optimization, reducing the cost of generating training data for physics AI.

NVIDIA Other Medium Signal 2026-03-04

NVIDIA Extends CUDA Tile Programming Model to Julia Language

NVIDIA introduces its CUDA Tile high-level GPU programming model to the Julia ecosystem via the cuTile.jl package. This move aims to lower the barrier to high-performance GPU kernel development by abstracting low-level thread and memory management with a tile-based data model, while maintaining high syntax and performance parity with the Python version.

Trend Micro Other High Signal 2026-03-03

Trend Micro Report Highlights AI Supply Chain Risks and Model Attack Surfaces

Trend Micro's 'Fault Lines in the AI Ecosystem' report systematically analyzes security risks in the AI supply chain, including training data poisoning, third-party plugin vulnerabilities, and model theft attacks. It indicates that enterprise AI security boundaries have expanded from traditional IT infrastructure to the model layer and data pipelines.

AMD Other Medium Signal 2026-03-01

AMD Launches Gaming PC Certification Framework to Strengthen Platform Strategy

AMD introduces Advantage Gaming Desktops certification program, requiring OEMs to adopt AMD's 3A platform combining processors, GPUs and software technologies. The program sets hardware performance standards including Ryzen 7/9 processors and Radeon RX 7000 GPUs, with integrated software optimization.

OpenAI Other High Signal 2026-01-29

OpenAI Integrates GPT-5 with Memory System for Large-Scale Data Reasoning

OpenAI has developed an in-house AI data agent that integrates GPT-5, Codex, and a memory system to reason over massive datasets and deliver reliable insights in minutes. This integration demonstrates OpenAI's strategic direction in enhancing AI reasoning capabilities and data processing efficiency.

NVIDIA Other 2026-01-23

NVFP4 + TeaCache Drive 10x FLUX.2 Inference Speedup, Locking Blackwell Ecosystem

NVIDIA and BFL optimize FLUX.2 on DGX B200/B300 using NVFP4 4-bit quantization, TeaCache step skipping, CUDA Graphs, and torch.compile, achieving 6.3x (single GPU) to 10.2x (dual GPU) latency reduction vs H200, with 40% memory savings. The stack is tightly coupled to TensorRT-LLM visualgen and Blackwell hardware.

NVIDIA Other 2025-11-08

NVIDIA Launches Interactive AI Agent for GPU-Accelerated Data Science with Nemotron Nano-9B

NVIDIA unveils an interactive AI agent powered by Nemotron Nano-9B-v2 and CUDA-X libraries, enabling natural language orchestration of ML workflows. It achieves 3x-43x GPU acceleration over CPU for data processing, model training, and hyperparameter optimization.

Apple Other 2025-11-06

Apple Reinforces On-Device AI and Spatial Computing Developer Ecosystem Through Success Stories and Tutorials

Apple highlights Swift Student Challenge winners who built apps using SwiftUI, Core ML (on-device ML framework), and spatial computing. It also announced new tutorials for the 2026 challenge focusing on SwiftUI, spatial computing, and machine learning. This underscores Apple's ongoing investment in on-device AI, immersive experiences, and its developer toolchain.

NVIDIA Other Medium Signal 2025-10-22

NVIDIA Publishes Tutorial for Converting Lightweight LLM into Terminal AI Agent

NVIDIA released a developer tutorial guiding users to build an AI agent that understands natural language and executes Bash commands, using its open-source Nemotron Nano v2 model within roughly 200 lines of Python code. The tutorial emphasizes building from scratch and simplifying with LangGraph, focusing on safe tool calling and human-in-the-loop control.

Trend Micro Other High Signal 2025-09-08

Trend Micro Highlights Power Automate as an Enterprise Automation Security Blind Spot

Trend Micro's research report reveals that the complexity of low-code automation tools like Microsoft Power Automate is being exploited by cybercriminals to evade detection and exfiltrate data. The study highlights critical security risks from visibility gaps within automation platforms and warns of growing demand for such attack capabilities in the cybercriminal underground.

Qualcomm Other High Signal 2024-03-12

Bosch and Qualcomm Deepen Collaboration to Consolidate ADAS and Cockpit Compute on Single SoC

Bosch and Qualcomm are expanding their strategic partnership to jointly develop production-ready ADAS solutions based on the Snapdragon Ride platform, and to consolidate cockpit and ADAS functions onto a single SoC using Snapdragon Ride Flex. This aims to provide automakers with a clear migration path from distributed to centralized compute architectures, reducing system complexity and cost.