Graid Technology Launches Agentic AI Storage Portfolio to Eliminate KV Cache Bottlenecks

Graid Technology Inc.

From edge inference to NVIDIA STX, purpose-built KV cache infrastructure for consistent performance at scale.

SUNNYVALE, CA / ACCESS Newswire / April 21, 2026 / Graid Technology, the pioneer in GPU-accelerated NVMe storage, today announced its Agentic AI Storage Portfolio: a purpose-built family of KV cache solutions designed to eliminate the storage bottleneck that stalls "always-on" production AI. The portfolio spans three deployment tiers: KV Cache Server, KV Cache Rack, and KV Cache Platform, all built on SupremeRAID technology. KV Cache Platform, the portfolio's highest tier, is purpose-aligned to NVIDIA's STX reference architecture, with native BlueField-4 DPU execution on the roadmap for H2 2026.

As agentic AI moves from experimentation to production, the infrastructure assumptions that underpinned single-shot inference have broken down. Models that run continuous multi-step tasks and maintain context across hours of operation generate KV cache demands that overwhelm GPU HBM. The result: latency spikes of up to 18x, GPU utilization as low as 50%, and model-level failures, including hallucinations and reasoning degradation, that are difficult to detect and costly to recover from.
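To make the HBM pressure concrete, the following back-of-envelope sketch estimates KV cache size using the usual transformer accounting: one key vector and one value vector per layer, per KV head, per token. The model dimensions, context length, session count, and HBM capacity are hypothetical illustration values, not figures from Graid Technology or NVIDIA.

```python
GIB = 2**30


def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   batch_size=1, bytes_per_elem=2):
    """Estimate KV cache size in bytes for a decoder-only transformer.

    The leading factor of 2 accounts for storing both keys and values.
    bytes_per_elem=2 assumes 16-bit (FP16/BF16) cache entries.
    """
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_elem)


# Hypothetical model shape and workload (assumptions for illustration only).
LAYERS, KV_HEADS, HEAD_DIM = 80, 8, 128
CONTEXT_TOKENS = 128 * 1024   # one long-running agent session
SESSIONS = 8                  # concurrent sessions on one GPU
ASSUMED_HBM_BYTES = 80 * 10**9  # assumed 80 GB of HBM per GPU

one_session = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, CONTEXT_TOKENS)
all_sessions = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, CONTEXT_TOKENS,
                              batch_size=SESSIONS)

print(f"one session:  {one_session / GIB:.0f} GiB")
print(f"{SESSIONS} sessions:   {all_sessions / GIB:.0f} GiB")
print(f"exceeds assumed HBM: {all_sessions > ASSUMED_HBM_BYTES}")
```

Under these assumed values, a single 128K-token session needs about 40 GiB of cache and eight concurrent sessions need about 320 GiB, well beyond the assumed 80 GB of HBM. That gap is the kind of overflow a storage-backed KV cache tier is meant to absorb.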

SupremeRAID addresses this directly, aggregating up to 32 NVMe drives into a single 280 GB/s virtual pool, bypassing the CPU via GPUDirect Storage, and delivering KV cache reads at 1.3 ms, 77x faster than standard NVMe. The three portfolio tiers bring this capability to every deployment scale:

KV Cache Server - single-node NVMe acceleration for individual inference servers and edge AI deployments. Available now.

KV Cache Rack - rack-scale, partner-validated solutions co-engineered with leading server OEM partners for enterprise multi-GPU clusters. Available now.

KV Cache Platform - purpose-built for NVIDIA's STX reference architecture, with native BlueField-4 DPU execution and rack-scale storage expansion on the roadmap.
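As a rough sanity check on the SupremeRAID figures quoted above (32 drives, 280 GB/s, 1.3 ms, 77x), the sketch below derives what they imply per drive and for the baseline. It is simple arithmetic on the stated numbers only. The release does not say how the figures were measured, and the 40 GB payload is a hypothetical value chosen for illustration.

```python
# Figures quoted in the release.
POOL_BANDWIDTH_GB_S = 280.0   # aggregate virtual pool bandwidth
DRIVE_COUNT = 32              # maximum NVMe drives in the pool
KV_READ_LATENCY_MS = 1.3      # stated KV cache read latency
SPEEDUP_VS_NVME = 77          # stated speedup over "standard NVMe"


def per_drive_bandwidth(pool_gb_s, drives):
    """Average bandwidth each drive must contribute to reach the pool figure."""
    return pool_gb_s / drives


def implied_baseline_latency_ms(latency_ms, speedup):
    """Baseline latency implied by the stated latency and speedup."""
    return latency_ms * speedup


def transfer_time_s(size_gb, bandwidth_gb_s):
    """Ideal time to stream a payload at full pool bandwidth (no overheads)."""
    return size_gb / bandwidth_gb_s


HYPOTHETICAL_CACHE_GB = 40.0  # assumption, not from the release

print(f"per-drive share: {per_drive_bandwidth(POOL_BANDWIDTH_GB_S, DRIVE_COUNT):.2f} GB/s")
print(f"implied baseline read: {implied_baseline_latency_ms(KV_READ_LATENCY_MS, SPEEDUP_VS_NVME):.0f} ms")
print(f"ideal reload of {HYPOTHETICAL_CACHE_GB:.0f} GB: "
      f"{transfer_time_s(HYPOTHETICAL_CACHE_GB, POOL_BANDWIDTH_GB_S) * 1000:.0f} ms")
```

The stated numbers work out to an average of 8.75 GB/s per drive and an implied baseline read of roughly 100 ms. Both are peak-style figures, so sustained results would depend on drive model, RAID level, and access pattern.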

"A year ago, at GTC 2025, Jensen Huang predicted that storage would become GPU-accelerated for the first time. This year, NVIDIA turned that concept into an architecture with STX and CMX," said Leander Yu, CEO of Graid Technology. "Our KV Cache Portfolio is built for precisely this moment, delivering the storage performance that agentic AI demands, at storage-tier economics."

For enterprises and infrastructure teams evaluating agentic AI deployments, the full deployment architecture, technical specifications, and NVIDIA STX compatibility details are available in the solution brief: Graid Technology Agentic AI Storage Portfolio: Purpose-built KV Cache Solutions for Inference at Scale.

To learn more about Graid Technology's AI offerings, visit graidtech.com/ai.

Media Inquiries:
Andrea Eaken, Sr. Director of Marketing, Americas & EMEA
[email protected]

____________________________________

About Graid Technology
Graid Technology is building the storage backbone for the future of AI, enterprise, and high-performance computing. As the creator of SupremeRAID, the world's first and only GPU-based RAID, and the global steward of Intel® Virtual RAID on CPU (Intel® VROC), Graid Technology delivers flexible RAID solutions that maximize NVMe performance while ensuring resilient, scalable data protection for modern data infrastructure. Headquartered in Silicon Valley with global operations and R&D in Taiwan, Graid Technology is advancing RAID innovation for the next generation of data-intensive workloads. To learn more, visit graidtech.com.

SOURCE: Graid Technology Inc.


