• Home
  • Projects
  • Blog
  • About
Benchmarking AI inference on CPUs: A transparent blueprint for the enterprise

Benchmarking AI inference on CPUs: A transparent blueprint for the enterprise

by Maryam Tahhan, John Harrigan, Anton Ivanov, Paul Power, Luigi Mario Zuccarelli | May 28, 2026 | AI

As enterprises look to optimize the total cost of ownership (TCO) of Large Language Model deployment, utilizing existing enterprise CPU infrastructure alongside GPU resources for specific inference workloads has become a strategic initiative. However, infrastructure...
Protecting Triton kernel deployments with cryptographic signatures

Protecting Triton kernel deployments with cryptographic signatures

by Anton Ivanov, Maryam Tahhan | Feb 5, 2026 | AI

Triton is a domain-specific language and compiler for writing high-performance GPU kernels (snippets of compiled GPU code) using a Python-like syntax. It offers fine-grained control over memory and parallelism, making it ideal for custom, architecture-optimized...
Using eBPF in unprivileged Pods

Using eBPF in unprivileged Pods

by Maryam Tahhan, Andrew Stoycos, Anton Ivanov | Jul 18, 2023 | Hybrid Cloud

Extended Berkeley Packet Filter (eBPF) presents an attractive technology that Kubernetes applications can take advantage of, either to accelerate their packet processing needs (as an in kernel Fast Path) or as part of various monitoring and telemetry projects....

Categories

  • AI
  • Developer Productivity
  • Edge Computing
  • Hybrid Cloud
  • Sustainability
  • Trust
Privacy statement
Terms of use
All policies and guidelines
About

Copyright © 2021-2026 Red Hat, LLC