From hand-tuned to generated: A reproducible Triton GPU kernel benchmark across different vendors

by Alessandro Sangiorgi, Liron Kesem | Feb 12, 2026 | AI

In the world of Large Language Models (LLMs), speed is critical. Much of this speed comes from highly specialized functions called GPU kernels: small, focused routines that instruct the GPU how to perform calculations with maximum efficiency...
Understanding Triton Cache: Optimizing GPU Kernel Compilation

by Alessandro Sangiorgi | May 16, 2025 | AI

If you’re working with GPU kernels, you’ve likely encountered Triton, a language and compiler designed for writing highly efficient custom GPU kernels. One of Triton’s valuable features is its kernel caching system, which can significantly...

Copyright © 2021-2026 Red Hat, LLC