• Home
  • Projects
  • Blog
  • About
From hand-tuned to generated: A reproducible Triton GPU kernel benchmark across different vendors

From hand-tuned to generated: A reproducible Triton GPU kernel benchmark across different vendors

by Alessandro Sangiorgi, Liron Kesem | Feb 12, 2026 | AI

In the world of Large Language Models (LLMs), speed is very important. Much of this speed comes from highly specialized functions called GPU kernels which are small, focused routines that instruct the GPU how to perform calculations with the maximum efficiency....

Categories

  • AI
  • Developer Productivity
  • Edge Computing
  • Hybrid Cloud
  • Sustainability
  • Trust
Privacy statement
Terms of use
All policies and guidelines
About

Copyright © 2021-2026 Red Hat, LLC