Emerging Technologies Focus: AI

Latest Blog Posts

Understanding Triton Cache: Optimizing GPU Kernel Compilation

The goal of this blog post is to explore Triton’s caching mechanism: how it works, what affects it, how different frameworks leverage it, and how you can optimize it for your specific workloads.

Model authenticity and transparency with Sigstore

by Ivan Font

What is the Sigstore model transparency project? Sigstore’s Model Transparency project is a Sigstore community project aimed at applying the software supply chain security practice of signing to machine learning (ML) models. Hosted on GitHub at...

A container-first approach to Triton development

by Maryam Tahhan

The Triton project from OpenAI is at the forefront of a groundbreaking movement to democratize AI accelerators and GPU kernel programming. It provides a powerful and flexible framework for writing high-performance GPU kernels. As AI workloads become increasingly...

Getting started with PyTorch and Triton on AMD GPUs using the Red Hat Universal Base Image

by Sanjeev Rampal, Steven Royer

In a prior blog post, we provided an overview of the Triton language and its ecosystem. Triton is a Python-based DSL (domain-specific language), compiler, and related tooling designed for writing efficient GPU kernels in a hardware-agnostic manner, offering high-level...

User experience and its importance in adoption of democratized AI

by Anil Vishnoi, Brent Salisbury, Ryan Cook

"An intuitive and accessible UI is critical to even the most powerful AI system" As artificial intelligence (AI) continues to evolve, its influence spans across industries, transforming operations and enhancing decision-making processes. At this point in time, I...

Current Projects

ROSA – Data Analysis Models

Aspen social graph analysis

ROSA – Foundation Models Generative AI for Doc Search

Codeflare Stack Integration to ODH/RHODS

HCS – Red Hat Subscription Delivery

ShadowBot.AI

Open MLOps Stacks – Data Labeling

Generative AI for CVE backports

Open MLOps Stacks – VectorDB

Confidential Compute for ML Workloads in OpenShift
