Projects
This could be a subhead about the projects on this page.

From hand-tuned to generated: A reproducible Triton GPU kernel benchmark across different vendors

Protecting Triton kernel deployments with cryptographic signatures

Skip the JITters: Fast, trusted model kernels with OCI caching

Architecting Cloud-Native Ambient Agents: Patterns for Scale and Control

Simplifying Edge AI Builds with Verified GitHub Actions Patterns

A Practical Approach to Smart Tool Retrieval for Enterprise AI Agents

Tool RAG: The Next Breakthrough in Scalable AI Agents

Triton Kernel Profiling with NVIDIA Nsight Tools

Intelligent inference request routing for large language models

Enhancing AI inference security with confidential computing: A path to private data inference with proprietary LLMs

A developer’s guide to PyTorch, containers, and NVIDIA – Solving the puzzle

Understanding Triton Cache: Optimizing GPU Kernel Compilation
No results found.
