Blog

The last mile: from zero trust tokens to real-world resources

by Pavel Anni | Jun 24, 2026 | AI

Key takeaways: The last mile problem is translating a verified zero trust delegation token into credentials that external resources will accept while preserving the permission intersection. This is solved by a credential gateway that validates the token, computes the...

Triton Kernel Profiling with Proton and ROCm on AMD GPUs

by Joseph Groenenboom, Craig Magina | Jun 17, 2026 | AI

This is the second article in our series on Triton kernel profiling. In our first post, Triton kernel profiling with NVIDIA Nsight tools, we introduced how to profile and optimize custom Triton GPU kernels on NVIDIA hardware. In this post, we focus specifically on...

Wiring zero trust identity for AI agents: SPIFFE, token exchange, and Kagenti

by Pavel Anni | Jun 10, 2026 | AI, Trust

Key takeaways: The identity plumbing for zero trust delegation is accomplished by wiring three technologies together: SPIFFE for service-to-service cryptographic workload identity (mTLS), AuthBridge via RFC 8693 token exchange to pass user delegation context (JWTs),...

From context to dreams: architecting memory for AI agents

by Sanjeev Rampal, Ben Capper, Kateryna Romashko, Wes Jackson, Ryan Cook | Jun 1, 2026 | AI

Have you ever felt that every conversation you have with an LLM across sessions feels like starting over from scratch? LLMs have a problem: they have the memory of a goldfish (no disrespect to goldfish intended). This article explores the solution: Agent memory. Agent...

Benchmarking AI inference on CPUs: A transparent blueprint for the enterprise

by Maryam Tahhan, John Harrigan, Anton Ivanov, Paul Power, Luigi Mario Zuccarelli | May 28, 2026 | AI

As enterprises look to optimize the total cost of ownership (TCO) of Large Language Model deployment, utilizing existing enterprise CPU infrastructure alongside GPU resources for specific inference workloads has become a strategic initiative. However, infrastructure...

Zero trust for AI agents: why delegation beats impersonation

by Emerging Technologies, Pavel Anni | May 21, 2026 | AI, Trust

When an AI agent acts on your behalf, how much of "you" should it become? In AI systems, agent impersonation creates security risks by granting overly broad permissions. This post introduces a delegation model using a 'permission intersection' pattern, ensuring agents...

Who’s really calling? Securing agent-to-agent communication

by Emerging Technologies, Kevin Cogan, Morgan Foster | May 13, 2026 | AI, Trust

The gap between what an agent claims and what the platform can verify is a real attack surface, and it grows with every new agent you onboard. As agents increasingly discover and call each other at runtime, protocols like Agent2Agent (A2A) have introduced a useful...

Code execution with MCP: How sandboxed Python replaces tool schema bloat in AI agents

by Rounak Bende | Apr 23, 2026 | AI

As the number of tools connected to an AI agent grows, JSON Schema definitions become a massive scaling bottleneck. Every tool carries a full schema that gets loaded into the LLM’s context window on every turn. Our tests show that replacing these schemas with a...

PyTorch Call Stack Deep Dive: Tracing Tensor Operations from Python to C++ Kernels

by Christopher Leonard | Mar 27, 2026 | AI

Eliminating the ‘Rego tax’: How AI orchestrators automate Kubernetes compliance

by Anamika Valappil, Alekhya Koppineni | Mar 20, 2026 | AI, Trust

Manually writing OPA Rego policies is a significant bottleneck for many platform teams, creating a 'Rego tax' that can slow down development and introduce risk. This article introduces a new approach: a Dynamic Kubernetes Policy Generator that uses a large language...
