• Home
  • Projects
  • Blog
  • About
Intelligent inference request routing for large language models

Intelligent inference request routing for large language models

by Ron Haberman, Huamin Chen, Yossi Ovadia, Ricardo Noriega De Soto, Andre Fredette, David Brewster | Nov 11, 2025 | AI

Today’s AI environment is experiencing a surge in specialized Large Language Models (LLMs), each possessing unique abilities and strengths123: Some are strong in reasoning and mathematics, while others may excel in creative writing. Yet most applications resort...
Introducing Kepler: Efficient power monitoring for Kubernetes

Introducing Kepler: Efficient power monitoring for Kubernetes

by Parul Singh, Huamin Chen | Aug 22, 2023 | Sustainability

Introduction Monitoring and optimizing power consumption is crucial for efficient resource management in Kubernetes environments. To address this need, a powerful tool called Kepler (Kubernetes-based Efficient Power Level Exporter) has emerged. Leveraging software...
Introducing Kepler: Efficient power monitoring for Kubernetes

Sustainability, the cloud native way

by Huamin Chen, Parul Singh, Kaiyi Liu | Feb 21, 2023 | Sustainability

Sustainability has gained tremendous mindshare across the world. Energy conservation and CO2 emissions reductions are among the key initiatives for environmental sustainability. Since data centers contributed to as much as 1% global electricity usage there is a...

Categories

  • AI
  • Developer Productivity
  • Edge Computing
  • Hybrid Cloud
  • Sustainability
  • Trust
Privacy statement
Terms of use
All policies and guidelines
About

Copyright © 2021-2023 Red Hat, Inc.