Articles from Source: Red-Hat-Developer-Blog

Blast radius validation: Large and small Red Hat OpenShift nodes

2026-04-02 03:00
🔍 Exploring the impact of node size in Red Hat OpenShift! This article evaluates if larger nodes increase operational risk during planned maintenance and outages. Tests were conducted using three-node clusters with varying sizes and workloads. 📊 Results show that when resources are balanced, larger nodes do not significantly increase recovery time. In fact, during planned maintenance, larger nodes recovered faster than smaller ones. Key focus areas include: - Operational risk related to core...
Source: Red Hat Developer Blog
Chris Janiszewski, Ata Mufti

Run Gemma 4 with Red Hat AI on Day 0: A step-by-step guide

2026-04-02 00:24
🚀 Exciting news from Google DeepMind! The Gemma 4 model family has been launched, featuring four variants with 2B to 31B parameters. All models support multimodal inputs—text, image, video, and the smaller ones even support audio. You can start using Gemma 4 today via the Red Hat AI Inference Server, which offers immediate experimentation. With a Mixture-of-Experts architecture, the 26B model maintains high efficiency while reducing inference costs. Learn more about installation and usage in...
Source: Red Hat Developer Blog
Saša Zelenović, Selbi Siddiqi, Tarun Kumar, Daniele Trifirò, Lucas Wilkinson

Implement a multicluster event exporter for enterprise automation

2026-04-01 07:01
🚀 Red Hat Advanced Cluster Management enhances Kubernetes with a multicluster global hub, enabling management at scale. This hub provides unified visibility and policy compliance across thousands of clusters. A key feature is the multicluster global hub agent, which can function as an event exporter, streamlining event data for enterprise automation. By standardizing events into CloudEvents format, it simplifies integration with tools like Ansible and Splunk, allowing for real-time visibility...
Source: Red Hat Developer Blog
Meng Yan

Bootable containers: Reduce friction with Red Hat Enterprise Linux image mode

2026-04-01 03:15
🚀 Deploying applications can be challenging, especially in regulated environments. The transition from local development to production often leads to infrastructure hurdles. Red Hat’s image mode for RHEL aims to bridge the gap between development and operations. By treating RHEL as a bootable container, it simplifies infrastructure management and promotes a unified workflow. This approach allows teams to collaborate effectively, using a common set of tools and language. With bootable...
Source: Red Hat Developer Blog
Louis Imershein

Unsloth and Training Hub: Lightning-fast LoRA and QLoRA fine-tuning

2026-04-01 03:15
🚀 Exciting updates in LLM fine-tuning! Red Hat's Training Hub is an open-source library designed to simplify the post-training process for large language models. It offers a unified interface for various algorithms, making transitions seamless. Key features include support for LoRA and QLoRA, which allow for fast, efficient model adaptation with reduced memory needs. With performance enhancements through the Unsloth backend, teams can achieve quicker training times with less VRAM. Explore...
Source: Red Hat Developer Blog
Aditi Saluja, Mustafa Eyceoz, Oleg Silkin

Unlocking efficiency: A guide to operator cache configuration on Red Hat OpenShift and Kubernetes

2026-03-31 17:40
Understanding operator cache configuration on Red Hat OpenShift and Kubernetes is crucial for efficient memory management. As operators scale, issues like out of memory (OOM) errors can arise, especially in cluster scope deployments. These errors may occur during upgrades or due to unexpected caching behavior of the controller-runtime. The controller-runtime builds a cache of cluster resources, which can lead to high memory usage at startup or during spikes in resource events. Proper...
Source: Red Hat Developer Blog
Jim Fitzpatrick

How to get raw device mapping with OpenShift Virtualization

2026-03-31 07:01
Unlock the potential of OpenShift Virtualization with Raw Device Mapping (RDM)-like experiences! 🚀 This article explains how RDM volumes allow direct LUN attachments to VMs, enhancing disk I/O performance and supporting SCSI commands. OpenShift can replicate this with proper CSI drivers, enabling efficient volume management. Explore migration strategies and key features for using RDM-like volumes. #OpenShift #Virtualization #CloudComputing #VMware #TechInsights
Source: Red Hat Developer Blog
Raffaele Spazzoli

Spending transaction monitor: Agentic AI for intelligent financial alerts

2026-03-30 12:29
Discover the future of financial monitoring with the Spending Transaction Monitor's agentic AI! 💰 Traditional systems often require complex rule configurations. This new approach allows users to set alerts using natural language, making it simpler to manage and adapt to real spending behaviors. 📝 Key features include intelligent recommendations, multi-channel notifications, and automated rule creation through AI agents. This solution not only enhances user experience but also offers adaptive...
Source: Red Hat Developer Blog
Saurabh Agarwal, Theia Surette, Sid Kattoju, Yash Oza, RJ Johnson

Vibes, specs, skills, and agents: The four pillars of AI coding

2026-03-30 07:16
Unlock the potential of AI coding with a four-pillar approach: vibes, specs, skills, and agents. 🌐 Utilize vibes for idea exploration, create specs for clear instructions, develop skills for specific capabilities, and employ agents for executing code. This balance leads to smoother, more accurate outcomes. Embrace this methodology to enhance productivity and code quality. 💻✨ #AICoding #SoftwareEngineering #RedHat #DevOps #TechInnovation
Source: Red Hat Developer Blog
Rich Naszcyniec

Integrate Claude Code with Red Hat AI Inference Server on OpenShift

2026-03-26 13:52
🚀 Exciting developments in coding efficiency! The article discusses integrating Claude Code, a terminal-based coding agent, with the Red Hat AI Inference Server on OpenShift. This setup enhances productivity by allowing developers to interact with code using natural language while keeping the inference process private. 📋 Key prerequisites include an OpenShift cluster with GPU support and a Hugging Face account. The deployment process involves creating a Helm chart and setting environment...
Source: Red Hat Developer Blog
Alexander Barbosa Ayala

Scale LLM fine-tuning with Training Hub and OpenShift AI

2026-03-26 07:00
Unlock the potential of large language models (LLMs) with Red Hat OpenShift AI! 🚀 Data science teams can now transform local experiments into scalable workflows. Begin with the Training Hub for fine-tuning, then move to OpenShift AI for collaborative work. Scale with Kubeflow Trainer and operationalize using AI pipelines and Model Registry. This structured approach ensures your workflow remains consistent while improving production capabilities. 💻✨ #AI #MachineLearning #OpenShift #DataScience...
Source: Red Hat Developer Blog
Brian Gallagher

Reproducible builds in Project Hummingbird

2026-03-26 03:15
🔍 Red Hat's Project Hummingbird introduces reproducible builds for enhanced software supply chain security. These builds allow users to verify that OCI images match their published versions, preventing undetectable tampering. Hummingbird images, designed for environments with minimal CVEs, are created in the Konflux software factory and come with an SBOM and SLSA provenance artifact. 🛠️ By using tools like cosign and podman, users can easily rebuild Hummingbird images, ensuring trust and...
Source: Red Hat Developer Blog
Jonathan Lebon

Getting started with the vLLM Semantic Router project's Athena release: Optimize your tokens for agentic AI

2026-03-25 15:52
🚀 The vLLM Semantic Router's Athena 0.2 release optimizes token management for AI. It intelligently routes requests between local and cloud models to reduce costs by over 90%. 🌐 The setup involves a local Qwen3-Coder-Next and a cloud fallback with Google's Gemini 2.5 Pro. The API remains OpenAI-compatible, ensuring a seamless experience. Learn how to implement it with detailed guidance on GitHub. #AI #TokenOptimization #OpenSource #vLLM #SemanticRouter
Source: Red Hat Developer Blog
Christopher Nuland

Dynamic resource allocation goes GA in Red Hat OpenShift 4.21: Smarter GPU scheduling for AI workloads

2026-03-25 14:25
🚀 OpenShift 4.21 introduces Dynamic Resource Allocation (DRA), now in General Availability! DRA enhances GPU scheduling by allowing workloads to request hardware based on specific device attributes, overcoming limitations of the previous device plug-in model. Key features include resource sharing, topology awareness, and prioritized device requests, streamlining resource management for AI workloads. Discover how DRA transforms your GPU cluster management! #OpenShift #GPU #CloudComputing...
Source: Red Hat Developer Blog
Harshal Patil

How to run a Red Hat-powered local AI audio transcription

2026-03-25 07:01
Unlock the potential of AI with local audio transcription! 🎤 This guide walks you through setting up a transcription application using Red Hat AI. With just a few Python commands, you can keep your audio files secure and offline. Learn how to install the necessary tools, download the Red Hat model, and transcribe audio files seamlessly. Dive into the world of open-source AI and enhance your projects! 🚀 #RedHatAI #AudioTranscription #OpenSource #AIFuture #PythonProgramming
Source: Red Hat Developer Blog
Seth Kenlon

Run Model-as-a-Service for multiple LLMs on OpenShift

2026-03-24 07:16
Unlock the power of multiple LLMs with Model-as-a-Service (MaaS) on OpenShift! 🚀 This guide details how to create a unified entry point for AI inference, simplifying developer interactions with various models like Qwen and TinyLlama. Learn to build intelligent routing that efficiently directs traffic and reduces GPU waste. One endpoint means unified authentication and simplified monitoring. Explore the integration of llm-d capabilities into Red Hat OpenShift AI for a streamlined enterprise...
Source: Red Hat Developer Blog
Vladimir Belousov

Evaluate OpenShift cluster health with the cluster observability operator

2026-03-24 07:16
🌐 Red Hat OpenShift introduces a new component health overview within the cluster observability operator, now in Developer Preview. This feature aids in assessing the health of the OpenShift control plane and other components, categorizing their status as OK, warning, or error through a Perses dashboard. To use, install cluster observability operator 1.4 or later via OperatorHub and enable recommended monitoring. Explore Prometheus metrics to view component relationships and health statuses....
Source: Red Hat Developer Blog
Tomas Remes

Integrate Red Hat Advanced Cluster Management with Argo CD

2026-03-24 07:16
🌐 Red Hat Advanced Cluster Management integrates with Argo CD for enhanced Kubernetes application management. This setup allows clusters to subscribe to Git repositories using Channels and Subscriptions, supporting both push and pull models. 🔍 Key features include the ApplicationSet Custom Resource and the ability to configure sync policies for better resource management. 📊 Audit logs play a crucial role in tracking actions and troubleshooting issues. Always verify the Argo CD console and...
Source: Red Hat Developer Blog
Francisco De Melo Junior

Upgrade Advanced Cluster Management hubs without disruption

2026-03-23 07:00
Upgrading Red Hat Advanced Cluster Management hubs can be challenging due to risks like downtime and upgrade failures. The new solution, managed cluster migration, allows for parallel hub deployment. This means a new hub version can be set up while the old one remains operational, ensuring zero disruption during the upgrade process. The migration is monitored closely, with automatic rollbacks in case of failures, making the upgrade process safer and more reliable. #RedHat #Kubernetes...
Source: Red Hat Developer Blog
Dang Peng Liu

Eval-driven development: Build and evaluate reliable AI agents

2026-03-23 07:00
🚀 Check out insights from our journey in developing the rh-ai-quickstart/it-self-service-agent! We explored an evaluations framework tailored for AI agents, emphasizing the need for comprehensive testing due to their inherent variability. Key stages of our evaluation journey include: 1️⃣ Manual testing with predefined conversations 2️⃣ Automated evaluations with custom metrics 3️⃣ Continuous integration for ongoing improvements Learn more about how we integrated these practices into our...
Source: Red Hat Developer Blog
Michael Dawson

Hybrid loan-decisioning with OpenShift AI and Vertex AI

2026-03-19 07:01
Explore how modern financial applications make loan decisions using hybrid machine learning systems! 🤖💼 The architecture features a loan approval classifier on Google Cloud with Vertex AI, while an ONNX regression model for interest rate prediction runs on Red Hat OpenShift AI on-premise. This setup ensures sensitive data remains secure and compliant. 🔒 Key components include a lightweight React frontend and a Llama-based chatbot for enhanced user interaction, all orchestrated seamlessly....
Source: Red Hat Developer Blog
Harshil Sabhnani

Rebalance hub workloads with managed cluster migration

2026-03-19 07:01
🌐 As Kubernetes deployments grow, managing workloads efficiently becomes crucial. Red Hat Advanced Cluster Management introduces a multicluster global hub for dynamic redistribution of managed clusters. This system allows for workload balancing, easing the strain on individual hubs and reducing latency. To implement this, ensure the global hub is set up and source clusters are imported correctly. This approach supports various scenarios, including capacity redistribution and geographic...
Source: Red Hat Developer Blog
Dang Peng Liu

How to operate OpenShift in air-gapped environments

2026-03-19 07:01
Operating Red Hat OpenShift in air-gapped environments enhances security by eliminating direct internet access. Organizations must manage all software and updates through controlled processes to meet strict compliance and sovereignty requirements. 🔒🌐 Key challenges include initial setup and ongoing maintenance. It’s crucial to establish a defined operational cadence for mirroring platform updates, managing operator catalogs, and ensuring security fixes. 📅🔄 Implementing a mirror factory can...
Source: Red Hat Developer Blog
Phillip Knezevich

Automate test and failure analysis via streams for Apache Kafka

2026-03-19 07:01
In enterprise software testing, syncing failure analysis between ReportPortal and Polarion is crucial. This article outlines an event-driven solution using Apache Kafka, Debezium CDC, and Quarkus to automate this sync. The approach ensures real-time updates with minimal lag, eliminating manual data entry. Key challenges included handling divergent timelines and ensuring consistent data flow across platforms. The CDC method captures updates at the database level, allowing for seamless...
Source: Red Hat Developer Blog
Guannan Sun

LLM Compressor v0.10: Faster compression with distributed GPTQ

2026-03-18 15:21
🚀 Exciting news for AI developers! LLM Compressor v0.10 has launched, introducing faster compression for large language models (LLMs). Key features include: - **Distributed GPTQ**: Achieve up to 3.8x speedup using multiple GPUs. - **Compressed-tensors offloading**: Compress models beyond your memory capacity. - **GPTQ FP4 microscale support**: Utilize NVFP4 and MXFP4 quantization schemes. This update enhances performance and accuracy, making it a valuable tool for managing large models...
Source: Red Hat Developer Blog
Kyle Sayers, Charles Hernandez, Dipika Sikka

How Advanced Cluster Management simplifies rule management

2026-03-18 07:01
Managing security for secondary networks can be complex and time-consuming. The article discusses how Red Hat Advanced Cluster Management for Kubernetes simplifies this task. By using ConfigMaps, you can define network rules centrally on a hub cluster. This automation allows for the creation of localized MultiNetworkPolicies across all managed clusters, reducing manual efforts and errors. The system ensures compliance and security posture at scale, making it easier for teams to manage their...
Source: Red Hat Developer Blog
Moyo Oyegunle

Prepare to enable Linux pressure stall information on Red Hat OpenShift

2026-03-18 03:01
🚀 Red Hat OpenShift 4.21 introduces Linux pressure stall information (PSI) via MachineConfig, enhancing resource monitoring for CPU, memory, and I/O. Enabling PSI helps identify resource contention and hidden bottlenecks, improving autoscaling and debugging. However, it increases memory usage for Prometheus pods, showing a 42% rise with over 500 test containers. For more insights on PSI metrics and their impact, check out the article. #OpenShift #Linux #PSI #Kubernetes #CloudComputing
Source: Red Hat Developer Blog
Qiujie Li

Advanced Cluster Management 2.16 right-sizing recommendation GA

2026-03-17 07:00
🔍 Red Hat Advanced Cluster Management 2.16 has launched its right-sizing recommendations for namespaces and OpenShift Virtualization workloads, now available for enterprise use. 📊 This feature provides a unified approach to resource optimization, helping organizations manage clusters efficiently. Key benefits include improved resource insight, data-driven recommendations, and insightful visualizations through integrated Grafana dashboards. 💡 Administrators can identify over and under-...
Source: Red Hat Developer Blog
Darshan Vandra, Raj Zalavadia

Configure NVIDIA Blackwell GPUs for Red Hat AI workloads

2026-03-16 20:30
📢 Exciting news for Red Hat AI users! The NVIDIA RTX PRO 4500 Blackwell Server Edition offers enhanced GPU acceleration for enterprise data centers. This server edition boosts performance for various Red Hat AI applications, making it easier to build and deploy AI workloads efficiently. To configure, install the NVIDIA GPU Operator in Red Hat OpenShift, ensuring optimal settings for driver and kernel modules. Discover the benefits of compact, power-efficient AI deployments with Blackwell!...
Source: Red Hat Developer Blog
Erwan Gallen, Tarun Kumar, Antonin Stefanutti, Selbi Nuryyeva, Michey Mehta

Unlocking UBI to Red Hat Enterprise Linux container images

2026-03-16 12:30
Unlocking UBI for Red Hat Enterprise Linux (RHEL) enhances your container development. While the Red Hat Universal Base Image (UBI) is lightweight, it lacks some essential packages. By utilizing a no-cost Red Hat Developer subscription, you gain access to a broader set of RHEL packages, including databases and additional tools. This subscription supports both individual projects and business workloads, ensuring you have a reliable foundation for your development needs. Explore the benefits of...
Source: Red Hat Developer Blog
Louis Imershein

Zero trust GitOps: Build a secure, secretless GitOps pipeline

2026-03-13 07:01
🔍 Discover how OpenShift GitOps enhances security with short-lived tokens! This integration with the external secrets operator allows for secure management of credentials, minimizing the risk of breaches. Short-lived tokens provide limited access and ensure continuous authentication. Learn more about this innovative approach to secure GitOps pipelines! #OpenShift #GitOps #Cybersecurity #DevOps #Kubernetes
Source: Red Hat Developer Blog
Nick Png

How to manage Red Hat OpenShift AI dependencies with Kustomize and Argo CD

2026-03-13 07:01
🚀 Managing dependencies for Red Hat OpenShift AI can be complex, but the new odh-gitops repository simplifies the process. This repository offers a GitOps-ready template with Kustomize manifests for deploying all necessary dependencies. You can easily apply configurations using the `oc apply -k` command or through Argo CD for automated deployment. For those preferring Helm, a chart alternative is in development and will be available soon. Check out the odh-gitops repository to learn more!...
Source: Red Hat Developer Blog
Davide Bianchi, Andrea Tarocchi

How to develop agentic workflows in a CI pipeline with cicaddy

2026-03-12 07:00
🚀 Discover how to develop agentic workflows in your CI pipeline using cicaddy! Traditional agentic platforms can be complex, but cicaddy simplifies this by allowing you to integrate AI directly within your existing CI/CD workflows. This approach enhances your workflows without replacing them, using Large Language Models for tasks like report generation and anomaly detection. Learn how to automate tasks seamlessly! 🔍🔄 #CICD #AIWorkflow #DevOps #Automation #Cicaddy
Source: Red Hat Developer Blog
Guannan Sun

Accelerated expert-parallel distributed tuning in Red Hat OpenShift AI

2026-03-11 15:50
Red Hat OpenShift AI enhances AI performance through distributed fine-tuning of foundation models. The article discusses challenges in coordinating computation and communication across GPUs. To address this, it introduces the open-source library, fms-hf-tuning, which supports efficient fine-tuning of language and vision-language models. Key features include data preprocessing, throughput optimization, and expert parallelism techniques. The library aims to improve memory efficiency and...
Source: Red Hat Developer Blog
Karel Suta, Amita Sharma

Improve code quality and security with PatchPatrol

2026-03-11 07:01
🚀 Introducing PatchPatrol, an open-source tool designed to enhance code quality and security for enterprise teams using Red Hat OpenShift. PatchPatrol integrates AI-powered analysis directly into your CI/CD pipelines, ensuring every code change meets security standards. It offers dual modes: one for code quality and another focused on security vulnerabilities, including OWASP Top 10 detection. With flexible backend options, teams can choose between local and cloud models based on their needs....
Source: Red Hat Developer Blog
Herve Beraud

Agent Skills: Explore security threats and controls

2026-03-10 07:16
📢 Anthropic has launched Agent Skills, a new functionality now available across various agents, including Goose. This feature allows agents to perform tasks tailored to user needs using structured skills organized in folders. 🔍 The article discusses the importance of managing security threats and access controls with Agent Skills. Key considerations include proper folder permissions, vulnerability management, and the risks associated with executable scripts. 🔒 To mitigate potential risks like...
Source: Red Hat Developer Blog
Florencio Cano Gabarda

How to run Slurm workloads on OpenShift with Slinky operator

2026-03-10 07:16
🚀 Exciting advancements in high-performance computing! This article covers how to run Slurm workloads on OpenShift using the Slinky operator. Slurm is a robust workload manager, and combining it with OpenShift enhances scalability and automation for HPC environments. 🔧 The Slinky operator simplifies deployment, scaling, and lifecycle management of Slurm components within Kubernetes. This integration allows teams to utilize familiar tools while leveraging containerized infrastructure. 💡 Key...
Source: Red Hat Developer Blog
Prudhvi Vuda, Swati Kale

Effortless Red Hat Enterprise Linux virtual machines with Libvirt and Kickstart

2026-03-10 03:01
Creating virtual machines (VMs) for development is essential for testing and reproducing production environments. This article outlines a straightforward method using Libvirt and Kickstart scripts in Red Hat Enterprise Linux (RHEL) to automate VM provisioning. With Kickstart, developers can set up customized test VMs without needing extensive network infrastructure. Both Windows and MacOS users can utilize these scripts effectively, even within WSL2 or through Podman Desktop. Learn how to...
Source: Red Hat Developer Blog
Fernando Lozano

5 steps to triage vLLM performance

2026-03-09 14:12
Navigating LLM performance in production can be challenging. The article outlines a 5-step diagnostic workflow for optimizing vLLM deployments. 🔍 Start by defining your performance objectives based on workload profiles: throughput-sensitive, latency-sensitive, or bursty. 1️⃣ Identify where latency occurs by analyzing Time to First Token (TTFT) and Inter-Token Latency (ITL). 2️⃣ Monitor server saturation to understand request queue dynamics. 3️⃣ Evaluate GPU memory and KV cache health to...
Source: Red Hat Developer Blog
David Whyte-Gray, Thameem Abbas Ibrahim Bathusha, Michael Goin, Ashish Kamra

Automate AI agents with the Responses API in Llama Stack

2026-03-09 14:12
🚀 Automate AI agents with the Responses API in Llama Stack! This article discusses how the Responses API enhances AI agent orchestration while maintaining precise control over conversations. It automates tool calls and state management, facilitating smoother interactions. Learn about the benefits of adopting this API, especially for IT process automation, and explore hands-on examples through the AI quickstart series. For more insights, check out the full article! 📈🤖 #AI #Automation...
Source: Red Hat Developer Blog
Michael Dawson