Articles from Source: Red-Hat-Developer-Blog

Strengthen privacy and security with encrypted DNS in RHEL

2025-09-17 07:01
🔒 In today's digital world, securing DNS traffic is crucial. Unencrypted DNS queries can expose sensitive information to eavesdroppers, leading to potential data breaches. 🔍 Encrypted DNS, particularly DNS over TLS (DoT), is now available in Red Hat Enterprise Linux 10 and 9.6. This advancement strengthens network security by ensuring that DNS queries are kept private and verifiable. 🛠️ The article provides a step-by-step guide for implementing encrypted DNS to improve system security during...
Source: Red Hat Developer Blog
Dominika Borges, Pavel Březina, Francisco Trivino Garcia

How to enable Ansible Lightspeed intelligent assistant

2025-09-16 07:00
🚀 Discover the Red Hat Ansible Lightspeed intelligent assistant in the latest Red Hat Ansible Automation Platform 2.6! This integrated chatbot utilizes various inference backends and can be deployed with Red Hat OpenShift AI. The article provides a step-by-step guide for installing OpenShift AI and setting up an inference service on the same cluster. Key topics include: - Installing OpenShift AI - Deploying an Inference Service - Configuring Ansible Lightspeed Stay tuned for more insights in...
Source: Red Hat Developer Blog
Riya Sharma, Elijah DeLee

Why some agentic AI developers are moving code from Python to Rust

2025-09-15 07:00
AI developers are exploring a shift from Python to Rust for agentic AI solutions. While Python is popular for its simplicity and rich libraries, its Global Interpreter Lock (GIL) limits performance in CPU-bound tasks, especially as systems scale from 5 to 500 agents. Rust offers a solution with better concurrency and scalability, allowing more efficient handling of multiple agents and CPU-intensive tasks. Developers are finding that a hybrid approach—prototyping in Python and optimizing with...
Source: Red Hat Developer Blog
Louis Imershein

Confidential VMs: The core of confidential containers

2025-09-15 07:00
🔍 Discover the essentials of Confidential Virtual Machines (CVMs) and their role in enhancing the security of confidential containers (CoCo). CVMs utilize hardware and software to ensure data confidentiality, isolating workloads from the host environment. This integration with Red Hat Enterprise Linux (RHEL) and OpenShift boosts security standards for data in use. 🛡️ Learn about features like Unified Kernel Images (UKI) and remote attestation that enhance the protection of workloads....
Source: Red Hat Developer Blog
Emanuele Giuseppe Esposito

Benchmarking with GuideLLM in air-gapped OpenShift clusters

2025-09-15 07:00
Deploying and benchmarking large language models (LLMs) in air-gapped environments is vital for regulated enterprises. This article details the process of using the Red Hat AI Inference Server with vLLM and GuideLLM for performance evaluation within a disconnected OpenShift cluster. Key components include prebuilt container images, Persistent Volume Claims (PVCs), and OpenShift-native Job resources. GuideLLM, an open-source tool, provides metrics like token throughput and latency, ensuring...
Source: Red Hat Developer Blog
Philip Hayes, Thameem Abbas Ibrahim Bathusha

Run Qwen3-Next on vLLM with Red Hat AI: A step-by-step guide

2025-09-12 22:59
🚀 Exciting news in AI! The Qwen3-Next model features a new hybrid attention and sparse MoE architecture, enhancing training efficiency and inference speed. 🔧 With Day 0 support from vLLM, organizations can deploy it immediately using Red Hat AI for secure and scalable solutions. 📊 Key improvements include multi-token prediction and optimized training stability. For a step-by-step guide on deployment, check the latest blog! #AI #RedHat #OpenSource #MachineLearning #Qwen3Next
Source: Red Hat Developer Blog
clobner

How to implement observability with Python and Llama Stack

2025-09-12 07:01
Discover how to enhance AI application observability using Python and Llama Stack! 🚀 This article highlights the importance of observability in production, focusing on logging, metrics, and distributed tracing. It introduces OpenTelemetry as a key tool for instrumenting applications and discusses setting up Jaeger for trace visualization. Explore the detailed steps for implementing observability, including configuring your Llama Stack instance to capture and visualize traces effectively....
Source: Red Hat Developer Blog
Michael Dawson

Deploy a lightweight AI model with AI Inference Server containerization

2025-09-12 07:01
🚀 Ready to explore AI? This tutorial provides a step-by-step guide to deploy a lightweight AI model, Llama-3.2-1B, using the Red Hat AI Inference Server. It’s designed for quick testing on personal machines with local GPUs. 🔧 You'll need a valid Red Hat account and a compatible GPU. The tutorial covers everything from logging in to the Red Hat container registry to running the model with minimal setup. For more details, check out the tutorial! #AI #RedHat #MachineLearning #Containerization...
Source: Red Hat Developer Blog
Christina Zhang

vLLM Semantic Router: Improving efficiency in AI reasoning

2025-09-11 07:01
Introducing the vLLM Semantic Router, an open-source solution for enhancing efficiency in AI reasoning. 🎉 This system intelligently routes queries based on their complexity, ensuring that resources are used wisely. It utilizes a ModernBERT-based classifier for semantic classification, sending simpler requests to faster models and more complex ones to stronger models. Key benefits include improved accuracy (+10.2%), reduced latency (–47.1%), and decreased token usage (–48.5%). This approach...
Source: Red Hat Developer Blog
Huamin Chen

Declaratively assigning DNS records to virtual machines

2025-09-11 07:00
🔍 Virtual machines often require DNS records to maintain unique identities. This article discusses how to declaratively assign these records using Red Hat OpenShift Virtualization. 📊 It highlights the importance of naming conventions and provides methods to automate DNS record assignments via labels and annotations. 🔧 Key considerations include network exposure, IP assignment, and managing multiple DNS records for diverse network interfaces. #VirtualMachines #DNSManagement #OpenShift #GitOps...
Source: Red Hat Developer Blog
Raffaele Spazzoli

How to deploy language models with Red Hat OpenShift AI

2025-09-10 07:00
🚀 Red Hat OpenShift AI is transforming the deployment of language models! This guide explores the OpenShift AI console, your hub for managing data science projects. You can easily deploy models like Llama, leveraging GPU acceleration and resource scaling. Key features include project dashboards, model tracking, and multiple storage options. Check out the step-by-step deployment process for Llama, from GPU setup to testing both internal and external access. 🔗 Watch the full video demo for a...
Source: Red Hat Developer Blog
Alicia Cao

AI search with style: Fashion on OpenShift AI with EDB

2025-09-10 07:00
Unlocking fashion e-commerce with AI! 🛍️✨ Traditional keyword searches often miss the mark in understanding customers' true intent. This article highlights a solution using semantic search, which captures meaning and intent in fashion searches. EDB Postgres AI and Red Hat OpenShift AI work together to process AI data, enabling seamless visual and text searches. Users can upload images or describe items without needing exact terms. This innovative approach not only enhances search accuracy but...
Source: Red Hat Developer Blog
Shane Heroux

What qualifies for Red Hat Developer Subscription for Teams?

2025-09-09 14:00
Unlock development potential with the Red Hat Developer Subscription for Teams! 🚀 This program provides organizations using Red Hat technologies free access to Red Hat Enterprise Linux for development activities. It's available via Red Hat representatives or self-service at developers.redhat.com. Key activities covered include software design, coding, building, testing, and pre-production setups. The subscription supports efficient application development and deployment. For more details,...
Source: Red Hat Developer Blog
Josh Swanson, Brian Gollaher

How to run OpenAI's gpt-oss models locally with RamaLama

2025-09-09 07:01
Unlock the power of OpenAI's gpt-oss models locally with RamaLama! 🖥️ These models, available in 20B and 120B variants, enable advanced AI capabilities right on your machine. RamaLama simplifies the setup process using containerization for security and ease. 🚀 Key features include zero trust security and automatic GPU optimization. Get started easily with just a single command. Explore more about running AI models in a secure, efficient manner! 🔒✨ #OpenAI #AIModels #RamaLama #MachineLearning...
Source: Red Hat Developer Blog
Cedric Clyburn

Using DNS over TLS in OpenShift to secure communications

2025-09-09 07:01
🔒 Secure your DNS traffic in Red Hat OpenShift with DNS over TLS (DoT). This feature enhances security by ensuring all DNS communications are encrypted, aligning with zero trust architecture principles. 🛠️ The recent RHEL 10 release introduces encrypted DNS, allowing DoT during installation and runtime. While currently a Technology Preview in Identity Management (IdM), it lays the groundwork for secure operations. 🔍 Explore installation steps for IdM and OpenShift, including configuring DNS...
Source: Red Hat Developer Blog
Josep Andreu Font, Ramon Gordillo Gutierrez

Scaling DeepSeek and Sparse MoE models in vLLM with llm-d

2025-09-08 14:02
🚀 Exciting advancements in scaling Mixture of Experts (MoE) models with vLLM and the llm-d project are transforming open-source LLM capabilities. 🌐 This article discusses innovations like multi-head latent attention and sparse configurations, enabling efficient deployment in Kubernetes. Learn how vLLM enhances expert parallelism and communication for large models. For detailed insights, check the full article! 📊 #MachineLearning #AI #Kubernetes #DeepLearning #OpenSource
Source: Red Hat Developer Blog
Robert Shaw, Tyler Smith

Scaling DeepSeek-style MoEs with vLLM and llm-d using Wide EP

2025-09-08 14:02
🔍 Exciting advancements in serving large-scale Mixture of Experts (MoE) language models are discussed in a recent article on vLLM and llm-d. The article covers the architectural changes in vLLM that enhance the efficiency of DeepSeek-style models. Key innovations include multi-head latent attention and sparse configurations with hundreds of experts. llm-d enables high-performance deployments in Kubernetes, offering intelligent scheduling and expert parallelism for efficient scaling. Learn...
Source: Red Hat Developer Blog
Robert Shaw, Tyler Smith

Multicluster authentication with Ansible Automation Platform

2025-09-08 07:00
Implementing multicluster authentication between Red Hat Ansible Automation Platform and Red Hat Advanced Cluster Management can enhance security and streamline operations. This integration allows for centralized authentication, reducing credential proliferation. Key features include dynamic token management and network security via Cluster Proxy. For successful implementation, ensure both ManagedServiceAccount and Cluster Proxy are enabled in your setup. Access detailed steps and best...
Source: Red Hat Developer Blog
Alberto Filice

Verify Cosign bring-your-own PKI signature on OpenShift

2025-09-08 07:00
🚀 Red Hat OpenShift 4.16 introduces ClusterImagePolicy and ImagePolicy for sigstore verification. These tech preview features support Fulcio CA and public key policies. 🔍 The bring-your-own PKI (BYO-PKI) feature, available from OpenShift 4.19, allows validation of container images using existing X.509 certificates. 🔧 The article outlines how to sign images with Cosign and configure OpenShift for signature verification using ClusterImagePolicy. #OpenShift #Cosign #BYOPKI #ContainerSecurity...
Source: Red Hat Developer Blog
Qi Wang

What's new in network observability 1.9

2025-09-05 07:01
🚀 Exciting updates in Network Observability 1.9! This version enhances insights into network traffic with features like IPsec tracking, flowlogs-pipeline filter queries, and UDN mapping. It is compatible with Red Hat OpenShift Container Platform 4.19 and older releases. Learn how to enable IPsec and explore the new CLI enhancements for capturing flows, metrics, and packets. For detailed installation instructions, refer to the OpenShift documentation. 📊🔍 #NetworkObservability #RedHat...
Source: Red Hat Developer Blog
Steven Lee

Customize your deployments with the Red Hat Developer Hub Operator

2025-09-04 16:53
🚀 The Red Hat Developer Hub enhances internal developer portals when paired with Red Hat OpenShift. Key updates in version 1.2 include: - A new Argo CD front-end plug-in for better UX. - Enhanced orchestration capabilities for software templates. - Improved provenance tracking through ScaffoldedFrom metadata. Deployment is streamlined using the Red Hat Developer Hub Operator. Access your Backstage instance easily with the provided route. For detailed customization, a Backstage custom resource...
Source: Red Hat Developer Blog
Francisco De Melo Junior

How to migrate from Fluentd to Vector in OpenShift 4

2025-09-04 07:01
🔄 **Migrating from Fluentd to Vector in OpenShift 4** 🔄 This article provides a comprehensive guide for migrating the default log collector in OpenShift 4 from Fluentd to Vector. With Fluentd being deprecated in Logging 5.X, this transition is essential to access the new features in Logging 6.0. Vector serves as a log collector and analyzer, simplifying the processing of logs for real-time analysis. It allows logs to be sent to destinations like Amazon CloudWatch and supports JSON formatted...
Source: Red Hat Developer Blog
Calvin Lee, Jamie Parker

How platform engineering accelerates enterprise AI adoption

2025-09-04 07:01
🚀 Platform engineering is reshaping enterprise AI adoption. By integrating technologies like Kafka and service mesh, organizations can enhance AI deployments. 🔍 Key challenges include reproducibility and compliance. Platform engineering addresses these by providing self-service access and standardized environments, enabling developers and data scientists to innovate efficiently. 💻 Red Hat OpenShift and Developer Hub are pivotal in making AI resources accessible while ensuring governance. #AI...
Source: Red Hat Developer Blog
Maarten Vandeperre, Camille Nigon

How to deploy Azure Red Hat OpenShift using Terraform

2025-09-04 07:01
🚀 Learn how to deploy Azure Red Hat OpenShift using Terraform! This article covers the steps to set up and configure your Azure infrastructure with Terraform, ensuring compliance using Azure policies. It emphasizes the importance of governance rules and the use of policy-as-code for better resource management. Key prerequisites include Terraform CLI, Azure CLI, and proper role assignments. The setup involves creating a network, configuring security, and deploying the OpenShift cluster with...
Source: Red Hat Developer Blog
Mario Dietner

Effective observability with Red Hat build of OpenTelemetry

2025-09-03 07:01
🚀 Discover the power of observability with the Red Hat build of OpenTelemetry! This framework enables comprehensive metrics and logs reporting, crucial for monitoring your applications and infrastructure. It simplifies data collection and management while providing scalability and flexibility. Key features include: - Unified data collection - Seamless integration with monitoring tools - Enterprise-grade support from Red Hat Explore how to get started and enhance your observability practices!...
Source: Red Hat Developer Blog
Giovanni Fontana

vLLM with torch.compile: Efficient LLM inference on PyTorch

2025-09-03 07:01
🚀 Efficient LLM inference is crucial in today’s diverse tech landscape. The article discusses how **torch.compile**, PyTorch's JIT compiler, streamlines performance by automatically optimizing kernels. This reduces the burden on developers, allowing them to focus on model design rather than manual tuning. Incorporated into **vLLM**, torch.compile enhances usability and performance through custom compiler passes. It supports dynamic batch sizes and improves startup times with caching...
Source: Red Hat Developer Blog
Luka Govedič, Addie Stevens, Michael Goin, Saša Zelenović

Your LLM is too large: How I generate production-ready failure analysis on a toaster

2025-09-02 07:00
Running production-grade Kubernetes failure analysis on a cost-effective edge device can streamline troubleshooting. Using Llama 3.2:3B with 4-bit quantization, root cause analysis is achieved in just 70 seconds. This method incorporates pattern preprocessing to efficiently identify known failures without overwhelming the system with raw logs. Real-world results show a significant cost reduction, from $0.30-3.00 per analysis to less than $0.001, while providing actionable insights. Explore...
Source: Red Hat Developer Blog
Caleb Evans

Migrate your OpenShift logging stack from Elasticsearch to Loki

2025-09-01 07:01
🚀 To leverage the latest logging features in Red Hat OpenShift 6.0, migrating from Elasticsearch to Loki is essential. This guide details how to test changes in development and plan for production implementations. Loki, a scalable log aggregation system, offers improved performance by using log labels. It allows multiple tenants, which simplifies resource management. 🔄 The migration process involves running both stacks in parallel, ensuring old logs remain accessible via Elasticsearch while...
Source: Red Hat Developer Blog
Oscar Casal Sanchez, Jamie Parker

Migrating Ansible Automation Platform 2.4 to 2.5

2025-08-29 07:01
🚀 Migrating from Ansible Automation Platform 2.4 to 2.5 involves careful steps to ensure a smooth transition. This article outlines the preparation, export, and import process using the configify.aapconfig collection. Key points include: - Ensure Ansible Automation Platform 2.5 is deployed separately. - Follow similar steps as migrating from AWX 25. - Focus on user roles and authentication changes in the new version. For detailed steps, refer to the article! 📜 #Ansible #Automation #DevOps...
Source: Red Hat Developer Blog
Konstantin Kuminsky

Multicluster resiliency with global load balancing and mesh federation

2025-08-28 07:01
Explore the new architecture for multicluster resiliency using global load balancing and mesh federation! 🌐 This approach combines a global load balancer and a federated service mesh to enhance service availability and disaster recovery, particularly for stateless workloads. New capabilities in Red Hat OpenShift Service Mesh 3.0 and Red Hat Connectivity Link now allow for more robust deployments. Learn how to configure these tools for optimal performance! #Multicluster #RedHat #CloudComputing...
Source: Red Hat Developer Blog
Raffaele Spazzoli

Simplify local prototyping with Camel JBang infrastructure

2025-08-28 07:01
🚀 Apache Camel simplifies integration with systems like databases and APIs through minimal coding. 🌟 Camel JBang, a command-line interface, enhances prototyping by allowing rapid creation and testing of integration flows without complex setups. 🔧 Its infra command quickly launches backends like Kafka or ActiveMQ, streamlining the development process and reducing setup time. For more details, check out the full article! #ApacheCamel #CamelJBang #Integration #Development #Prototyping
Source: Red Hat Developer Blog
Bruno Meseguer

Smart deployments at scale: Leveraging ApplicationSets and Helm with cluster labels in Red Hat Advanced Cluster Management for Kubernetes

2025-08-27 07:01
Managing multiple Kubernetes clusters can be complex, but Red Hat Advanced Cluster Management simplifies this process. 🌐 It offers a centralized platform to oversee the entire lifecycle of Kubernetes clusters, ensuring consistent health monitoring and policy enforcement across environments. Combining ApplicationSets and Helm with cluster labels allows for tailored deployments, adapting configurations based on specific cluster characteristics. This integration streamlines operations and...
Source: Red Hat Developer Blog
Mikel Sanchez

How to verify container signatures in disconnected OpenShift

2025-08-27 07:01
🔍 Discover how to verify container signatures in disconnected OpenShift environments using the latest tools from sigstore! The article explores the use of oc-mirror v2 in Red Hat OpenShift 4.19, allowing mirroring of container images and their cryptographic signatures. It provides a proof of concept and detailed configuration steps for enabling signature verification with CoSign. Check it out for practical insights! 🛠️🔒 #OpenShift #ContainerSecurity #Sigstore #RedHat #DevOps
Source: Red Hat Developer Blog
Mohammad Ahmad, Tom Stockwell, Alex Guidi, Jack Adolph

Event-driven ingestion of Keycloak entities

2025-08-27 07:01
🚀 Discover a solution to delayed updates in entity information with the Backstage Events System! This article details a Proof of Concept (PoC) that enables near real-time synchronization of Keycloak entities into Red Hat Developer Hub (RHDH). Key benefits include immediate updates and efficient, incremental syncing, reducing API calls and CPU usage. Learn how to set up the PoC and optimize your developer catalog! 🔗 Check out the PoC code: [GitHub Link] #Keycloak #RedHat #EventDriven...
Source: Red Hat Developer Blog
Kashish Mittal

BGP dynamic routing with Fast Data Path on RHOSO 18

2025-08-27 07:01
Exploring the performance of dynamic routing with OVN-BGP-Agent and Fast Data Path on RHOSO 18 has yielded insightful findings. 🚀 A recent Proof of Concept assessed throughput, packet loss, stability, and resource utilization using Trex and BIRD. The results show high throughput, especially with large frames, and stable performance over extended periods. 📈 However, there are limitations, including bottlenecks for small packets and some manual configuration challenges. Insights from this study...
Source: Red Hat Developer Blog
Pradipta Sahoo, Spoorthi K, Haresh Khandelwal

A VM tuning case study: Balancing power and performance on AMD processors

2025-08-26 07:01
During a server deployment, a significant performance gap was found between bare metal and virtual machine (VM) workloads. Optimizations, including adjusting system profiles and enabling CPU scaling drivers, were implemented. These changes resulted in notable improvements in VM performance, with the tuned VM even surpassing the original bare-metal completion times. The study highlights how targeted adjustments can lead to substantial gains in efficiency. 🔧💻⚡️ #VMTuning...
Source: Red Hat Developer Blog
Kevin Buettner

Optimize GPU utilization with Kueue and KEDA

2025-08-26 07:01
Explore how integrating Kueue and KEDA can optimize GPU utilization in AI workloads! 🚀 This proof of concept showcases a method to enhance resource efficiency on OpenShift AI. The combination allows scaling long-running workloads to zero when idle, reducing costs significantly. Learn how to implement this strategy with a focus on resource management and effective workload scheduling. 📊💻 #AI #GPUUtilization #OpenShift #Kubernetes #CloudComputing
Source: Red Hat Developer Blog
Christian Zaccaria

Implement AI safeguards with Python and Llama Stack

2025-08-26 07:01
🚀 Exploring AI safety with Llama Stack! This article highlights how to implement guardrails in AI applications using Python and Llama Stack. It introduces two main built-in guardrails: Llama Guard, which filters unsafe content, and Prompt Guard, designed to prevent circumvention of safety measures. The post provides insights into setting up Llama Stack and utilizing these guardrails effectively in Python. #AI #LlamaStack #Python #MachineLearning #AIsafety
Source: Red Hat Developer Blog
Michael Dawson

LLM Compressor 0.7.0 release recap

2025-08-25 16:09
🚀 LLM Compressor has released version 0.7.0, enhancing performance for quantizing large language models. Key updates include: 1️⃣ New QuIP and SpinQuant-style transforms for improved accuracy. 2️⃣ Mixed-precision support with FP4 enhancements for better layer quantization. 3️⃣ DeepSeek v3-style block quantization for efficient compression without calibration data. Explore more about these features! #LLMCompressor #AI #MachineLearning #Quantization #TechUpdate
Source: Red Hat Developer Blog
Dipika Sikka, Kyle Sayers, Brian Dellabetta, Helen Zhao

What is an image mode 3-way merge?

2025-08-25 07:01
🔍 Curious about the 3-way merge in Red Hat Enterprise Linux (RHEL)? In image mode, a new filesystem image is created to manage updates. This process includes a third version, older than the current and new images, to reduce conflicts. The merge prioritizes local changes, ensuring personalized settings remain intact. Utilizing OSTree, RHEL manages multiple OS installations effectively, making the merging process smoother. 🖥️✨ #RedHat #Linux #3WayMerge #OSTree #TechUpdates
Source: Red Hat Developer Blog
Matt Micene