Articles from Source: Netflix-Technology-Blog

Dynamically Splitting Wide Partitions in Cassandra for Time Series Workloads

2026-06-03 02:05
🚀 Netflix's TimeSeries Abstraction efficiently manages petabytes of temporal data using Apache Cassandra. However, wide partitions in datasets can lead to high read latencies and timeouts. To address these challenges, the team developed a partitioning strategy that divides data into time chunks. This approach helps manage wide partitions and improves query efficiency. Additionally, they implemented dynamic partitioning that auto-detects and splits wide partitions based on usage, resulting in...
Source: Netflix Technology Blog
Netflix Technology Blog
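
The time-chunk idea in the summary above can be sketched as follows. This is a minimal illustration, not Netflix's implementation: the function names, the `(series_id, bucket_start)` key shape, and the one-day default chunk width are all assumptions made for the example. It shows only fixed-width chunking; the dynamic detection and splitting of wide partitions mentioned in the summary is not modeled here.

```python
from datetime import datetime, timezone
from typing import List, Tuple


def time_bucket(event_time: datetime, bucket_seconds: int) -> int:
    """Return the start (epoch seconds) of the time chunk containing event_time."""
    epoch = int(event_time.timestamp())
    return epoch - (epoch % bucket_seconds)


def partition_key(series_id: str, event_time: datetime,
                  bucket_seconds: int = 86_400) -> Tuple[str, int]:
    """Hypothetical partition key: the series id plus the chunk start.

    Events for one series that fall in different chunks land in different
    partitions, which bounds how wide any single partition can grow.
    """
    return (series_id, time_bucket(event_time, bucket_seconds))


def buckets_for_range(start: datetime, end: datetime,
                      bucket_seconds: int = 86_400) -> List[int]:
    """List the chunk starts a reader must visit to cover [start, end]."""
    first = time_bucket(start, bucket_seconds)
    last = time_bucket(end, bucket_seconds)
    return list(range(first, last + 1, bucket_seconds))
```

A narrower chunk width keeps partitions smaller at the cost of more partitions per range read, which is the trade-off a splitting strategy has to balance.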

High-Throughput Graph Abstraction at Netflix: Part I

2026-05-29 18:49
📊 Netflix's Graph Abstraction is designed to support high-throughput graph use cases, achieving nearly 10 million operations per second. It focuses on OLTP scenarios, ensuring low latency and cost efficiency while managing 650 TB of data. The architecture leverages existing data abstractions for efficient traversals and real-time indexing. This is just the first part of a series exploring its capabilities and integration with the Netflix ecosystem. Stay tuned for more insights! 🌐 #NetflixTech...
Source: Netflix Technology Blog

From Silos to Service Topology: Why Netflix Built a Real-Time Service Map

2026-05-29 14:01
🚀 Exciting advancements at Netflix! The engineering team has developed a real-time service map to enhance understanding of our complex infrastructure. This living map helps engineers quickly identify service dependencies, troubleshoot issues, and minimize disruptions for our members. Key benefits include:
- Unified view of service connections
- Fast access to detailed metrics
- Improved incident response times
This innovation supports thousands of microservices, ensuring smooth streaming...
Source: Netflix Technology Blog

Scaling ArchUnit with Nebula ArchRules

2026-05-08 15:55
Netflix's JVM Ecosystem team is enhancing Java library management with **Nebula ArchRules**. By utilizing ArchUnit, they can enforce architectural rules across thousands of Java repositories, addressing technical debt effectively. Key features include:
- Cross-language support using bytecode analysis.
- Custom rule creation with a user-friendly API.
- Sharing rules across multiple repositories.
This initiative aims to improve code quality and streamline library lifecycle management. 🔧📦...
Source: Netflix Technology Blog

Scaling Camera File Processing at Netflix

2026-04-24 15:06
Netflix's Media Production Suite (MPS) is designed to enhance global filmmaking by automating tasks and standardizing workflows. 🎬 By integrating FilmLight’s API (FLAPI), Netflix efficiently manages camera metadata and image processing. This collaboration aims to minimize errors and streamline production processes. MPS supports both seasoned filmmakers and newcomers, ensuring creativity remains the focus. #Netflix #MediaProduction #Filmmaking #Collaboration #TechInnovation
Source: Netflix Technology Blog

The Human Infrastructure: How Netflix Built the Operations Layer Behind Live at Scale

2026-04-17 15:01
🚀 Netflix has transformed its live streaming capabilities over the past three years. From streaming one show a month to over nine daily, they now support millions of concurrent viewers. 🔧 Initially, engineers handled operations without a dedicated team or command center. As demand increased, Netflix established specialized roles and created the Broadcast Operations Center (BOC) for efficient event management. 🌐 With ongoing growth, including plans for international operations, Netflix...
Source: Netflix Technology Blog

Evaluating Netflix Show Synopses with LLM-as-a-Judge

2026-04-10 16:26
📺 Netflix faces the challenge of helping users choose from thousands of titles. To enhance viewer experience, they emphasize the importance of high-quality show synopses. 📝 Their new LLM-based system evaluates synopsis quality across four key dimensions, achieving over 85% agreement with creative writers. This method allows Netflix to identify issues before a show's release. 🔍 The dual focus on creative quality and member feedback ensures that synopses serve both artistic standards and viewer...
Source: Netflix Technology Blog

Stop Answering the Same Question Twice: Interval-Aware Caching for Druid at Netflix Scale

2026-04-06 22:15
🚀 Netflix has developed an experimental caching layer for Apache Druid to enhance real-time analytics at scale. With over 10 trillion rows and 15 million events per second, repetitive queries from dashboards became a challenge. The new caching system retains older data, reducing redundant queries and improving response times by up to 66%. This innovative approach balances data freshness with performance, ensuring efficient data handling during high-demand events. #Netflix #Druid...
Source: Netflix Technology Blog
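
One way to picture the trade-off between freshness and reuse described above is a cache keyed by fixed time buckets, where only buckets old enough to be considered settled are reused. The sketch below is an assumption-laden illustration, not the caching layer from the article: the `IntervalCache` class, the `backend` callable, and the `fresh_seconds` threshold are hypothetical names, and the query is simplified to a sum over bucket-aligned intervals.

```python
from typing import Callable, Dict


class IntervalCache:
    """Cache per-bucket results; always re-query buckets that are still fresh."""

    def __init__(self, backend: Callable[[int, int], int],
                 bucket_seconds: int, fresh_seconds: int) -> None:
        self.backend = backend        # hypothetical: returns a value for [start, end)
        self.bucket = bucket_seconds  # width of each cached time bucket
        self.fresh = fresh_seconds    # data newer than this may still change
        self.store: Dict[int, int] = {}

    def query(self, start: int, end: int, now: int) -> int:
        """Sum the backend value over [start, end), assumed bucket-aligned."""
        total = 0
        for b in range(start - start % self.bucket, end, self.bucket):
            # A bucket is settled once it ends before the freshness window.
            settled = b + self.bucket <= now - self.fresh
            if settled and b in self.store:
                total += self.store[b]
                continue
            value = self.backend(b, b + self.bucket)
            if settled:
                self.store[b] = value
            total += value
        return total
```

With this shape, a dashboard that repeats the same query only pays for the most recent buckets on each refresh, while older buckets are answered from memory.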

Powering Multimodal Intelligence for Video Search

2026-04-04 00:44
Filmmakers today generate extensive footage, making it challenging to extract key moments quickly. The rise of AI-driven video search aims to simplify this process by moving beyond traditional keyword matching. This approach utilizes multiple specialized models to identify characters and scenes, creating a unified intelligence that responds to complex queries in real time. Key challenges include processing large volumes of data and ensuring relevant clips are surfaced efficiently. The goal is...
Source: Netflix Technology Blog

Smarter Live Streaming at Scale: Rolling Out VBR for All Netflix Live Events

2026-04-02 21:46
📺 Big news from Netflix! As of January 26, 2026, all live events are now encoded using Variable Bitrate (VBR) instead of Constant Bitrate (CBR). VBR adapts bitrate based on scene complexity, improving efficiency and scalability. While this enhances quality, it also introduces challenges with traffic predictability. We are rethinking delivery management to handle these changes effectively. #Netflix #LiveStreaming #VBR #TechUpdates #VideoQuality
Source: Netflix Technology Blog

Scaling Global Storytelling: Modernizing Localization Analytics at Netflix

2026-03-06 15:01
📊 At Netflix, we're modernizing our localization analytics to better serve our 300M+ members across 190+ countries. Our efforts focus on consolidating fragmented systems, improving user experience, and building a unified data architecture. This will enhance our ability to track dubbing and subtitling efficiently. We're also shifting to event-level analytics to refine user engagement with localized content. 📺🌍 #Netflix #Localization #Analytics #DataStrategy #GlobalStorytelling
Source: Netflix Technology Blog

Optimizing Recommendation Systems with JDK’s Vector API

2026-03-03 01:36
Optimizing Netflix's Ranker service has led to significant CPU improvements. A key focus was video serendipity scoring, which initially consumed 7.5% of CPU. By introducing batching and re-architecting memory layout, the team transformed the scoring process from costly nested loops to efficient matrix multiplications. This change reduced CPU usage per request and improved overall performance. The integration of JDK's Vector API enabled further optimizations, leading to a 7% drop in CPU...
Source: Netflix Technology Blog

Mount Mayhem at Netflix: Scaling Containers on Modern CPUs

2026-02-28 22:55
Behind the scenes at Netflix, scaling containers is essential for seamless streaming. 🚀 However, a recent update revealed bottlenecks linked to CPU architecture during container launches. When migrating to a new container platform, some nodes experienced long stalls due to growing mount table lengths. This led to health check timeouts and system lockups. 🔒 The issue was particularly prevalent on r5.metal instances with many-layered container images. Investigations showed that lock contention...
Source: Netflix Technology Blog

MediaFM: The Multimodal AI Foundation for Media Understanding at Netflix

2026-02-23 18:24
Introducing MediaFM, Netflix's new multimodal content embedding model! 🎬 This innovative model combines audio, video, and text to enhance our understanding of diverse media. By analyzing tens of millions of shots, MediaFM helps improve ad relevancy, clip popularity, and more. With its transformer-based architecture, it generates contextual embeddings for better content recommendations and promotional assets. Stay tuned for more updates! 🚀 #Netflix #AI #MediaUnderstanding #TechInnovation...
Source: Netflix Technology Blog

Scaling LLM Post-Training at Netflix

2026-02-13 08:05
At Netflix, we are advancing Large Language Models (LLMs) through a specialized Post-Training Framework. This framework focuses on aligning LLMs with specific intents and member interactions, enhancing personalization and search experiences. The architecture supports efficient data pipelines and distributed training, enabling model developers to innovate without getting bogged down by infrastructure complexities. Key features include dynamic sequence packing and tailored optimization...
Source: Netflix Technology Blog

Automating RDS Postgres to Aurora Postgres Migration

2026-02-12 14:07
In 2024, Netflix's Online Data Stores team reviewed their database technologies and chose to standardize on Amazon Aurora PostgreSQL. This decision was based on PostgreSQL's strong performance and industry momentum. The migration will start with RDS PostgreSQL, ensuring a smooth transition with minimal disruption. A self-service migration workflow has been designed to empower teams, managing operational and technical challenges effectively. For more details, check out the full article! 🌐💻...
Source: Netflix Technology Blog

The AI Evolution of Graph Search at Netflix

2026-01-26 19:01
🔍 Netflix is evolving its Graph Search platform by integrating AI to enhance search capabilities. Natural language processing is now being used, allowing users to query in everyday language instead of complex structured queries. This shift aims to improve user experience and reduce friction in retrieving information. The first part of a three-part series details how Netflix is implementing and refining this AI-driven approach. Stay tuned for more updates! 🚀 #Netflix #AI #GraphSearch...
Source: Netflix Technology Blog

How Temporal Powers Reliable Cloud Operations at Netflix

2025-12-15 23:51
🚀 Netflix has used Temporal, a Durable Execution platform, in its cloud operations since 2021. Adopting it reduced transient deployment failures from 4% to 0.0001%. Temporal streamlines processes for Spinnaker, Netflix's multi-cloud delivery platform, enabling more reliable and efficient deployments. With over 100 use cases and growing, Temporal is now integral to Netflix's operations. #Netflix #CloudComputing #Temporal #DevOps #SoftwareEngineering
Source: Netflix Technology Blog

Netflix Live Origin

2025-12-15 17:38
🚀 Discover the architecture behind Netflix's Live Origin! This custom server bridges cloud live streaming and Open Connect, managing content delivery efficiently. It utilizes a multi-tenant microservice model on AWS, ensuring resilience through redundant pipelines and epoch locking for segment selection. The Live Origin enhances streaming by detecting segment defects and optimizing traffic management, prioritizing critical requests during high loads. Stay tuned for more insights! #NetflixLive...
Source: Netflix Technology Blog

AV1 — Now Powering 30% of Netflix Streaming

2025-12-04 20:09
🎥 AV1 is now powering 30% of Netflix streaming, marking a major milestone in enhancing video quality and efficiency. This modern codec offers superior compression, allowing for better streaming experiences across various devices. 📱📺 Since its launch in 2018, AV1 has expanded from Android to smart TVs, web browsers, and Apple devices, reducing bandwidth use while improving visual quality. As Netflix looks forward to AV2, AV1 remains essential in revolutionizing streaming. 🌐 #Netflix #AV1...
Source: Netflix Technology Blog

Supercharging the ML and AI Development Experience at Netflix

2025-11-04 20:33
🚀 Exciting advancements in ML and AI at Netflix! The Metaflow framework, open-sourced in 2019, enhances the development experience by streamlining workflows from prototype to production. With the new Spin functionality, users can iterate rapidly, allowing for seamless transitions and faster debugging, making AI development smoother and more efficient. For more details, check out the full article! #MachineLearning #AI #Metaflow #NetflixTech #Innovation
Source: Netflix Technology Blog

Post-Training Generative Recommenders with Advantage-Weighted Supervised Finetuning

2025-10-25 22:01
🚀 Exciting developments in generative recommender systems! The article discusses how post-training generative recommenders (GRs) can enhance user experience by modeling behavior over time. It highlights the challenges of relying solely on observed user patterns, which can lead to poor recommendations. A new approach, Advantage-Weighted Supervised Fine-tuning (A-SFT), addresses issues with noisy reward models and limited counterfactual feedback. This method combines supervised fine-tuning with...
Source: Netflix Technology Blog

Behind the Streams: Real-Time Recommendations for Live Events Part 3

2025-10-21 00:53
🚀 Exciting insights from Netflix's latest article on enhancing live event experiences! As live events attract millions of viewers, Netflix has engineered a system for real-time recommendations. This ensures fans receive timely updates without overwhelming cloud services. Key strategies include prefetching data and adaptive broadcasting, effectively synchronizing devices and managing traffic spikes. Stay tuned for more on Netflix's innovative solutions! 🎥📡 #Netflix #LiveEvents #TechInnovation...
Source: Netflix Technology Blog

How and Why Netflix Built a Real-Time Distributed Graph: Part 1 — Ingesting and Processing Data…

2025-10-17 18:42
🌐 Netflix has developed a Real-Time Distributed Graph (RDG) to analyze member interactions across various services effectively. In Part 1 of their blog series, they outline the motivation behind the RDG and its data processing architecture. The transition from a single streaming service to multi-faceted offerings like live events and games necessitated a new approach to data analysis. By leveraging a graph system, Netflix can connect user activities across devices rapidly, enhancing...
Source: Netflix Technology Blog

100X Faster: How We Supercharged Netflix Maestro’s Workflow Engine

2025-09-29 16:10
🚀 Exciting improvements to Netflix's Maestro engine! The recent upgrade boosts performance by 100X, reducing workflow overhead from seconds to milliseconds. This redesign enhances scalability and meets evolving business needs, supporting more complex workflows. Explore the updated Maestro on GitHub and enhance your workflow orchestration today! 🌐 #Netflix #Maestro #DataEngineering #WorkflowOptimization #OpenSource
Source: Netflix Technology Blog

Building a Resilient Data Platform with Write-Ahead Log at Netflix

2025-09-26 18:57
📊 Netflix faces unique challenges in data management at scale, including data loss, corruption, and system entropy. To tackle these issues, they developed the Write-Ahead Log (WAL), a system that enhances data consistency and reliability. WAL ensures durable data changes and efficient message retries, crucial for Netflix’s real-time data pipelines. The simplified API allows teams to easily integrate different storage solutions while maintaining high performance. Learn more about how WAL is...
Source: Netflix Technology Blog

Scaling Muse: How Netflix Powers Data-Driven Creative Insights at Trillion-Row Scale

2025-09-22 21:24
🚀 At Netflix, our Muse application plays a vital role in delivering data-driven insights to enhance content discovery for members. Muse helps creative teams identify effective promotional media by analyzing audience engagement with various assets. As user demands evolved, we upgraded Muse's architecture to support advanced features while ensuring high performance. We implemented techniques like HyperLogLog sketches for efficient data processing and utilized the Hollow library for faster...
Source: Netflix Technology Blog

Empowering Netflix Engineers with Incident Management

2025-09-19 16:48
🚀 Netflix is transforming its incident management approach to enhance reliability for users worldwide. The company has shifted from a centralized model to empowering engineering teams to manage incidents independently, promoting a culture of ownership and learning. This change involved selecting a user-friendly tool, Incident.io, to simplify the process and encourage participation. Key aspects include intuitive design, internal integrations, and a balance between customization and...
Source: Netflix Technology Blog

From Facts & Metrics to Media Machine Learning: Evolving the Data Engineering Function at Netflix

2025-08-21 17:39
At Netflix, we are evolving our data engineering function with the introduction of Media ML Data Engineering. 🎥📊 This new specialization focuses on managing complex media data, allowing for centralized access to various media assets like video, audio, and text. The initiative aims to enhance machine learning capabilities and improve analytics through the Media Data Lake, which supports advanced technologies. Key responsibilities include standardizing media assets and enriching metadata to...
Source: Netflix Technology Blog

ML Observability: Bringing Transparency to Payments and Beyond

2025-08-18 18:15
At Netflix, ML observability is crucial for monitoring and understanding machine learning models in production. It allows teams to track performance, detect anomalies, and ensure reliability. This is particularly important in payment processing, where optimizing transactions helps reduce friction for users. By utilizing ML observability tools, we can enhance model performance and maintain stakeholder trust through clear insights into model behavior. Examples include logging, monitoring, and...
Source: Netflix Technology Blog

Accelerating Video Quality Control at Netflix with Pixel Error Detection

2025-08-11 21:29
🚀 Netflix has developed an automated method for video quality control that detects pixel-level artifacts, reducing manual reviews. This new system identifies hot pixels that can distract viewers, ensuring a seamless viewing experience. By using a specialized neural network, Netflix speeds up the QC process from hours to minutes. This innovation allows creative teams to focus more on storytelling rather than technical issues. 🎥✨ #Netflix #VideoQuality #Innovation #TechForGood #Filmmaking
Source: Netflix Technology Blog