Articles from Source: Databricks-Blog

Announcing the winners of the 2026 Databricks Customer Awards

2026-06-09 17:30
🌟 Exciting news from Databricks! The 2026 Customer Awards have recognized 10 outstanding organizations and leaders. These winners demonstrate excellence in various categories, including innovation, transformation, and social impact. Representing diverse sectors like energy, semiconductors, dairy co-ops, and nonprofits, they are harnessing data and AI to tackle real-world challenges. Congratulations to all the winners! 🎉 #Databricks #CustomerAwards #DataInnovation #AI #LeadersInIndustry
Source: Databricks Blog

Announcing the 2026 Databricks Customer Awards Industry winners

2026-06-09 17:30
🚀 Exciting news! The 2026 Databricks Customer Awards have announced their industry winners. Ten organizations from diverse sectors, including financial services, healthcare, and retail, have been recognized for their innovative use of data and AI. These winners showcase how leveraging Databricks can address complex challenges and achieve significant results. Congratulations to all! 🎉 #Databricks #CustomerAwards #DataIntelligence #AI #Innovation
Source: Databricks Blog

Transforming solar and wind maintenance reports with Genie and AI agents

2026-06-08 18:15
🚀 Plenitude is transforming solar and wind maintenance reports by using Databricks Genie and AI agents. They have developed an agent-based system that converts unstructured maintenance PDFs into a searchable data model. This allows users to ask natural-language questions and generate visualizations easily. Early results show improved analysis speed and secure access, paving the way for predictive maintenance. #RenewableEnergy #AI #DataAnalytics #SolarEnergy #WindEnergy
Source: Databricks Blog

Enterprise Data Strategy Roadmap for Business Outcomes

2026-06-08 13:45
📊 An effective enterprise data strategy links data assets directly to business goals. This includes strong data governance, architecture, and analytics frameworks that adapt to changing needs. 🔍 Key elements are data quality management and master data management, essential for informed decision-making and compliance. 🚀 Implementing a phased roadmap with cross-functional teams and data literacy programs can enhance competitive advantage and foster a lasting data-driven culture. #DataStrategy...
Source: Databricks Blog

Enabling Evolutionary Database Development: database branching with Lakebase, continued

2026-06-05 13:35
🌐 Exciting advancements in database development are here! In the latest article, Pramod Sadalage and Kevin Hartman explore the Evolutionary Database Design methodology from 20 years ago, now enhanced by Databricks Lakebase. The new copy-on-write branching allows for efficient database changes, lifting previous constraints and enabling better collaboration between DBAs and developers. Jen, a character from the original series, demonstrates how these updated practices support team-scale...
Source: Databricks Blog
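As a rough illustration of the copy-on-write idea mentioned in this summary, here is a minimal Python sketch. It is a toy model, not how Lakebase implements branching: the `Branch` class, its `pages` dictionary, and the sample keys are all hypothetical names chosen for the example. A branch stores only the entries it has written and falls back to its parent for everything else, which is why creating one is cheap.

```python
class Branch:
    """Toy copy-on-write view: stores only what was written on this branch."""

    def __init__(self, parent=None):
        self.parent = parent
        self.pages = {}  # key -> value, only for entries changed here

    def read(self, key):
        # Walk up the parent chain until some branch owns the key.
        node = self
        while node is not None:
            if key in node.pages:
                return node.pages[key]
            node = node.parent
        raise KeyError(key)

    def write(self, key, value):
        # Writes never touch the parent, so other branches are unaffected.
        self.pages[key] = value

    def branch(self):
        # Creating a branch copies no data; it only records the parent.
        return Branch(parent=self)


main = Branch()
main.write("users", ["ada", "grace"])
main.write("schema_version", 1)

dev = main.branch()             # hypothetical developer branch
dev.write("schema_version", 2)  # change is private to dev

print(dev.read("schema_version"), main.read("schema_version"))
```

One simplification to keep in mind: in this toy, later writes to `main` remain visible from `dev`, whereas a real branching system would typically pin the branch to a point-in-time snapshot of its parent.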

Data + AI Summit 2026: Insider’s Guide for Financial Services Leaders

2026-06-05 13:00
🌟 The Data + AI Summit 2026 is set to attract top financial services leaders in San Francisco. This guide offers insights on key sessions in banking, insurance, payments, and capital markets. Learn how major firms like Morgan Stanley and JPMorgan Chase are leveraging AI for transformation and modernization. Maximize your experience by focusing on executive forums and networking opportunities. #DataAISummit #FinancialServices #AI #Networking #Innovation
Source: Databricks Blog

3x Faster Search: Parallel Test-Time Scaling with Instructed-Retriever-1

2026-06-04 13:31
🚀 Exciting news in AI research! The Databricks AI Research Team has introduced a significant update to the Agent Bricks Knowledge Assistant. Answer generation time is now 2x faster, and search time has improved by over 3x, achieving a Time To First Token of around two seconds. ⏱️ This enhancement is driven by Instructed-Retriever-1, which uses parallel processing for retrieval tasks, improving both recall and precision without sacrificing quality. Learn more about this innovative approach in...
Source: Databricks Blog

Apache Spark Real-Time Mode for Gaming: A Better Way to Do Real-Time Sessionization

2026-06-03 20:25
🚀 Exciting advancements in the gaming industry! Apache Spark™ Real-Time Mode is enhancing sessionization for millions of active gaming devices. This technology allows for real-time tracking with sub-second latency, ensuring personalized gaming experiences. The use of transformWithState timers enables proactive heartbeats, delivering timely updates and improving gameplay. #GamingTech #DataEngineering #ApacheSpark #RealTimeAnalytics #GameDevelopment
Source: Databricks Blog
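To make timer-driven sessionization more concrete, here is a small plain-Python sketch. It does not use the Spark `transformWithState` API; the `Sessionizer` class, the 30-second gap, and the device IDs are assumptions made for illustration. It keeps one open session per device and closes a session when a timer check finds the device has been silent for longer than the gap.

```python
from dataclasses import dataclass

SESSION_GAP = 30  # seconds of inactivity that closes a session (assumed value)


@dataclass
class Session:
    device_id: str
    start: int
    last_seen: int
    events: int = 1


class Sessionizer:
    """Toy per-device session tracker driven by events and timer checks."""

    def __init__(self, gap=SESSION_GAP):
        self.gap = gap
        self.open = {}    # device_id -> currently open Session
        self.closed = []  # sessions that timed out

    def on_timer(self, now):
        # Close every open session that has been idle for at least `gap`.
        expired = [d for d, s in self.open.items() if now - s.last_seen >= self.gap]
        for device_id in expired:
            self.closed.append(self.open.pop(device_id))

    def on_event(self, device_id, ts):
        self.on_timer(ts)  # fire due timers before handling the event
        session = self.open.get(device_id)
        if session is None:
            self.open[device_id] = Session(device_id, ts, ts)
        else:
            session.last_seen = ts
            session.events += 1


tracker = Sessionizer()
for device_id, ts in [("a", 0), ("a", 10), ("b", 15), ("a", 50)]:
    tracker.on_event(device_id, ts)

for s in tracker.closed:
    print(s)
```

In a streaming engine the timer would fire on its own schedule rather than piggybacking on the next event, which is what lets a system react to silence (for example, a missed heartbeat) without waiting for new input.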

Bring Databricks into Kiro IDE with the AI Dev Kit Power

2026-06-03 17:43
🚀 Exciting news for developers! Kiro IDE can now connect with the Databricks Data Intelligence Platform in two effective ways. You can use the four Databricks-managed MCP servers for a quick 10-minute setup or opt for the new Databricks AI Dev Kit Power for a more comprehensive integration. Explore these options to enhance your development experience! 💻✨ #Databricks #KiroIDE #AIDevKit #DataIntelligence #TechNews
Source: Databricks Blog

Scaling Enterprise Conversational Intelligence: Cross-industry Technology and Functional Solutions Powered by Databricks Genie

2026-06-03 11:45
🚀 Databricks Genie is transforming enterprise conversational intelligence across various industries. Leading consulting and SI partners have developed innovative solutions that cater to specific technology and functional needs in areas like sales, marketing, HR, and more. These ready-to-deploy offerings aim to accelerate AI transformation for enterprises. #Databricks #AI #ConversationalIntelligence #EnterpriseSolutions #TechInnovation
Source: Databricks Blog

Beyond parsing X12: Closing the gap for revenue cycle workflows in healthcare

2026-06-02 19:28
Healthcare billers face a significant workflow issue despite having parsed EDI data effectively. Many still find themselves bogged down by spreadsheets and SQL queries, spending too much time on tasks like denials and appeals. To address this, Genpact and Databricks have developed an operational workbench that enhances efficiency while keeping data secure. #Healthcare #RevenueCycle #EDI #DataManagement #Innovation 📊💼🚑
Source: Databricks Blog

Agentic BI: A Practical Guide for BI Teams and Business Users

2026-06-02 17:07
Discover how Agentic BI is transforming data analytics for organizations. 📊 This approach utilizes AI agents to automate the entire analytics workflow, from data preparation to delivering insights. It addresses the common dissatisfaction many organizations face with static dashboards. A strong semantic layer is crucial for ensuring consistent answers across queries. BI teams and business users can adopt this method gradually for effective implementation. #AgenticBI #DataAnalytics #AI...
Source: Databricks Blog

Data Science vs Data Analytics: Compare Careers, Skills, and Degrees

2026-06-02 16:49
Data science and data analytics serve distinct roles in the data field. Data analytics focuses on analyzing past data using tools like SQL and Power BI. In contrast, data science involves creating predictive models and automating decision-making. When choosing between them, consider your technical skills, comfort with unstructured data, and career goals. Both roles complement each other, with analysts providing essential insights for data scientists. 📊🔍💻 #DataScience #DataAnalytics...
Source: Databricks Blog

AI in Defense: How Artificial Intelligence Is Reshaping National Security

2026-06-02 14:52
Artificial intelligence is reshaping national security and military operations today. 🌍 Countries are rapidly advancing AI for defense, resulting in a global AI race that carries significant strategic implications. ⚔️ Key factors include responsible AI governance, model validation, and ensuring human oversight in combat settings. 🔍 To effectively integrate AI, defense organizations must focus on interoperability standards, acquisition reform, and workforce training. 📚 #AIDefense...
Source: Databricks Blog

Data Governance Architecture: A Complete Blueprint for Modern Organizations

2026-06-02 14:37
📊 Data governance architecture is essential for organizations aiming to enhance data quality. Key components include policies, roles, and technologies that guide data management. Effective programs rely on four pillars: people, policies, processes, and technology, all supported by a governance council and data stewards. Modern strategies utilize automated tracking and role-based access to ensure compliance and quality at scale. #DataGovernance #DataQuality #ModernOrganizations #DataManagement...
Source: Databricks Blog

Query Tags: The Context Your Warehouse Queries Have Been Missing

2026-06-02 14:32
🔍 Discover how Databricks SQL enhances query management with Query Tags. This feature automatically logs key attributes of every query, including who executed it and where. Teams can now attribute warehouse costs by various dimensions and monitor queries from partner tools like dbt, Power BI, and Tableau through automatic tagging. Queries can be tagged from multiple sources, ensuring better tracking and troubleshooting. #DataAnalytics #SQL #Databricks #QueryManagement #BusinessIntelligence
Source: Databricks Blog

Practical Data Warehouse Design and Architecture Guide

2026-06-02 12:46
Unlock the potential of data with a solid data warehouse design! 📊 This guide emphasizes the importance of aligning stakeholder reporting needs before choosing a schema or storage solution. It outlines a three-tier architecture that enhances data management. Key components include ETL/ELT pipelines, automated testing, and access controls to ensure data consistency and security. #DataWarehouse #DataEngineering #Analytics #DataArchitecture #BusinessIntelligence
Source: Databricks Blog

AI Governance Maturity Model: Matrix, Assessment, and Roadmap

2026-06-02 12:21
Understanding AI governance is crucial for organizations today. The AI Governance Maturity Model offers a five-level framework, from ad hoc to optimized, to help assess and improve governance capabilities. A recent Gartner survey revealed that less than half of large organizations can show measurable progress in governance, despite many claiming to have oversight programs. Organizations that view AI governance as a strategic advantage can enhance their competitive edge. 📊🚀 #AIGovernance...
Source: Databricks Blog

Introducing Cross-Engine ABAC

2026-06-02 03:00
🚀 Exciting news in data governance! The introduction of Cross-Engine ABAC in Unity Catalog allows users to define attribute-based access controls. This means you can set tag-based row filters and column masks just once, and they will be enforced across various engines. This advancement promotes centralized governance and enhances data security. 🌐🔒 #DataGovernance #ABAC #UnityCatalog #DataSecurity #TechNews
Source: Databricks Blog
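The "define once, enforce everywhere" idea behind tag-based column masks can be sketched in a few lines of plain Python. This is only a conceptual model and not Unity Catalog syntax: the `COLUMN_TAGS` mapping, the `POLICIES` list, the `pii` tag, and the `compliance` group are all hypothetical. The policy targets a tag rather than a column, so any column carrying that tag is masked without a per-column rule.

```python
# Hypothetical tag assignments: column name -> set of governance tags.
COLUMN_TAGS = {
    "email": {"pii"},
    "ssn": {"pii"},
    "country": set(),
}


def mask_value(value):
    """Replace a sensitive value with a fixed placeholder."""
    return "***"


# One policy per tag, written once and applied to every tagged column.
POLICIES = [
    {"tag": "pii", "exempt_groups": {"compliance"}, "mask": mask_value},
]


def apply_policies(row, user_groups):
    """Return a copy of `row` with tagged columns masked for this user."""
    result = {}
    for column, value in row.items():
        tags = COLUMN_TAGS.get(column, set())
        for policy in POLICIES:
            is_exempt = bool(user_groups & policy["exempt_groups"])
            if policy["tag"] in tags and not is_exempt:
                value = policy["mask"](value)
        result[column] = value
    return result


row = {"email": "ada@example.com", "ssn": "123-45-6789", "country": "IT"}
print(apply_policies(row, {"analysts"}))
print(apply_policies(row, {"compliance"}))
```

Because the rule is attached to the tag, tagging a new column as `pii` would bring it under the same mask with no change to the policy itself.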

Personalizing Genie Code with instructions, skills, memory, and MCP

2026-06-01 23:45
Unlock the full potential of Genie Code by personalizing it to fit your team's workflow. 🛠️ The article emphasizes the importance of tailoring instructions, skills, and memory to enhance efficiency. By integrating the Model Context Protocol (MCP), teams can leverage Genie Code effectively. Discover how these adjustments can optimize your coding experience! 💻✨ #GenieCode #SoftwareDevelopment #TeamEfficiency #Personalization #CodingSkills
Source: Databricks Blog

Debunking 8 data layout myths: why Liquid Clustering outperforms partitioning

2026-06-01 15:00
Discover how Liquid Clustering is reshaping data layout in modern lakehouses. 📊 This approach outperforms traditional partitioning by addressing its limitations. Eight common myths about partitioning are debunked, showing that many teams may be missing out on better solutions. Users of Liquid Clustering experience significant gains in query latency, write throughput, and storage efficiency, especially at petabyte scale. 🚀 #DataManagement #LiquidClustering #Lakehouse #DataAnalysis #TechTrends
Source: Databricks Blog

Enabling Evolutionary Database Development: database branching with Lakebase

2026-05-29 22:04
Enabling Evolutionary Database Development is a new series by Pramod Sadalage and Kevin Hartman. It discusses the long-standing methodology of Evolutionary Database Design, emphasizing key practices and the evolution of database management since 2010. The introduction of copy-on-write database branching with Databricks Lakebase in 2026 is highlighted as a significant advancement, allowing for efficient, individual database instances for developers. This series explores the implications of...
Source: Databricks Blog

AI Doesn't Scale Until You Stop Calling It Innovation

2026-05-29 21:43
Many enterprises have teams focused on AI, but few have successfully operationalized it. Philippe Rambach of Schneider Electric emphasizes that aligning AI efforts with business value and customer needs is crucial for scaling. Companies that succeed integrate domain expertise with AI knowledge into dedicated teams. AI solutions can notably reduce energy costs by up to 20%. #AI #BusinessStrategy #Innovation #OperationalExcellence #EnergyEfficiency 🌟📊💡
Source: Databricks Blog

Databricks at SIGMOD 2026

2026-05-29 17:08
🚀 Exciting innovations from Databricks at SIGMOD 2026! The company is showcasing Spark Declarative Pipelines (SDP), aimed at simplifying ETL and streaming tasks. Additionally, their Enzyme engine, which focuses on incremental view maintenance, received an honorable mention at the conference. Connect with Databricks engineers to learn more about these advancements! #Databricks #SIGMOD2026 #DataEngineering #AI #Innovation
Source: Databricks Blog

Winning under CMS TEAM: Building the learning health system to realize success in VBC today and tomorrow

2026-05-29 13:53
New regulations from the CMS TEAM program will impact over 700 hospitals in the U.S. starting January 1, 2026. This initiative focuses on managing the total cost and quality of five common surgical episodes. Hospitals are encouraged to shift from retrospective reporting to proactive strategies for success. Understanding these changes will be crucial for healthcare providers. 🏥📊 #Healthcare #CMS #VBC #Surgery #HealthSystem
Source: Databricks Blog

How enterprise leaders are scaling AI agents across their organization

2026-05-28 19:30
🚀 Enterprise leaders are increasingly integrating AI agents into core workflows, impacting areas like HR, finance, and fraud detection. Executives report a common challenge: achieving quick results while maintaining governance, trust, and cost control. Key practices for scaling AI responsibly include adopting best practices that prioritize efficiency and oversight. #AI #Leadership #Innovation #BusinessStrategy #TechTrends
Source: Databricks Blog

Unity Catalog and the next era of Apache Iceberg™

2026-05-28 18:58
🚀 The future of data management is here with Unity Catalog, paving the way for the next era of Apache Iceberg. This open catalog enhances accessibility and organization of data, supporting the development of open table formats. Explore how Unity Catalog is shaping the open lakehouse landscape! 🌊📊 #DataManagement #ApacheIceberg #UnityCatalog #OpenLakehouse #DataAccessibility
Source: Databricks Blog

Reliable LLM Inference at Scale

2026-05-27 20:20
Databricks has developed a unique platform for reliable large language model (LLM) inference at scale. The article discusses the lessons learned while building this infrastructure, emphasizing the importance of reliability in deploying LLMs effectively. Key insights include strategies for maintaining performance and ensuring scalability in various applications. Explore how Databricks is shaping the future of AI. 🌐💡 #LLM #AI #Databricks #MachineLearning #TechInnovation
Source: Databricks Blog

BI Serving Pointers; Maximizing for Performance and TCO

2026-05-27 20:15
Struggling with slow BI dashboards? 🖥️ This article outlines key strategies for optimizing performance and reducing total cost of ownership (TCO). 1️⃣ Use star schemas and liquid clustering to enhance query speed. 2️⃣ Implement Unity Catalog Metric Views for consistent business metrics across tools. 3️⃣ Leverage aggregate-aware materialization for efficient data access. #BIDashboards #DataOptimization #BusinessIntelligence #TCO #DataStrategy
Source: Databricks Blog

How the lakebase architecture stays resilient to cloud failures

2026-05-27 15:15
🌐 The recent article discusses how lakebase architecture addresses cloud failures. It highlights that agent workloads are changing reliability needs in cloud systems. Agents create databases four times faster than humans and require serverless, auto-scaling infrastructure. Lakebase starts tens of millions of databases daily, emphasizing resilience in its design. #CloudComputing #DataArchitecture #TechInnovation #Reliability #Serverless
Source: Databricks Blog

Introducing Always-On pricing: automatic savings for Databricks Lakebase

2026-05-27 15:15
🚀 Exciting news for Databricks users! The new Always-On pricing model offers flexibility without the serverless vs. provisioned dilemma. Enjoy a 25% lower price on baseline capacity with no long-term commitments. Activation is simple, and you can manage billing effortlessly. After 24 hours of continuous use, you'll benefit from the Always-On rate! #Databricks #AlwaysOnPricing #CloudComputing #DataManagement #TechNews
Source: Databricks Blog

Announcing Lakebase Change Data Feed (CDF)

2026-05-27 13:11
🚀 Exciting news from Lakebase! They have introduced a Change Data Feed (CDF) in Public Preview. This innovation simplifies data transfer from operational databases. Now, teams can enable the feed once and allow various engines and models to access it directly, reducing the need for multiple pipelines. #DataManagement #Lakebase #ChangeDataFeed #Innovation #TechUpdates
Source: Databricks Blog

Building a FHIR-native health data platform on Databricks Lakebase

2026-05-27 01:14
🌐 The article discusses the development of a FHIR-native health data platform on Databricks Lakebase. Health Samurai is key in standardizing clinical data from various sources, including HL7v2 and C-CDA, into FHIR format. This process includes terminology normalization and patient deduplication. Aidbox operates seamlessly on Databricks Lakebase, enhancing integration and data management in healthcare. #HealthTech #FHIR #DataManagement #HealthcareInnovation #Databricks
Source: Databricks Blog

AI readiness in telecommunications

2026-05-26 21:00
Telecom companies are keen on AI, with 97% of executives adopting it. However, many initiatives struggle to reach production scale. The main issue is "data debt," which refers to fragmented and ungoverned data that hampers effective AI use, rather than problems with the quality of AI models. 📊🤖📉 #Telecommunications #AI #DataManagement #TechTrends #Innovation
Source: Databricks Blog

Pharma launch analytics: How to compress the first 90 days and win the three years that follow

2026-05-23 02:15
Pharmaceutical launches are crucial, as the first 90 days shape a product's entire lifecycle. 📈 The article discusses the importance of rapid data analysis during this period. Companies gather insights on prescription trends, market access, and field force activities. 📊 Addressing the "90-Day Intelligence Problem" is key to understanding market responses and overcoming barriers effectively. #PharmaLaunch #MarketAccess #DataAnalytics #Healthcare #Pharmaceuticals 💊
Source: Databricks Blog

Scaling for MHHS: how Octopus Energy achieved a 50x cost reduction in margin data engineering

2026-05-23 00:40
Octopus Energy successfully re-engineered its data pipelines to address the increasing demands of the UK's energy grid. A team of three engineers managed to handle a 48x increase in data volume while achieving a remarkable 50x cost reduction in margin data engineering. This initiative highlights the importance of efficient data management in the energy transition. ⚡️📊 #DataEngineering #EnergyTransition #OctopusEnergy #Innovation #CostReduction
Source: Databricks Blog

Accelerating LLM Inference with Prompt Caching for Open‑Source Models on Databricks

2026-05-22 20:00
🚀 The article discusses the benefits of prompt caching for accelerating LLM inference on Databricks. 🔍 It highlights how this technique enhances the efficiency and speed of open-source language models, making them more accessible for users. 📈 The authors emphasize the importance of security and performance in implementing prompt caching. #LLM #Databricks #MachineLearning #AI #OpenSource
Source: Databricks Blog

Observability for any agent, anywhere: Production-ready tracing with OpenTelemetry & Unity Catalog on Databricks

2026-05-22 19:20
Unlocking observability in AI applications is crucial as they generate vast amounts of trace data. 🌐 The integration of OpenTelemetry with Unity Catalog on Databricks facilitates effective monitoring and analytics for AI agents. This combination enhances the continuous improvement of AI systems through better evaluation and oversight. 🔍📈 #AI #Observability #Databricks #OpenTelemetry #TechInsights
Source: Databricks Blog

How World Bank Group uses Databricks to eradicate poverty through shared knowledge

2026-05-22 15:00
🌍 The World Bank Group is leveraging Databricks to enhance its mission of eradicating poverty. By integrating structured data with unstructured documents, they have built a unified data and AI platform. This innovation streamlines research and improves access to insights, supporting millions of document downloads each month. 📊 The initiative accelerates global knowledge sharing and empowers teams to make informed decisions for poverty reduction. #WorldBank #DataForGood #PovertyReduction #AI...
Source: Databricks Blog

How Databricks Genie democratizes data access in financial services

2026-05-22 10:30
Databricks Genie is transforming data access in financial services. Despite significant investments in data infrastructure, many business leaders still rely on analytics teams for insights. This highlights the "Last Mile of Data Democratization," where decision-makers often lack essential SQL skills or business intelligence tools. Genie aims to bridge this gap and empower leaders with direct data access. 📊🔍 #DataDemocratization #FinancialServices #BusinessIntelligence #Databricks #AI
Source: Databricks Blog