Articles from Source: Databricks-Blog

Unlocking the Archives: Turning Unstructured Documents into a Searchable Database for Groundwater Discovery

2026-05-11 20:30
🌍 In Sudan, access to groundwater is crucial for communities. MapAid has partnered with Databricks for Good to tackle this issue by transforming 700 scanned hydrogeological documents into a searchable database. This project utilizes multimodal AI to classify and catalog important data, enhancing resource accessibility. Discover how technology can support water management efforts! 💧📊 #Groundwater #AI #WaterCrisis #SustainableDevelopment #DataManagement
Source: Databricks Blog

Predictive Quality Starts Where Defect Detection Stops

2026-05-11 11:11
Manufacturers are shifting from traditional defect detection to proactive quality management. Databricks Genie enables quality leaders to analyze complete operational datasets using natural language, integrating data from inspections and suppliers in real time. This approach addresses data latency issues, allowing for quicker, more informed quality decisions. #PredictiveQuality #Manufacturing #DataIntelligence #QualityManagement #Innovation 🚀📊🔍
Source: Databricks Blog

Retail markdown optimization: from reactive markdowns to proactive

2026-05-11 11:10
Retail markdown optimization is shifting from reactive to proactive strategies. 📈 CMOs often rely on slow, weekly reports, which can lead to excessive inventory and markdowns when market trends change. The key challenge is to analyze crucial data like trends and pricing more efficiently. Databricks Genie for Merchandise Intelligence offers a solution, enabling faster, data-driven decisions for better inventory management. 🛍️✨ #RetailOptimization #DataDriven #Merchandising #InventoryManagement...
Source: Databricks Blog

How Superhuman and Databricks built a 200K QPS inference platform together

2026-05-08 21:10
Superhuman has migrated its spelling and grammar correction workloads to the Databricks Model Serving Platform, working together with Databricks. The resulting platform handles over 200,000 queries per second (QPS) with a 60% increase in throughput, all while maintaining sub-second P99 latency. Learn more about this partnership and its impact on real-time inference. 🚀📈 #DataScience #TechInnovation #RealTimeAnalytics #Collaboration #MachineLearning
Source: Databricks Blog

Using MemAlign to Improve Evaluation of Traditional Machine Learning in Genie Code

2026-05-08 21:10
Genie Code, Databricks' new AI tool, creates full ML notebooks from natural language prompts. To assess the quality of these notebooks, nine LLM judges were developed. However, human evaluations showed significant disagreement between the LLM judges and human experts on various aspects, such as model training and data imputation. This highlights the need for improved evaluation methods in machine learning. 📊🤖 #MachineLearning #AI #DataScience #GenieCode #Databricks
Source: Databricks Blog

Addressing HR's widening capacity gap with AI

2026-05-08 18:00
HR leaders face a growing capacity gap, with 84% reporting frequent stress and declining workforce engagement. The cost of inaction is significant, leading to unfilled roles and lost productivity. To address this, organizations are encouraged to adopt AI in phases: first consolidating data, then adding analytics, and finally integrating AI into workflows. MathCo's NucliOS platform and Databricks' lakehouse architecture provide the necessary infrastructure for HR...
Source: Databricks Blog

MCP Marketplace Brings Real-Time Intelligence to Agentic Applications

2026-05-08 15:18
Discover how the MCP Marketplace enhances agentic applications with real-time intelligence. Static data limits agents' ability to reason and make decisions. The MCP Marketplace connects agents to reliable, external intelligence from partners like You.com and Moody's. Lakebase and Genie streamline workflows, enabling agents to retain context and present decisions in natural language for user approval. #AI #RealTimeIntelligence #AgenticApplications #MCPMarketplace #DataDriven
Source: Databricks Blog

Pushing the Frontier for Data Agents with Genie

2026-05-08 14:30
🚀 Introducing Genie, Databricks’ advanced data agent designed to tackle complex enterprise data queries! Genie integrates both structured and unstructured data sources, enhancing its ability to provide accurate insights. The article explores challenges faced by data agents and innovative techniques such as specialized knowledge search and Multi-LLM designs. Recent experiments show a significant boost in accuracy from 32% to over 90%, while also cutting costs and latency. #DataScience #AI...
Source: Databricks Blog

Energy trading analytics in a real-time market

2026-05-08 13:55
Energy trading faces challenges due to traditional batch analysis methods. ⏳ In a market with 15-minute price changes, delays can lead to significant revenue loss. Analysts experience bottlenecks, highlighting the need for real-time analytics. Databricks Genie offers a solution, providing traders with instant access to data for better decision-making. 📊⚡ #EnergyTrading #Analytics #RealTimeData #MarketTrends #DataSolutions
Source: Databricks Blog

First-party audience data is the ad sales relationship now

2026-05-08 13:55
The advertising landscape is evolving. 🌐 Media companies must leverage first-party audience data to secure RFPs. Sales teams are now expected to provide detailed audience insights and validated performance metrics to stay competitive. 📊 However, many face an "Insight Gap," struggling to access the necessary data for effective targeting and post-campaign analysis. Understanding your audience is crucial in this data-driven era. 📈 #Advertising #DataAnalytics #Media #AudienceInsights...
Source: Databricks Blog

Operating room utilization is hiding in your scheduling data

2026-05-08 10:27
Operating room utilization is crucial for healthcare systems, yet many operate at only 65-75% capacity. This gap leads to lost revenue and unmet patient needs. The issue lies in the "Operational Intelligence Gap," where daily performance reports arrive too late for timely interventions. Improving scheduling data could enhance OR utilization and operational efficiency. 📊💼 #Healthcare #OperationalEfficiency #Surgery #DataAnalytics #PatientCare
Source: Databricks Blog

Why telecom churn prediction misses the intervention window

2026-05-08 07:11
Telecom churn remains a significant challenge, as most intervention programs fail to act before customers decide to leave. The article highlights the "Velocity Problem in Retention Analytics," where data signals are missed due to slow responses from leadership. Databricks Genie for Retention Intelligence aims to address this by allowing leaders to quickly query data for timely interventions. 📊💼 #Telecom #CustomerRetention #DataAnalytics #ChurnPrediction #BusinessIntelligence
Source: Databricks Blog

Growth Analytics Is What Comes After Growth Hacking

2026-05-08 07:11
The article discusses the evolution from growth hacking to growth analytics in user acquisition. 📈 As easy user acquisition fades, successful growth teams focus on understanding their funnels, cohorts, and unit economics. However, many organizations struggle with fragmented analytics tools that can't efficiently provide the insights needed for timely decision-making. 🔍 Emphasizing analytical depth is now crucial for sustained growth. #GrowthAnalytics #UserAcquisition #DataDriven...
Source: Databricks Blog

Approximate Answers, Exact Decisions: New Sketch Functions for Analytics

2026-04-29 20:01
Discover new sketch functions in Databricks that enhance analytics by speeding up percentiles, distinct counts, and top-K queries. These improvements support decision-making rather than merely auditing data, making insights more accessible. Learn how these functions can transform your analytical processes! 📊✨ #Databricks #Analytics #DataScience #DecisionSupport #Innovation
Source: Databricks Blog
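
The item above mentions sketch-based top-K queries. As a rough illustration of the general idea behind such approximate summaries, here is a minimal sketch of the classic Misra-Gries heavy-hitters algorithm in plain Python. This is an assumption chosen for illustration only: it is not the Databricks implementation, and the function name `misra_gries` and the sample data are hypothetical.

```python
import random


def misra_gries(stream, k):
    """Summarize a stream with at most k - 1 counters.

    Any item occurring more than n / k times (n = stream length) is
    guaranteed to survive, and each surviving count underestimates the
    true count by at most n / k.
    """
    counters = {}
    for item in stream:
        if item in counters:
            counters[item] += 1
        elif len(counters) < k - 1:
            counters[item] = 1
        else:
            # No free counter: decrement every counter and drop zeros.
            for key in list(counters):
                counters[key] -= 1
                if counters[key] == 0:
                    del counters[key]
    return counters


# Hypothetical data: two frequent items plus 20 one-off items (n = 100).
stream = ["a"] * 50 + ["b"] * 30 + [f"x{i}" for i in range(20)]
random.Random(0).shuffle(stream)

summary = misra_gries(stream, k=4)
print(summary)  # "a" and "b" are always present; counts are lower bounds
```

The trade-off is the one the item describes: the summary uses a fixed, small amount of memory and gives bounded-error answers, which is often enough to support a decision even though it is not an exact count.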

Companies Winning with AI Built the Data Layer First

2026-04-29 19:00
Trinity Industries demonstrates the importance of a strong data foundation for successful AI implementation. 📊 By consolidating fragmented systems, they improved on-time material delivery by 15% and enhanced ETA model accuracy by 50%. ⚙️ A unified and governed data layer is crucial for real-time AI and efficient decision-making. Companies prioritizing their data architecture will lead in the AI space. 🚀 #AI #DataStrategy #BusinessInsights #TrinityIndustries #Innovation
Source: Databricks Blog

Rethinking SQL ETL for modern data platforms

2026-04-29 16:45
Modern data platforms are built on SQL, but fragmented SQL ETL processes can lead to hidden costs and inefficiencies. 📊 The article discusses how multiple tools and warehouses complicate operations, resulting in slower incident resolution. 🛠️ A unified SQL ETL platform can streamline processes, reduce complexity, and enhance team collaboration. 🚀 #DataAnalytics #SQLETL #DataManagement #TechTrends #DataStrategy
Source: Databricks Blog

Stripe data now available on Databricks via Databricks Marketplace

2026-04-29 15:20
🚀 Great news for Databricks users! Stripe data is now accessible through the Databricks Marketplace. This integration allows you to analyze payment and business data directly in your workspace. With Delta Sharing, you can perform AI-native analysis without the need for ETL processes. #Databricks #Stripe #DataAnalytics #DeltaSharing #AI
Source: Databricks Blog

Databricks and Stripe Projects: Infrastructure Built for Agents

2026-04-29 15:20
🚀 Databricks is partnering with Stripe Projects to enhance AI coding capabilities. The integration allows AI agents to provision Neon Postgres databases without human intervention. Agents could already create full-stack applications quickly, but they relied on humans for infrastructure setup. With Stripe Projects, agents can now deploy production-ready databases in seconds, streamlining the development process. ⏱️ #AI #TechInnovation #Databricks #Stripe #DatabaseManagement
Source: Databricks Blog

Agents are ready but your architecture probably isn't

2026-04-29 15:00
🚀 Enterprises are experiencing high levels of AI activity, but many struggle to extract real value. Data silos and inadequate governance often hinder the effectiveness of agentic systems. CDOs and CTOs are urged to prioritize a transactional database tailored for action and clearly define success before proceeding. #AI #DataStrategy #DigitalTransformation #BusinessInsights #TechTrends
Source: Databricks Blog

Interoperability Between Unity Catalog and Google BigQuery via Catalog Federation

2026-04-29 13:33
🚀 Exciting news for data users! Customers can now access the same data from both Google BigQuery and Databricks without duplication. This is made possible through catalog federation, allowing seamless reading of tables in Unity Catalog from BigQuery. Databricks also supports federation with Google Cloud’s Lakehouse. #DataInteroperability #GoogleCloud #Databricks #UnityCatalog #BigQuery
Source: Databricks Blog

Built In, Not Bolted On: What AI-Native Actually Means in Cybersecurity

2026-04-28 21:45
Neal Bradbury discusses the importance of AI-native applications, arguing that they should have intelligence embedded at their core rather than added later. This approach helps in creating proprietary telemetry, offering a competitive edge that standard SaaS models lack. Additionally, effective cybersecurity requires cross-functional alignment focused on shared outcomes instead of...
Source: Databricks Blog

Operationalizing AI for public sector fraud prevention

2026-04-28 20:00
Public sector agencies are facing new challenges in fraud prevention due to advancements in AI technology. 🤖 As criminals adopt synthetic identities and deepfake tactics, traditional risk controls are becoming less effective. Agencies must incorporate clean data, smart automation, and real-time insights to enhance their fraud detection capabilities. Operationalizing AI will enable quicker and more accurate decision-making in tackling these sophisticated threats. 🔍 #FraudPrevention #AI...
Source: Databricks Blog

From months to minutes: Building real-time clinical data pipelines with natural language

2026-04-28 19:06
Transforming clinical data processing is now faster than ever! ⏱️ A new partnership between Databricks and Redox allows teams to build real-time clinical data pipelines, reducing integration time from months to minutes. This innovation utilizes natural language processing to streamline electronic health record (EHR) data management. Stay updated on the future of healthcare technology! 💡 #HealthcareInnovation #RealTimeData #DataIntegration #EHR #HealthTech
Source: Databricks Blog

Agentic Data Engineering with Genie Code and Lakeflow

2026-04-28 15:00
🚀 Exciting developments in data engineering are here! Genie Code is an AI tool designed to assist data engineers throughout the entire data engineering lifecycle. It integrates seamlessly with Lakeflow, allowing users to build pipelines, orchestrate workflows, and monitor processes all in one place. With the ability to generate production-ready data pipelines using natural language, Genie Code aims to simplify the development and deployment of data solutions. #DataEngineering #AI #GenieCode...
Source: Databricks Blog

Securely send first-party conversion signals with Snapchat Conversions API on Databricks Marketplace

2026-04-28 14:50
🚀 Exciting news for marketers! Snapchat has launched its Conversions API on Databricks Marketplace, allowing users to connect first-party Lakehouse data directly to Snapchat. This integration helps enhance Event Match Quality and boosts campaign performance. Discover how to optimize your advertising efforts with this new feature! #SnapchatAPI #Databricks #MarketingInnovation #AdTech
Source: Databricks Blog

How leading tech companies are killing the builder’s tax with Lakebase

2026-04-27 23:23
Leading tech companies are addressing the "builder's tax" in AI development through Databricks Lakebase. By collapsing data movement layers, they eliminate ETL processes and unify operational and analytical data on one governed platform. This approach enables real-time intelligence, streamlining the development of AI-native apps. #TechInnovation #AI #DataManagement #Databricks #Lakebase
Source: Databricks Blog

Inside one of the first production deployments of Lakebase: LangGuard's agentic workflow governance engine

2026-04-27 10:55
Discover how Lakebase is transforming enterprise operations with LangGuard's agentic workflow governance engine. 🌐 This innovative solution addresses the complexities of managing numerous AI agents, tools, and systems in real time. It offers the necessary infrastructure for efficient oversight and control. Learn more about this pivotal development in AI integration! 🤖🔧 #AI #WorkflowGovernance #Lakebase #LangGuard #Innovation
Source: Databricks Blog

The next generation of Databricks Genie

2026-04-26 20:39
🚀 Exciting news! The next generation of Databricks Genie has been launched. This updated version allows users to access insights beyond the limitations of a Genie Space. It also integrates with external knowledge sources like Google Drive. This development aims to enhance business user access to vital data insights. #Databricks #Genie #DataInsights #BusinessIntelligence #Innovation
Source: Databricks Blog

Model Risk Management in 2026: A Banker’s Guide to the Revised Interagency Guidance

2026-04-25 00:44
📊 The Federal Reserve has updated model risk management (MRM) guidance as of April 17, 2026. Key changes emphasize the importance of platform architecture over traditional procedural compliance. This shift reflects the evolving landscape of financial services. Bankers should adapt to these new standards to manage risks effectively. #ModelRiskManagement #Banking2026 #FinancialServices #RiskManagement #FederalReserve
Source: Databricks Blog

OpenAI GPT-5.5 now available on Databricks, fully-governed through Unity AI Gateway

2026-04-24 22:00
OpenAI's GPT-5.5 is now available on Databricks, enhancing enterprise capabilities. This model supports coding workflows with Codex and improves data pipeline efficiency. Users can build agents and interact with their data using natural language through Genie. Unity AI Gateway ensures governance for these AI applications. #OpenAI #Databricks #AI #Innovation #EnterpriseTech 🌟💻📊
Source: Databricks Blog

Operational databases: How they work and when to use them

2026-04-24 07:11
Operational databases, or OLTP databases, focus on speed and accuracy for real-time transaction processing. They support concurrent user interactions but face challenges with unstructured data and AI workloads. Legacy systems struggle with slow ETL pipelines, limiting their effectiveness. A new solution, the Lakebase, merges transactional capabilities with the flexibility of data lakes, addressing modern data demands. 🌊💾 #Databases #DataManagement #AI #OLTP #Lakebase
Source: Databricks Blog

Databricks partners with OpenAI on GPT-5.5

2026-04-23 23:00
🚀 Databricks has announced a partnership with OpenAI on GPT-5.5, the latest iteration of OpenAI's advanced language model. The new model has achieved state-of-the-art performance on the Databricks OfficeQA benchmark, showcasing its potential in various applications. Stay tuned for more updates on this collaboration! #Databricks #OpenAI #GPT5 #AIInnovation #TechNews
Source: Databricks Blog

Announcing the Public Preview of Lakeflow Designer

2026-04-23 10:26
🚀 Exciting news! The public preview of Lakeflow Designer has been announced. This no-code, AI-native tool is designed for seamless data preparation on Databricks. It emphasizes a fully governed experience, making data handling simpler for users. Stay tuned for more updates! #LakeflowDesigner #DataPreparation #AI #Databricks #NoCode
Source: Databricks Blog

Are LLM agents good at join order optimization?

2026-04-22 21:30
Exploring the use of Large Language Model (LLM) agents for SQL join order optimization reveals both potential and challenges. Traditional query optimizers face difficulties in efficiently managing join orders. This article investigates how LLM agents may improve this process. The findings suggest that while LLMs could enhance optimization, further research is necessary to fully harness their capabilities. #DataScience #SQL #AI #DatabaseOptimization #LLM 🌐📊💡
Source: Databricks Blog
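
To make concrete why join order matters in the item above, here is a toy sketch in Python that scores every left-deep join order by the total size of its intermediate results and picks the cheapest. Everything here is an assumption for illustration: the table names, row counts, selectivities, and the cost model are hypothetical, and this is neither the article's method nor how a production optimizer or an LLM agent works.

```python
from itertools import permutations


def plan_cost(order, rows, selectivity):
    """Sum of estimated intermediate result sizes for a left-deep plan."""
    joined = [order[0]]
    size = rows[order[0]]
    cost = 0.0
    for table in order[1:]:
        size *= rows[table]
        # Apply every join predicate linking the new table to those
        # already joined; a missing predicate means a cross product (1.0).
        for prev in joined:
            size *= selectivity.get(frozenset((prev, table)), 1.0)
        joined.append(table)
        cost += size
    return cost


def best_order(rows, selectivity):
    """Brute-force search over all left-deep join orders."""
    return min(permutations(rows), key=lambda o: plan_cost(o, rows, selectivity))


# Hypothetical statistics.
rows = {"orders": 1_000_000, "customers": 10_000, "regions": 10}
selectivity = {
    frozenset(("orders", "customers")): 1 / 10_000,
    frozenset(("customers", "regions")): 1 / 10,
}

best = best_order(rows, selectivity)
print(best, plan_cost(best, rows, selectivity))
```

Brute force is only feasible for a handful of tables because the number of orders grows factorially, which is part of why join ordering remains a hard problem for optimizers.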

How conversational analytics removes the BI bottleneck

2026-04-22 18:30
🌟 In a recent discussion, Ari Kaplan from Databricks highlighted the role of conversational analytics in bridging gaps in Business Intelligence (BI). This approach offers actionable insights that traditional BI often lacks. It emphasizes the importance of governance and semantic layers to enhance trust in AI-driven analytics for executives. With tools like Databricks Genie and Lakebase, businesses are urged to operationalize intelligence or risk falling behind. #ConversationalAnalytics...
Source: Databricks Blog

How to transform document activation workflows with Genie and Agent Bricks

2026-04-22 17:58
Transform your document workflows with Genie and Agent Bricks! 📄✨ Many enterprises face challenges with manual document extraction, leading to inefficiencies and compliance risks. By integrating AI/BI tools, organizations can streamline their processes. Using a multi-agent workflow, businesses can turn essential documents into searchable, actionable data across various departments like marketing and HR. This shift enhances overall productivity and data governance. #DocumentIntelligence #AI...
Source: Databricks Blog

Beyond the spreadsheet: how Databricks is delivering the modern CFO in Financial Services

2026-04-22 13:00
Databricks is transforming the role of the CFO in financial services. The company is leveraging real-time data and AI to enhance decision-making processes. This shift moves the CFO's focus from traditional spreadsheets to data-driven insights. The article highlights how these advancements are shaping a more efficient Office of the CFO. #Finance #CFO #Databricks #DataDriven #Innovation 📊🤖📈
Source: Databricks Blog

AI App Development: Guide To Building AI-Powered Apps

2026-04-22 09:40
🚀 Building AI-powered apps is now accessible to teams of all sizes! This guide outlines a structured process for AI app development, covering model strategy, prompt design, and data preparation. It emphasizes the importance of a repeatable path from concept to production. Choosing the right AI app builder is crucial; consider scope, integration, and deployment capabilities. Platforms like Databricks streamline these processes. #AIAppDevelopment #TechInnovation #DataScience #AppBuilding...
Source: Databricks Blog