Today we’re kicking off Soda Launch Week with a major announcement: Soda has acquired NannyML.
Together, we’re building the most intelligent, context-aware data quality platform on the market. One that helps you prevent issues before they become business problems, detect anomalies that actually matter, and trace root causes across the entire stack, from data ingestion to automated decision-making.
This move brings together two teams with a shared goal: helping data and AI teams ship reliable, production-grade systems they can trust, whether those systems power dashboards, models, or autonomous agents.
Let’s get into what this means, why we’re doing it, and what’s coming next.
If you’ve worked on data or AI infrastructure, you’ve lived this:
Most data quality tooling today can’t handle this. It was built for a different era of batch jobs, static schemas, predictable data flows. It flags too much noise, misses critical context, and rarely shows downstream impact.
At the same time, the systems we’re building today are more dynamic than ever:
In this world, traditional checks and anomaly detection aren’t enough. Data quality isn’t just about correctness anymore, it’s about consequence.
NannyML tackled one of the hardest problems in modern AI systems:
How do you monitor model performance in production, when there’s no ground truth yet?
Their open-source library introduced estimation-based performance monitoring, robust drift detection, and alerting designed for real-world ML pipelines. It became the go-to toolkit for teams running models where labels are delayed, sparse, or unavailable.
But more importantly, they saw what was coming:
That models don’t fail in isolation. They fail when data pipelines degrade, when user behavior shifts, when upstream assumptions break. And they believed the only way to solve this was to close the loop between data quality and AI behavior.
We’ve believed the same from day one.
By bringing our teams and platforms together, we’re unifying those layers. Delivering a product that can monitor your entire system, not just pieces of it.
With NannyML’s team and tech now integrated into Soda, here’s what this unlocks:
And yes, NannyML’s open-source project will remain open, maintained, and fully supported. We’re not sunsetting it. We’re expanding it.
Because the cost of bad data is rising, and fast.
The systems data powers today are higher-stakes, faster-moving, and harder to debug.
If your tooling doesn’t understand impact, it’s not helping. If it can’t handle emergence and drift, it’s irrelevant. And if it’s not built for AI-native environments, it’s already behind.
We’re not here to slap “AI” on legacy checks. We’re here to make data quality actually intelligent:
This acquisition accelerates that mission.
This is Day 1 of Launch Week. All week long, we’ll be announcing new capabilities and product drops that show what intelligent, AI-first data quality looks like in practice.
Here’s a preview of what’s coming:
We’re just getting started, and we’re building fast.
This is the next chapter for data quality.
Smarter. Faster. AI-ready.
And built for teams like yours.
The team has been cooking. We'd love to show you around.