Skip to main content
Product Thesis Hero Hi all, Today, we wanted the opportunity to share with our customers and the broader Data & AI community about our thesis on what we are building at Oxygen Intelligence, our core point of differentiation, and why we are building it now.

Why Now: Language Models as a disruptive force on enterprise data systems

For the last two decades, the phrase “Data is the new oil” has served as a common rallying cry for organizations to building robust internal data platforms to unlock business value from their data. We’ve come a long way since then. In particular, we’ve seen two substantial leaps in data:
  1. The rise of elastic, cloud-native infrastructure that makes analytical computation efficient.
  2. The development of frameworks that let teams build and productize data applications (dashboards, reports) and predictive models (classical ML algorithms).
SQL and Python became the lingua franca of this era. Still, in many ways, the innovation has been incremental. And while the tools have improved, the main bottleneck remains the same: the technical know-how required to operate them—proficiency in programming, modeling, and managing complex data systems. With the rise of Large Language Models in the 2020s, that technical bottleneck began to disappear. These models make it possible to build AI agents that can reliably translate between natural language L and programmatic output P with both high recall and high accuracy. In doing so, they bridge the long-standing gap between business intent and programmatic data queries. Perhaps even more profoundly, AI agents can now extend this translation beyond analytics: from product requirements written in natural language to the code that generates fully functional data products. We now can live in a world where questions about enterprise data, and the productization of data applications, models and workflows can happen in the most ergonomic and democratic way possible: through natural language, in plain English. We can call this new paradigm Agentic Data Intelligence.

Data Agents and Workflows as an open, standalone category

The Modern Data Stack (“MDS”) has become the backbone of how organizations store, transform, and visualize data. In many ways, it feels like the natural home for data agents—after all, agents interact with the same components that power analytics today: the BI layer, the ELT layer, and the data warehouse. However, the rise of data agents represents a more fundamental shift that necessitates a decoupling. While data agents can belong to an existing layer of the Modern Data Stack (“MDS”), e.g. as part of a BI tool, the ELT layer, or as part of the data warehouse, we believe that data agents, workflows, and the infrastructure powering them deserves to exist as a separate category. More strongly, not only as its own category, but on its on plane. Modern Data Stack Firstly, it’s now well understood that separation of concerns across data systems enables independent scaling, more reliable feedback loops, and thus, better enterprise outcomes and lower Total Cost of Ownership (TCO). Examples of this shift include separation of storage and compute, the cleaving of the control plane from the data plane, and finally, the composable shape characteristic of the Modern Data Stack itself. Secondly, making the system of record for agentic entities (such as agents and workflows) dependent on a specific vendor layer introduces risk through vendor lock-in. As the MDS continues to mature and consolidate, each layer is increasingly incentivized to become an “all-in-one” solution, replicating the closed-ecosystem dynamics that once characterized platforms like Qlik or Informatica. For enterprise buyers, this creates a brittle ecosystem — too many tools doing the same thing, none working well in concert. The market instead needs “an open, layer-agnostic, and extensible platform” — serving as a neutral system of record for Data Agents and Workflows, integrating across the entire stack. This represents what Oxygen Intelligence aims to provide via Oxy: an open-source Agentic Data Intelligence system powering data agents, workflows, and automations.

Determinism-first design

Determinism Design Unlike higher recall and lower precision environments that exist for agentic coding and agentic search, Agentic Data Intelligence operates in a lower recall and higher precision environment. This is one in which not answering a question is preferable to answering an analytics questions incorrectly. A number is simply wrong, not a tiny bit wrong, in stark contrast to a search result being a little off. To that end, they introduce the following concepts:
  1. Ontology infrastructure that provides both the semantic and operational definitions of how to acquire data and to operationalize workflows to get to and end work product
  2. Composable workflows that can be chained together in a reasoning chain by Data Agents to more deterministically get to most desired outputs.

Accuracy-guaranteed Workflow Automation

Workflow Automation Determinism-first design enables Oxygen Intelligence to work with customers in a differentiated manner. Within less than one year of operation, they’ve collaborated with high-growth startups and Fortune 1000 companies automating data analytics workflows with accuracy guarantees. Applications include Q&A automation as embedded analytics, internal sales & marketing deflection, and end-to-end document generation for executive reporting previously unavailable through traditional BI and data warehouse systems. They’re discovering that Oxy represents a truly platform product capable of automating diverse data analytics workflows, including those demanding multi-environment orchestration—chaining together different systems, including non-data systems. Their code-native design allows builders to maintain engineering best practices in maintaining the Ontology, Agents, and Workflows declaratively (in plain YAML), unlike brittle GUI-only systems. These features enable Oxy to be both comprehensive and developer-focused.

A Call to Action

Early days remain, but their determinism-first, open platform could become an important system of record for agents, workflows, and context needed to run both data-centric and generic workloads agenically. They invite participation in realizing this vision. Visit oxy.tech to book a demo or reach out at joseph@oxy.tech.