"it looks like the API lets people request metrics over date ranges, with filters, and grouped by different dimensions." This is really important and something I've been thinking about with metrics layers. Some metrics - especially financial metrics - are in the form of ratios, such as ROI. These can only be calculated after the user has selected their dimensions, as the numerator and denominator values of the ratio can vary independently depending on the dimensions selected. In other words they cannot be pre-calculated and stored (unless you try and store all permutations of dimension combinations) So any metrics layer needs to be able to receive the dimensional selection from the end user tool and calculate the metric on the fly. In older days we used to do this by implementing such metrics in the semantic layer or in database views.
Agreed, and my understanding is most metric layer tools can support this in some form. I think tools like Transform do a kind of smart caching where they try not to recalculate everything if they don't have to. But if they need to go back to the underlying data (for cases like this), they will.
Lol, yes that was me too. Thank you. I was looking for a while, and I thought I was overlooking it. Do you know any good, similar substitutes for data quality that are open source? In my research so far, I've found Griffin.
Yeah, I've seen tools like Profisee that are commercial, but some of these licenses for the data governance tools are pretty expensive. Azure has Purview, but the data quality feature on Purview is pretty lacking, so I wanted to use something in addition to it with Nifi. Thank you for your help, this helped me to narrow down my search. I'm going to keep working with Griffin for now.
"it looks like the API lets people request metrics over date ranges, with filters, and grouped by different dimensions." This is really important and something I've been thinking about with metrics layers. Some metrics - especially financial metrics - are in the form of ratios, such as ROI. These can only be calculated after the user has selected their dimensions, as the numerator and denominator values of the ratio can vary independently depending on the dimensions selected. In other words they cannot be pre-calculated and stored (unless you try and store all permutations of dimension combinations) So any metrics layer needs to be able to receive the dimensional selection from the end user tool and calculate the metric on the fly. In older days we used to do this by implementing such metrics in the semantic layer or in database views.
Agreed, and my understanding is that most metrics layer tools can support this in some form. I think tools like Transform do a kind of smart caching where they try not to recalculate everything if they don't have to. But if they need to go back to the underlying data (for cases like this), they will.
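I don't know Transform's internals, so take this as a hedged sketch of the general pattern rather than how any particular tool works: cache the additive components of the ratio at a fine grain, roll them up when the requested dimensions are covered by that grain, and fall back to the base table otherwise. Table and column names are again made up:

```sql
-- Hypothetical cache: additive components pre-aggregated at a fixed grain (region, channel, day)
CREATE TABLE roi_components_daily AS
SELECT
    region,
    channel,
    event_date,
    SUM(returned_amount - invested_amount) AS net_return,
    SUM(invested_amount)                   AS invested
FROM fact_investments
GROUP BY region, channel, event_date;

-- A request grouped by region alone can be served from the cache,
-- because the sums roll up cleanly and the ratio is taken last:
SELECT
    region,
    SUM(net_return) / NULLIF(SUM(invested), 0) AS roi
FROM roi_components_daily
GROUP BY region;

-- A request grouped by a dimension outside the cached grain (say, sales_rep)
-- cannot be answered from the cache and has to go back to fact_investments.
```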
The Minerva API does speak SQL, which we use with Superset (see the "Metrics for the Masses" section in https://medium.com/airbnb-engineering/supercharging-apache-superset-b1a2393278bd). Note we're in the process of writing the third blog in the series, which will describe this in more detail.
Where can you find the Minerva API? I tried to pip install it within Python, and it's saying the library doesn't exist.
Just in case you weren't the same person asking this on Twitter, https://twitter.com/bennstancil/status/1423727352207581187
Lol, yes that was me too. Thank you. I was looking for a while, and I thought I was overlooking it. Do you know any good, similar substitutes for data quality that are open source? In my research so far, I've found Griffin.
Not open source, unfortunately. I only know of the commercial tools that do this (of which there are a handful now).
Yeah, I've seen tools like Profisee that are commercial, but some of these licenses for the data governance tools are pretty expensive. Azure has Purview, but the data quality feature on Purview is pretty lacking, so I wanted to use something in addition to it with NiFi. Thank you for your help; this helped me narrow down my search. I'm going to keep working with Griffin for now.
Yeah, Robert said there was a bit more SQL under the hood (https://twitter.com/_rchang/status/1389761173982117891), though I still can't quite piece it all together in my head. Looking forward to the next post!