
57: Improving Data Quality Using Data Product SLAs with Egor Gryaznov of Bigeye
This week on The Data Stack Show, Eric and Kostas spend some time with Egor Gryaznov, co-founder and CTO of Bigeye. Egor discusses issues surrounding data quality in organizations of various sizes and uses helpful analogies from software engineering to help define data quality.
Audio is streamed directly from the publisher (afp-928695-injected.calisto.simplecastaudio.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.
Show Notes
Highlights from this week’s conversation include:
- Egor’s software engineering background and history with Uber (2:19)
- Experimentation platforms and analytics definitions (7:49)
- Bigeye’s function and use cases (9:40)
- Managing the relationship between the data engineer maintaining the pipelines and the downstream teams providing the context (18:49)
- Pinpointing problems in data compared to problems in software (21:55)
- Defining data quality at Bigeye (24:13)
- Machine learning models as a data product (28:38)
- Determining SLAs (32:22)
- How Bigeye brings different parties together and addresses natural communication barriers (36:42)
- Looking at when an organization needs to implement data quality tooling (45:54)
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.