Episode 86

Dagster with Sandy Ryza

We venture in the world of Data Pipelines with Dagster and Sandy Ryza

July 30, 202440m 39s

Audio is streamed directly from the publisher (dts.podtrac.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Original episode page

Show Notes

Today is time to talk about Data Pipelines and Data Engineering. I’m really excited to have on stage Sandy Ryza, Lead Engineer of Dagster.

If you’re a software engineer and you’re afraid of dealing with data pipelines, fear no more! Sandy is on a mission to make data pipelines easier to handle for software engineers. Join us in this episode to learn more about Dagster, and how it can make it easier for you to build and manage your data assets.

Enjoy the show 👨‍🍳

Show Notes

00.00 Intro
00.46 Episode Start
01.09 Sandy’s Introduction
02.29 What is Dagster?
05.14 How is Dagster affecting software engineers?
06.13 Data engineering as software engineering
11.34 Cloud vs Self-hosted
13.42 Dagster Plus vs Dagster Plus Pro
14.41 The history of Dagster
19.43 Who’s maintaining Dagster?
20.59 Contributing to Apache Spark
24.42 Being an Open Source Data Scientist
29.18 Speaking a different language than SWE
31.44 Moving from SWE to Data Scientist
34.38 Approaching the Data Scientist world
35.59 What’s next for Dagster
37.53 Further reading
39.46 Where people can find you online?

Resources

Show links

← All episodes of The Developers' Bakery