The Data Stack Show

502 episodes

The PRQL: Feature Stores and ML Ops with Simba Khadder of Featureform

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Simba Khadder of Featureform. Hosted by Simplecast, an AdsWizz company. See https://pcm.adswizz.com for information about our collection and use of personal data for advertising.

Jun 26, 2023 · 4 min

Shop Talk: Accountability and Opportunity for AI

bonus

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Jun 23, 2023 · 20 min

143: Collaborative Data Analytics on the Data Warehouse, featuring Rob Woollen & Stipo Josipovic of Sigma

Highlights from this week’s conversation include:
- Stipo and Rob’s background in data (2:43)
- What is Sigma? (7:46)
- Takeaways from building analytics products in-house (9:16)
- Sigma’s approach to datastore interface (11:32)
- Why analytics and BI are still not a solved problem (15:50)
- Combining SQL and spreadsheets for useful interface (23:17)
- The evolution of BI to today (29:40)
- Overcoming the challenges of collaboration in working with data (33:17)
- Creating operational coding that humans can understand (46:50)
- The future of BI (54:00)
- Cloud’s impact on BI and analytics (1:00:04)
- The value of getting close to the data for analytics (1:02:21)
- Final thoughts and takeaways (1:08:45)

Jun 21, 2023 · 1h 14m

The PRQL: Modern Analytics Using Common Paradigms, Featuring Rob Woollen & Stipo Josipovic of Sigma

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Rob Woollen & Stipo Josipovic of Sigma.

Jun 19, 2023 · 5 min

Shop Talk: Why AI Is Not Another Crypto

bonus

Jun 16, 2023 · 23 min

142: Martech’s Separation and Return to Data Infrastructure with Scott Brinker of HubSpot

Highlights from this week’s conversation include:
- Scott’s background in martech (3:10)
- Where things have gone wrong between IT and marketing (5:46)
- The explosion of digital marketing data (12:04)
- Costs of having data siloed (16:14)
- The convergence of marketing and IT teams around data (19:27)
- Navigating the massive landscape of martech tools (26:10)
- Needed tools in the martech stack (31:11)
- The importance of an accurate attribution model (34:37)
- Building tooling for marketers and developers to use (39:20)
- Future areas of development in the martech space (44:46)
- Final thoughts and takeaways (52:40)

Jun 14, 2023 · 57 min

The PRQL: Marketing, Martech, and Data with Scott Brinker of HubSpot

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Scott Brinker of HubSpot.

Jun 12, 2023 · 4 min

141: A Journey From Backend Engineer to Data Engineer with Ioannis Foukarakis of Mattermost

Highlights from this week’s conversation include:
- Ioannis’ background and journey in data (2:42)
- RudderStack’s transformations feature and examples of its application (4:20)
- Winning the transformations contest at RudderStack (7:21)
- How Ioannis’ transformation project works for data governance (9:40)
- Memories from college for Ioannis and Kostas (12:30)
- Getting into the world of software development (17:27)
- The changes in data and engineering over the years (20:29)
- Bridging Java with Python (23:15)
- Dealing with ML workloads in the past vs. workflows of today (26:30)
- Data engineers and ML engineers (33:12)
- Dealing with data in the early stages to ensure reliability later on (38:39)
- What creates problems with data quality? (42:11)
- Exciting developments in data engineering (46:48)
- Final thoughts and takeaways (51:12)

Jun 7, 2023 · 58 min

The PRQL: The Portability of Engineering Fundamentals with Ioannis Foukarakis of Mattermost

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Ioannis Foukarakis of Mattermost.

Jun 5, 2023 · 6 min

140: Stream Processing for Machine Learning with Davor Bonaci of DataStax

Highlights from this week’s conversation include:
- Davor’s journey from Google and what he was building there (3:32)
- How work in stream processing changed Davor’s journey (5:10)
- Analytical predictive models and infrastructure (9:39)
- How Kaskada serves as a recommendation engine with data (14:05)
- Kaskada’s user experience as an event processing platform (20:06)
- Enhancing typical feature store architecture to achieve better results (23:34)
- What is needed to improve stream and batch processes (27:39)
- Using another syntax instead of SQL (36:44)
- DataStax acquiring Kaskada and what will come from that merger (40:24)
- Operationalizing and democratizing ML (47:54)
- Final thoughts and takeaways (56:04)

May 31, 2023 · 1h 1m

The PRQL: Kaskada Serving as a Recommendation Engine with Davor Bonaci of DataStax

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Davor Bonaci of DataStax.

May 29, 2023 · 4 min

139: Decoupling the Execution Engine From Python’s Pandas with Aditya Parameswaran of Ponder

Highlights from this week’s conversation include:
- Aditya’s background and journey in the data space (2:47)
- What does Ponder do? (5:18)
- 101 on Pandas and why people utilize it (6:42)
- The challenge of translating Pandas to a big data platform (16:11)
- Data Warehouses and ML workflows (21:27)
- The differences in the “zoo” of data languages (26:56)
- Why do ML and data engineering have to be so different in languages? (34:39)
- Builders should be adapting to the users and not the other way around (39:32)
- Will we see a singular data interface in the future? (46:19)
- Aditya’s most surprising discovery in his research (50:40)
- Final thoughts and takeaways (53:18)

Read more of Aditya's work:
- Pandas vs. SQL – Part 1: The Food Court and the Michelin-Style Restaurant
- Pandas vs. SQL – Part 2: Pandas Is More Concise
- Pandas vs. SQL – Part 3: Pandas Is More Flexible
- Pandas vs. SQL – Part 4: Pandas Is More Convenient

May 24, 2023 · 57 min

The PRQL: Removing the Execution Engine Language Barrier with Aditya Parameswaran of Ponder

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Aditya Parameswaran of Ponder.

May 22, 2023 · 4 min

138: Paradigm Shift: Batch to Data Streaming with A.J. Hunyady of InfinyOn

Highlights from this week’s conversation include:
- A.J.’s background and journey in data (2:23)
- Challenges with Hadoop ecosystem (8:50)
- Starting InfinyOn and the need for innovation (10:02)
- Challenges with Kafka and Microservices (14:01)
- Real-time data streaming for IoT devices (19:28)
- Paradigm shift to real-time data processing (22:17)
- Benefits of Rust (29:45)
- WebAssembly and Platform Features (36:29)
- Analytics and Event Correlation (40:16)
- Real-time data processing (47:03)
- ETL vs ELP (52:20)
- Final thoughts and takeaways (57:07)

May 17, 2023 · 1h 2m

The PRQL: Data Infrastructure Systems and the Rust / WebAssembly Combo with A.J. Hunyady of InfinyOn

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with A.J. Hunyady, Founder and CEO of InfinyOn.

May 15, 2023 · 4 min

137: Data Collection Secrets & The Search Data Problem with Josh Wills

Highlights from this week’s conversation include:
- Josh’s background in data working at Google, Slack, and other companies (1:21)
- The need and process for high quality data (4:33)
- Digging into auction code (14:03)
- Joining Slack and working in the early days of the company (18:00)
- Not fighting the last war in data (25:42)
- Building a product, while using the product (30:35)
- Transitioning to the search team at Slack (36:50)
- Usage patterns of search (41:21)
- Josh’s work in helping build DuckDB (46:20)
- Having the right toolset to increase precision and efficiency (52:42)
- Final thoughts and takeaways (56:03)

May 10, 2023 · 58 min

The PRQL: Data Engineers in the Front End with Josh Wills

bonus

In this bonus episode, Eric previews his upcoming conversation with Josh Wills, an experienced data scientist who has worked with IBM, Google, Slack, DuckDB, and more.

May 8, 2023 · 2 min

136: System Evolution from Hadoop to RocksDB with Dhruba Borthakur of Rockset

Highlights from this week’s conversation include:
- Dhruba’s journey into the data space (2:02)
- The impact of Hadoop on the industry (3:37)
- Dhruba’s work in the early days of the Facebook team (7:54)
- Building and implementing RocksDB (14:33)
- Stories with Mark Zuckerberg at Facebook (24:25)
- The next evolution in storage hardware (26:14)
- How Rockset is different from other real-time platforms (33:13)
- Going from a key value store to an index (37:15)
- Where does Rockset go from here? (44:59)
- The success of RocksDB as an open source project (49:11)
- How do we properly steward real-time technology for impact (51:17)
- Final thoughts and takeaways (56:18)

May 3, 2023 · 1h 0m

The PRQL: Hardware Innovation Begets Software Innovation with Dhruba Borthakur Co-Founder and CTO, Rockset

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Dhruba Borthakur of Rockset.

May 1, 2023 · 3 min

Data Council Week (Ep 7) - What’s Next for Data Council? With Pete Soderling of Data Council

bonus

Highlights from this week’s conversation include:
- The origin story of Data Council (0:39)
- Developments for the future of Data Council (2:42)
- The emphasis of AI and ChatGPT at this year’s conference (3:54)
- The support of the data community (5:31)
- Biggest changes and innovations in the industry (7:10)
- What’s next for the Data Council? (10:46)
- Getting connected with Data Council (13:07)

Apr 28, 2023 · 15 min

Data Council Week (Ep 6) - All About Debezium and Change Data Capture With Gunnar Morling of Decodable

bonus

Highlights from this week’s conversation include:
- Gunnar’s background in data (0:32)
- Setting the vision in early days of Red Hat and spearheading Debezium (6:20)
- Replication of data in Debezium (9:47)
- The patterns and processes of Debezium (16:21)
- Debezium working with Kafka (19:03)
- Building a diverse system while incorporating common interfaces (24:09)
- The importance of documentation in open-sourced projects (27:59)
- Debezium’s vision moving forward (31:32)
- Why aren’t there more CDC open-sourced solutions? (34:35)
- Connecting with Gunnar (37:27)

Apr 27, 2023 · 39 min

Data Council Week (Ep 5) - The Difference Between Data Platforms and ML Platforms with Michael Del Balso of Tecton

bonus

Highlights from this week’s conversation include:
- Michael’s journey to co-founding Tecton (0:22)
- The evolution of MLOps and platform teams (3:50)
- Understanding boundaries between the data platform and MLOps (8:42)
- Differences in machine learning vs data pipelines (16:58)
- The systems needed to handle all these types of data (22:22)
- Developer experience in Tecton (25:15)
- Automating challenges in ML development (32:30)
- The most difficult part of the life cycle of prediction (37:24)
- Exciting new developments at Tecton (39:27)

Apr 26, 2023 · 43 min

Data Council Week (Ep 4) - Using Data Anonymization for Identity Protection With Will Thompson of Privacy Dynamics

bonus

Highlights from this week’s conversation include:
- Will’s background in data (0:28)
- Privacy Dynamics and data anonymization (4:18)
- Addressing data privacy problems in the space (10:33)
- Developer experience with Privacy Dynamics (13:49)
- How does Privacy Dynamics work? (21:09)
- Update of real-time anonymized data (26:29)
- The problem of dates and other complexities in data (31:24)
- Being a data engineer in a startup (34:44)
- Moving at the speed of a startup (41:01)
- Connecting with Will and Privacy Dynamics (43:28)

Apr 26, 2023 · 46 min

Data Council Week (Ep 3) - GTM 101 for Engineers With Chase Roberts of Vertex Ventures

bonus

Highlights from this week’s conversation include:
- Chase’s journey to where he is today (0:51)
- Lessons from go-to-market roles that help in the VC world (2:38)
- Differentiating between go-to-market and distribution (8:13)
- Taking an idea to the market (11:33)
- Hardest part of the pitch (17:08)
- Playbooks for go-to-market founders to follow (20:25)
- Focus of sales and marketing in go-to-market strategy (28:01)
- Answering the what and how of the problem you are solving (32:30)
- The importance of pricing in a go-to-market strategy (46:11)
- Connecting with Chase (1:00:58)

Apr 25, 2023 · 1h 2m

Data Council Week (Ep 2) - The Convergence of MLops and DataOps With Team Featureform

bonus

Highlights from this week’s conversation include:
- Introducing the team from Featureform (0:31)
- In the work vs. leading the work (3:01)
- Difference between MLOps and data ops (7:06)
- The MLOps cycle (10:12)
- What is Featureform and what makes it different? (13:30)
- Is there another layer needed in feature stores? (18:46)
- Getting in touch with Featureform (23:55)

Apr 24, 2023 · 25 min

Data Council Week (Ep 1) - The Evolution of Stream Processing With Eric Sammer of Decodable

bonus

Highlights from this week’s conversation include:
- Eric’s journey to becoming CEO of Decodable (0:20)
- Does real time matter? (2:12)
- Differences in stream processing systems (7:57)
- Processing in motion (13:04)
- Why haven’t there been more open source projects around CDC? (20:34)
- The Decodable experience and future focuses for the company (24:31)
- Streaming processing and data lakes (32:54)
- Data flow processing technologies of today (39:01)

Apr 23, 2023 · 42 min

135: Database Knob Tuning and AI with Andy Pavlo and Dana Van Aken of OtterTune

Highlights from this week’s conversation include:
- Origins of OtterTune (4:43)
- The problem of knob tuning (6:25)
- Roles of machine learning (9:32)
- OtterTune’s development and industry recognition (12:03)
- The challenges of database tuning and the role of human expertise (16:15)
- Tuning in production (20:23)
- Observability and Data Collection (23:37)
- Data Security and Privacy (29:59)
- Optimizing on-prem vs. cloud workloads (35:52)
- Performance benchmarks (40:20)
- Future opportunities OtterTune is focusing on (43:55)
- Importance of automated tuning services (50:45)
- Challenges in Benchmarking Real Workloads (58:43)
- The Story Behind the Name OtterTune (1:08:58)
- Balancing Technology and Human Factors (1:13:23)

Apr 19, 2023 · 1h 15m

The PRQL: Database Tuning and Optimization with Andy Pavlo and Dana Van Aken of OtterTune

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Andy Pavlo and Dana Van Aken of OtterTune.

Apr 17, 2023 · 4 min

134: Unpacking the AI Revolution and the Technology Behind A Feature-First Future with H.O. Maycotte of FeatureBase

Highlights from this week’s conversation include:
- The journey of H.O. into data and becoming the CEO of FeatureBase (2:37)
- Characteristics of the super evolution in technology (6:36)
- ChatGPT as the missionary of AI (9:45)
- The tension between authenticity and technology (13:12)
- What is FeatureBase? (17:53)
- Comparing FeatureBase to feature stores (25:58)
- Workload capacities and possibilities in FeatureBase (33:20)
- The importance of developer experience on a platform (38:23)
- Exciting developments for FeatureBase in the future (47:13)
- Final thoughts and takeaways (53:52)

Apr 12, 2023 · 59 min

The PRQL: AI and the Super Evolution with H.O. Maycotte, CEO at FeatureBase

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with H.O. Maycotte of FeatureBase.

Apr 10, 2023 · 4 min

133: Building the Data Warehouse for Everything Else with Sammy Sidhu of Eventual

Highlights from this week’s conversation include:
- Sammy’s background in data and tooling (2:46)
- Going from multipurpose engineering to a CTO position (5:14)
- Changes in technology and deep learning models (7:31)
- The state of self-driving and adoption (13:49)
- What is Eventual and what are they solving in the space? (20:54)
- What are Daft and data frames, and how do they work? (28:11)
- Building a query optimizer (33:42)
- Sammy’s take on what is going on in data and future possibilities (45:18)
- Eventual’s future and its impact on the space (51:44)
- Final thoughts and takeaways (53:47)

Apr 5, 2023 · 57 min

The PRQL: Self-Driving Technology and Data Infrastructure with Sammy Sidhu, Co-Founder and CEO of Eventual

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Sammy Sidhu, Co-Founder and CEO of Eventual.

Apr 3, 2023 · 3 min

132: Data Quality and Data Contracts with Chad Sanderson of Data Quality Camp

Highlights from this week’s conversation include:
- Chad’s background in data (2:10)
- Breaking down data quality (4:02)
- Semantic and logical layers of data (10:04)
- What are data contracts and how do they work? (17:41)
- Implicit contracts at companies (24:01)
- Where do data contracts fit in data infrastructure? (28:14)
- The value of data contracts to the producer and consumer (31:18)
- Tools needed in effective data contracts (46:13)
- The importance of community in data quality (50:53)
- Getting connected to Data Quality Camp (1:00:55)
- Final thoughts and takeaways (1:01:53)

Mar 29, 2023 · 1h 6m

The PRQL: The Value of Data Contracts with Chad Sanderson, Head of Data, Data Contracts Advocate, Data Quality Camp

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Chad Sanderson of Data Quality Camp.

Mar 27, 2023 · 4 min

131: How Data Teams Interact With Marketing Tools with Jason Davis of Simon Data

Highlights from this week’s conversation include:Defining CDPs (2:28)The data team's role in marketing (7:41)Leveraging commonalities across businesses (12:49)Building a CDP with customer data (18:05)Challenges in identity modeling (23:00)CDP lifecycle and one-to-one data (30:06)Segmentation and optimization (33:23)Real-time data in the cloud (40:37)The future of AI and machine learning (43:02)Final thoughts and takeaways (46:42)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Mar 22, 2023 · 47 min

The PRQL: Unleashing the Potential of CDPs with Jason Davis, Co-Founder and CEO of Simon Data

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Jason Davis of Simon Data.

Mar 20, 2023 · 2 min

130: From Business Intelligence to Product Analytics and Beyond with Vijay Ganesan of NetSpring.io

Highlights from this week’s conversation include: Vijay’s background in data (2:09); The journey of founding ThoughtSpot and its impact in the world of BI (2:49); The maturation of BI (6:34); What is NetSpring.io? (8:21); Bridging the gap between BI and product analytics (14:41); Why data warehouses and not time-series databases? (19:58); The difficulty of using SQL in product analytics (28:35); Challenges in pricing models for product analytics and tooling (35:41); Combining analytics and attribution (42:00); What’s the next wave of product analytics? (47:28); Final thoughts and takeaways (53:41).

Mar 15, 2023 · 57 min

The PRQL: Business Intelligence and Product Analytics With Vijay Ganesan, Co-Founder and CEO at NetSpring.io

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Vijay Ganesan of NetSpring.io.

Mar 13, 2023 · 3 min

129: Databases, Data Warehouses, and Timeseries Data with David Kohn of Timescale

Highlights from this week’s conversation include: David’s background and journey to Timescale (2:12); What are time series databases? (14:13); How Timescale would have impacted David’s trajectory early in his career (17:51); Innovation in PostgreSQL (21:02); Why does Timescale build its time series databases differently? (27:08); The challenges of building a new database on top of old software (32:22); Writing outside of SQL and Timescale’s secret sauce (37:47); The importance of the developer experience in Timescale (54:08); How does someone know when they need to implement time series functionality? (56:51); Final thoughts and takeaways (1:04:57).

Mar 8, 2023 · 1h 9m

The PRQL: Time-Series Data 101

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with David Kohn of Timescale.

Mar 6, 2023 · 3 min

128: The Possibilities Are Endless for Synthetic Data with Alex Watson of Gretel.ai

Highlights from this week’s conversation include: Alex’s background working for the NSA and starting a company (1:51); The Gretel.ai journey (9:30); Defining synthetic data (13:26); The evolution of AI in deep learning data and language learning (16:28); The properties of synthetic data (21:31); Boundaries between synthetic data and prediction models (25:52); The developer experience in Gretel.ai (36:44); Stewardship and expansion of deep learning models in the future (45:36); Final thoughts and takeaways (52:17).

Mar 1, 2023 · 56 min

The PRQL: Boundaries Between Synthetic Data and Prediction Models

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Alex Watson of Gretel.ai.

Feb 27, 2023 · 3 min

127: The Anatomy of a Data Lakehouse with Alex Merced of Dremio

Highlights from this week’s conversation include: Alex’s background in the data space (2:41); Blending comics and pop culture with finance training (5:20); What is a data lakehouse? (7:36); What is Dremio solving for users? (11:21); Essential components of a data lakehouse (16:35); Difference between on-prem and cloud experiences (33:53); What does it mean to be a developer advocate? (41:31); Final thoughts and takeaways (49:02).

Feb 22, 2023 · 53 min

The PRQL: What Does It Mean to be a Developer Advocate?

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Alex Merced of Dremio.

Feb 20, 2023 · 3 min

126: Crossing the Product Analytics Chasm with Spenser Skates of Amplitude Analytics

Highlights from this week’s conversation include: Spenser’s journey to co-founding Amplitude (3:02); Looking back over the last decade of success at Amplitude (8:31); Going from engineer to sales (14:41); Comparing product analytics and general analytics (20:11); How cloud data warehousing has impacted analytics (31:38); Providing an out-of-the-box experience for consumers (41:12); Final thoughts and takeaways (54:27).

Feb 15, 2023 · 58 min

The PRQL: Amplitude - From Startup to IPO

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Spenser Skates of Amplitude Analytics.

Feb 13, 2023 · 3 min

125: Authorization Is A Data Problem with Jeff Chao of Abbey Labs

Highlights from this week’s conversation include: Jeff’s background at Netflix and Stripe leading him to Abbey Labs (2:22); What Abbey is solving in the space (5:16); Tackling permissions in an organization (7:30); Opportunities to improve the availability of data (10:14); The challenge of tackling a new problem area at a new company (14:59); What are the most common challenges in the identity and security space? (18:43); Importance of identity and the ability to track it in data (22:46); Connecting all the different platforms without frustrating the user (30:32); What are the parts of access data that need to be tracked? (36:10); Dealing with the varieties of data in security and managing permissions (40:26); Final thoughts and takeaways (51:52).

Feb 8, 2023 · 55 min

The PRQL: Solving Identity in Marketing vs. Security

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Jeff Chao of Abbey Labs.

Feb 6, 2023 · 5 min

124: Pragmatism About Data Stacks with Pedram Navid of West Marin Data

Highlights from this week’s conversation include: Pedram’s journey into the world of data (4:05); What should the data stack at an early-stage startup look like? (9:53); New ideas surrounding access control for data (24:45); What can data teams learn about complexity from software engineering? (30:55); Scaling up instead of scaling out in processing data (37:40); Why DuckDB is making so much noise in the market (41:06); Final thoughts and takeaways (53:25).

Feb 1, 2023 · 57 min

The PRQL: What Does the Modern Data Stack Mean to Normal Companies?

bonus

In this bonus episode, Eric and Kostas preview their upcoming conversation with Pedram Navid of West Marin Data.

Jan 30, 2023 · 4 min