PLAY PODCASTS
#24 - Moving to the Lakehouse: From Hive to Iceberg

#24 - Moving to the Lakehouse: From Hive to Iceberg

Bits of Chris: Augment, Stay Human · Chris Lettieri

March 25, 202410m 15s

Audio is streamed directly from the publisher (api.substack.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

Change is hard.

But it’s necessary.

In this Data Engineering episode, you'll learn:

* Hive tracks data as folders, Iceberg tracks data as files

* How this key distinction enables Iceberg with powerful metadata operations

* What a data lakehouse is in Data Engineering

* Iceberg’s schema evolution, partition evolution, and better query pruning



This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit bitsofchris.com