
#24 - Moving to the Lakehouse: From Hive to Iceberg
Bits of Chris: Augment, Stay Human · Chris Lettieri
March 25, 2024 · 10m 15s
Show Notes
Change is hard.
But it’s necessary.
In this Data Engineering episode, you'll learn:
* Hive tracks data as folders, while Iceberg tracks data as individual files
* How this key distinction enables Iceberg’s powerful metadata operations
* What a data lakehouse is in Data Engineering
* Iceberg’s schema evolution, partition evolution, and better query pruning
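
To make the folder-versus-file distinction concrete, here is a toy Python sketch. It is not Hive's or Iceberg's actual API: the `DataFile` class, the `hive_plan` and `iceberg_plan` functions, the `event_day` column, and all paths are hypothetical names invented for illustration. It only models the idea that a folder-based table finds data by partition directory, while a file-based table keeps a list of files with per-file statistics and can skip files using that metadata alone.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFile:
    """One tracked data file plus simple column statistics (hypothetical)."""
    path: str
    min_day: str  # smallest event_day value stored in the file
    max_day: str  # largest event_day value stored in the file


# Folder-style tracking: the table is a set of partition folders,
# and the files are whatever happens to sit inside each folder.
hive_partitions = {
    "event_day=2024-03-24": ["part-0.parquet", "part-1.parquet"],
    "event_day=2024-03-25": ["part-0.parquet"],
}


def hive_plan(day: str) -> list[str]:
    """Pick the matching partition folder, then read every file in it."""
    folder = f"event_day={day}"
    return [f"{folder}/{name}" for name in hive_partitions.get(folder, [])]


# File-style tracking: a manifest lists each file explicitly with its stats,
# so the physical layout of the files no longer defines the table.
manifest = [
    DataFile("data/a.parquet", "2024-03-24", "2024-03-24"),
    DataFile("data/b.parquet", "2024-03-24", "2024-03-25"),
    DataFile("data/c.parquet", "2024-03-25", "2024-03-25"),
]


def iceberg_plan(day: str) -> list[str]:
    """Keep only files whose min/max range could contain the requested day."""
    # ISO dates compare correctly as plain strings.
    return [f.path for f in manifest if f.min_day <= day <= f.max_day]


if __name__ == "__main__":
    print(hive_plan("2024-03-25"))
    print(iceberg_plan("2024-03-25"))
```

In this simplified model, pruning in `iceberg_plan` is a pure metadata lookup: no folder is listed and no data file is opened to decide what to read.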