
Episode 103
Becoming More Effective at Manipulating Data With Pandas
The Real Python Podcast · Real Python
March 25, 2022 · 59m 57s
Show Notes
<p>Do you wonder if you’re taking the right approach when shaping data in pandas? Is your Jupyter workflow getting out of hand? This week on the show, Matt Harrison talks about his new book, “Effective Pandas: Patterns for Data Manipulation.”</p>
<p>Matt discusses working as a corporate consultant and migrating Excel users toward Python. We explore several “NumPy-isms” that beginners get stuck on. Matt shares advice about chaining operations in pandas, which some developers find controversial.</p>
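<p>As a rough illustration of what chaining operations in pandas can look like, here is a minimal sketch. It is not taken from the book or the episode: the <code>raw</code> DataFrame, its column names, and the temperature values are all made up for the example.</p>

```python
import pandas as pd

# Hypothetical sample data, invented purely for illustration.
raw = pd.DataFrame(
    {
        "city": [" Oslo", "Bergen ", "Oslo", None],
        "temp_f": [50.0, 41.0, 59.0, 68.0],
    }
)

# One chain: each method returns a new object, so no intermediate
# variables are needed and the steps read top to bottom.
tidy = (
    raw
    .dropna(subset=["city"])                      # drop rows with no city
    .assign(
        city=lambda d: d["city"].str.strip(),     # remove stray whitespace
        temp_c=lambda d: (d["temp_f"] - 32) * 5 / 9,  # Fahrenheit to Celsius
    )
    .groupby("city", as_index=False)
    .agg(mean_temp_c=("temp_c", "mean"))          # average per city
    .sort_values("mean_temp_c", ascending=False)  # warmest first
)

print(tidy)
```

<p>Wrapping the chain in parentheses lets each step sit on its own line, which makes it easy to comment out a single step while debugging.</p>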
<div class="alert alert-primary" role="alert">
<p><strong>Course Spotlight:</strong> <a href="https://realpython.com/courses/sorting-data-python-pandas/">Sorting Data in Python With pandas</a></p>
<p>In this video course, you’ll learn how to sort data in a pandas DataFrame using the pandas sort functions <code>sort_values()</code> and <code>sort_index()</code>. You’ll learn how to sort by one or more columns and by index in ascending or descending order.</p>
</div>
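<p>For a quick sense of the two sort functions named in the course description, the sketch below sorts a small DataFrame by one column, by several columns, and by index. The data and column names are hypothetical and are not drawn from the course.</p>

```python
import pandas as pd

# Hypothetical sample data with a deliberately shuffled index.
df = pd.DataFrame(
    {
        "make": ["Ford", "Audi", "Ford", "Audi"],
        "year": [2019, 2021, 2021, 2018],
        "mpg": [24, 30, 27, 28],
    },
    index=[3, 1, 2, 0],
)

# Sort by a single column in descending order.
by_mpg = df.sort_values("mpg", ascending=False)

# Sort by several columns, with a separate direction for each one.
by_make_year = df.sort_values(["make", "year"], ascending=[True, False])

# Sort by the row index in ascending order (the default).
by_index = df.sort_index()

print(by_mpg, by_make_year, by_index, sep="\n\n")
```

<p>Both methods return a new DataFrame by default and leave <code>df</code> unchanged.</p>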
<p>Topics:</p>
<ul>
<li>00:00:00 – Introduction</li>
<li>00:01:32 – Working as a consultant</li>
<li>00:03:39 – Moving from Excel to Python</li>
<li>00:06:50 – Who is the book for?</li>
<li>00:10:15 – Using real data for examples</li>
<li>00:16:16 – Sponsor: CData Software</li>
<li>00:16:58 – What are patterns for data manipulation?</li>
<li>00:18:38 – Cleaning and preparing data</li>
<li>00:21:33 – What concepts were you most eager to share?</li>
<li>00:26:57 – An example of chaining operations in pandas</li>
<li>00:33:20 – NumPy-isms and other challenges in learning pandas</li>
<li>00:40:20 – The use of exercises throughout the book</li>
<li>00:43:50 – Video Course Spotlight</li>
<li>00:45:01 – Challenges of using color throughout the book</li>
<li>00:51:40 – Avoiding the slow path in pandas</li>
<li>00:56:03 – What are you excited about in the world of Python?</li>
<li>00:56:58 – What would you like to learn next?</li>
<li>00:58:16 – <em>Effective Pandas</em> book</li>
<li>00:58:38 – Social connections</li>
<li>00:58:58 – Thanks and goodbye</li>
</ul>
<p>Show Links:</p>
<ul>
<li><a href="https://store.metasnake.com/effective-pandas-book?coupon=REALPY"><em>Effective Pandas</em> Digital Book Discount Link</a></li>
<li><a href="https://store.metasnake.com/">Matt Harrison’s Site - MetaSnake</a></li>
<li><a href="https://pandas.pydata.org/">pandas - Python Data Analysis Library</a></li>
<li><a href="https://en.wikipedia.org/wiki/Law_of_Demeter">Law of Demeter - Wikipedia</a></li>
<li><a href="https://us.pycon.org/2022/">PyCon 2022 - Welcome to PyCon US</a></li>
<li><a href="https://pythonspeed.com/docker/">Production-ready Docker packaging for Python developers | Python=>Speed</a></li>
</ul>
<p>Level up your Python skills with our expert-led courses:</p>
<ul>
<li><a href="https://realpython.com/courses/gradebook-using-pandas-python/">Using pandas to Make a Gradebook in Python</a></li>
<li><a href="https://realpython.com/courses/pandas-dataframe-working-with-data/">The pandas DataFrame: Working With Data Efficiently</a></li>
<li><a href="https://realpython.com/courses/sorting-data-python-pandas/">Sorting Data in Python With pandas</a></li>
</ul>
<p><a rel="payment" href="https://realpython.com/join">Support the podcast &amp; join our community of Pythonistas</a></p>