PLAY PODCASTS
Becoming More Effective at Manipulating Data With Pandas
Episode 103

Becoming More Effective at Manipulating Data With Pandas

The Real Python Podcast · Real Python

March 25, 202259m 57s

Audio is streamed directly from the publisher (dts.podtrac.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

<p>Do you wonder if you&rsquo;re taking the right approach when shaping data in pandas? Is your Jupyter workflow getting out of hand? This week on the show, Matt Harrison talks about his new book, &ldquo;Effective Pandas: Patterns for Data Manipulation.&rdquo;</p> <p>Matt discusses working as a corporate consultant and migrating Excel users toward Python. We explore several &ldquo;NumPy-isms&rdquo; that beginners get stuck on. Matt shares advice about chaining operations in pandas, which some developers find controversial.</p> <div class="alert alert-primary" role="alert"> <p><strong>Course Spotlight:</strong> <a href="https://realpython.com/courses/sorting-data-python-pandas/">Sorting Data in Python With Pandas</a></p> <p>In this video course, you&rsquo;ll learn how to sort data in a pandas DataFrame using the pandas sort functions <code>sort_values()</code> and <code>sort_index()</code>. You&rsquo;ll learn how to sort by one or more columns and by index in ascending or descending order.</p> </div> <p>Topics:</p> <ul> <li>00:00:00 &ndash; Introduction</li> <li>00:01:32 &ndash; Working as a consultant</li> <li>00:03:39 &ndash; Moving from Excel to Python</li> <li>00:06:50 &ndash; Who is the book for?</li> <li>00:10:15 &ndash; Using real data for examples</li> <li>00:16:16 &ndash; Sponsor: CData Software</li> <li>00:16:58 &ndash; What are patterns for data manipulation?</li> <li>00:18:38 &ndash; Cleaning and preparing data</li> <li>00:21:33 &ndash; What concepts were you most eager to share?</li> <li>00:26:57 &ndash; An example of chaining operations in pandas</li> <li>00:33:20 &ndash; NumPy-isms and other challenges in learning pandas</li> <li>00:40:20 &ndash; The use of exercises throughout the book</li> <li>00:43:50 &ndash; Video Course Spotlight</li> <li>00:45:01 &ndash; Challenges of using color throughout the book</li> <li>00:51:40 &ndash; Avoiding the slow path in pandas</li> <li>00:56:03 &ndash; What are you excited about in the world of Python?</li> <li>00:56:58 &ndash; What would you like to learn next?</li> <li>00:58:16 &ndash; <em>Effective Pandas</em> book</li> <li>00:58:38 &ndash; Social connections</li> <li>00:58:58 &ndash; Thanks and goodbye</li> </ul> <p>Show Links:</p> <ul> <li><a href="https://store.metasnake.com/effective-pandas-book?coupon=REALPY"><em>Effective Pandas</em> Digital Book Discount Link</a></li> <li><a href="https://store.metasnake.com/">Matt Harrison&rsquo;s Site - MetaSnake</a></li> <li><a href="https://pandas.pydata.org/">pandas - Python Data Analysis Library</a></li> <li><a href="https://en.wikipedia.org/wiki/Law_of_Demeter">Law of Demeter - Wikipedia</a></li> <li><a href="https://us.pycon.org/2022/">PyCon 2022 - Welcome to PyCon US</a></li> <li><a href="https://pythonspeed.com/docker/">Production-ready Docker packaging for Python developers | Python=&gt;Speed</a></li> </ul> <p>Level up your Python skills with our expert-led courses:</p> <ul> <li><a href="https://realpython.com/courses/gradebook-using-pandas-python/">Using pandas to Make a Gradebook in Python</a></li> <li><a href="https://realpython.com/courses/pandas-dataframe-working-with-data/">The pandas DataFrame: Working With Data Efficiently</a></li> <li><a href="https://realpython.com/courses/sorting-data-python-pandas/">Sorting Data in Python With pandas</a></li> </ul> <p><a rel="payment" href="https://realpython.com/join">Support the podcast &amp; join our community of Pythonistas</a></p>