PLAY PODCASTS
The world's largest open library dataset (Practical AI #114)

The world's largest open library dataset (Practical AI #114)

Changelog Master Feed · Changelog Media

December 1, 202043m 58s

Audio is streamed directly from the publisher (op3.dev) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

Unsplash has released the world’s largest open library dataset, which includes 2M+ high-quality Unsplash photos, 5M keywords, and over 250M searches. They have big ideas about how the dataset might be used by ML/AI folks, and there have already been some interesting applications. In this episode, Luke and Tim discuss why they released this data and what it take to maintain a dataset of this size.

Join the discussion

Changelog++ members get a bonus 1 minute at the end of this episode and zero ads. Join today!

Sponsors:

  • LinodeGet $100 in free credit to get started on Linode – our cloud of choice and the home of Changelog.com. Head to linode.com/changelog OR text CHANGELOG to 474747 to get instant access to that $100 in free credit.
  • Changelog++ – You love our content and you want to take it to the next level by showing your support. We’ll take you closer to the metal with no ads, extended episodes, outtakes, bonus content, a deep discount in our merch store (soon), and more to come. Let’s do this!
  • LaunchDarkly – Power experimentation at any scale. Fast and reliable feature management for the modern enterprise.

Featuring:

Show Notes:

Something missing or broken? PRs welcome!