Organizing Google's Datasets
If you're a data scientist, there's a good chance…
October 31, 201615m 0s
Audio is streamed directly from the publisher (feeds.soundcloud.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.
Show Notes
If you're a data scientist, there's a good chance you're used to working with a lot of data. But there's a lot of data, and then there's Google-scale amounts of data. Keeping all that data organized is a Google-sized task, and as it happens, they've built a system for that organizational challenge. This episode is all about that system, called Goods, and in particular we'll dig into some of the details of what makes this so tough.
Relevant links: http://static.googleusercontent.com/media/research.google.com/en//pubs/archive/45390.pdf
Topics
datasciencemachinelearninglineardigressions