PLAY PODCASTS
Internet Archive Scholar: Preserving the scholarly record (clt25)

Internet Archive Scholar: Preserving the scholarly record (clt25)

Chaos Computer Club - archive feed · Martin Czygan

March 23, 202523m 29s

Audio is streamed directly from the publisher (cdn.media.ccc.de) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

Internet Archive Scholar is an ongoing project to discover and preserve open scholarly publications on the web. Just as websites, online articles and journals can vanish and we are trying to be there first. In the process, we built our own catalog, access site and derived various interesting datasets, such as a large scale citation graph. This talk gives an overview about the project, the tech stack, and will highlight some interesting open access datasets created and curated during the project. Licensed to the public under http://creativecommons.org/licenses/by/4.0 about this event: https://chemnitzer.linux-tage.de/2025/de/programm/beitrag/313

Topics

682025clt25VortragV5clt25-engDay 2