PLAY PODCASTS
Prototype to Production for RAG applications (sps24)

Prototype to Production for RAG applications (sps24)

Chaos Computer Club - archive feed · Isaac Chung

October 18, 202434m 41s

Audio is streamed directly from the publisher (cdn.media.ccc.de) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

Retrieval Augmented Generation (RAG) has been used to mitigate hallucination issues from LLMs and rapidly provide LLMs with external knowledge that were not part of the pre-training data. While tutorials offer convenient ways to build POCs quickly, transitioning these prototypes to production environments often catches us off-guard with unforeseen challenges. This talk takes a deeper dive into the topics that are often missing from cookbooks and tutorials yet are crucial in scaling your RAG prototype to production. Our discussion will use real examples to help you better understand some of the best practices in production RAG for observability, security, scalability, and fault tolerance. about this event: https://c3voc.de

Topics

56266importData Science & MoreAula2024