PLAY PODCASTS
17 - Evaluation | Philip Tannor (DeepChecks)
Season 1 · Episode 17

17 - Evaluation | Philip Tannor (DeepChecks)

LangTalks · Lee Twito, Gal Peretz

November 13, 202329m 54s

Show Notes

Evaluating LLMs and AI pipeline in dev and production environments. How to work with datasets