PLAY PODCASTS
Tracking Drift to Monitor LLM Performance
Episode 1

Tracking Drift to Monitor LLM Performance

Safe and Sound AI

December 12, 202411m 50s

Audio is streamed directly from the publisher (mcdn.podbean.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

In this episode, we discuss how to monitor the performance of Large Language Models (LLMs) in production environments. We explore common enterprise approaches to LLM deployment and evaluate the importance of monitoring for LLM quality or the quality of LLM responses over time. We discuss strategies for "drift monitoring" — tracking changes in both input prompts and output responses — allowing for proactive troubleshooting and improvement via techniques like fine-tuning or augmenting data sources.

Read the article by Fiddler AI and explore additional resources on how AI observability can help developers build trust into AI services.