Decoding LLM Quality: From Unit Testing to User Feedback

The Prompt Desk · Justin Macorin, Bradley Arsenault

October 10, 2023 · 18m 20s

Show Notes

Dive deep into the nuances of measuring the quality of Large Language Model (LLM) prompts as we explore past methodologies and evaluate both qualitative assessments and large-scale testing techniques. Join us as we discuss the challenges of traditional metrics and the role of user feedback, and brainstorm new ways to gauge generative model performance.

Continue listening to The Prompt Desk Podcast for everything LLM & GPT, Prompt Engineering, Generative AI, and LLM Security.

Check out PromptDesk.ai for an open-source prompt management tool.

Check out Brad's AI Consultancy at bradleyarsenault.me.

Add Justin Macorin and Bradley Arsenault on LinkedIn.


Please fill out our listener survey here to help us create a better podcast: https://docs.google.com/forms/d/e/1FAIpQLSfNjWlWyg8zROYmGX745a56AtagX_7cS16jyhjV2u_ebgc-tw/viewform?usp=sf_link


Hosted on Ausha. See ausha.co/privacy-policy for more information.

Topics

AI, gpt3, GPT, GPT4, Large Language Models, LLM, Prompt Engineering, LLM quality, AI research, prompt quality, AI engineering