![[AI] Behind ChatGPT: RLHF and the Proximal Policy Optimization - Practical AI](https://img.transistor.fm/94aMolU3k24DDOId2-qGWFFpD6o9VcbVh2GZDKkE9pA/rs:fill:0:0:1/w:1400/h:1400/q:60/mb:500000/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9zaG93/LzE3NjMxLzE2MjY4/NDU3NzgtYXJ0d29y/ay5qcGc.jpg)
Episode 507
[AI] Behind ChatGPT: RLHF and the Proximal Policy Optimization - Practical AI
A great discussion of RLHF exhibited by ChatGPT by the PracticalAI guys
The Swyx Mixtape · Swyx
January 24, 202314m 7s
Audio is streamed directly from the publisher (2.gum.fm) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.
Show Notes
from https://overcast.fm/+HaNPbG9CU/24:00
to read: https://overcast.fm/+HaNPbG9CU/24:00
Topics
learningtechnologybusiness