15: InstructGPT

From Argmax

Length: 57 minutes
Released: Mar 28, 2023
Format: Podcast episode

Description

In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al. (2022). We cover the RLHF (reinforcement learning from human feedback) paradigm and how important RL is to tuning GPT.