Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

791: Reinforcement Learning from Human Feedback (RLHF), with Dr. Nathan Lambert

791: Reinforcement Learning from Human Feedback (RLHF), with Dr. Nathan Lambert

FromSuper Data Science: ML & AI Podcast with Jon Krohn


791: Reinforcement Learning from Human Feedback (RLHF), with Dr. Nathan Lambert

FromSuper Data Science: ML & AI Podcast with Jon Krohn

ratings:
Length:
57 minutes
Released:
Jun 11, 2024
Format:
Podcast episode

Description

Reinforcement learning through human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to Jon Krohn about the technique’s origins of the technique. He also walks through other ways to fine-tune LLMs, and how he believes generative AI might democratize education.

This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and AWS Trainium (https://go.aws/3ycV6K0), and Crawlbase (https://crawlbase.com), the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

In this episode you will learn:
• Why it is important that AI is open [03:13]
• The efficacy and scalability of direct preference optimization [07:32]
• Robotics and LLMs [14:32]
• The challenges to aligning reward models with human preferences [23:00]
• How to make sure AI’s decision making on preferences reflect desirable behavior [28:52]
• Why Nathan believes AI is closer to alchemy than science [37:38]

Additional materials: www.superdatascience.com/791
Released:
Jun 11, 2024
Format:
Podcast episode

Titles in the series (77)

The Super Data Science podcast with Jon Krohn brings you the latest and most important machine learning, artificial intelligence, and broader data-world topics from across both academia and industry. As the quantity of data on our planet doubles every couple of years and this trend is set to continue for decades to come, there's an unprecedented opportunity for you to make an enormous impact in your lifetime. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, and commercialization − everything you need to crush it with data science.