Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

747: Technical Intro to Transformers and LLMs, with Kirill Eremenko

747: Technical Intro to Transformers and LLMs, with Kirill Eremenko

FromSuper Data Science: ML & AI Podcast with Jon Krohn


747: Technical Intro to Transformers and LLMs, with Kirill Eremenko

FromSuper Data Science: ML & AI Podcast with Jon Krohn

ratings:
Length:
127 minutes
Released:
Jan 9, 2024
Format:
Podcast episode

Description

Attention and transformers in LLMs, the five stages of data processing, and a brand-new data science course: Kirill Eremenko joins host Jon Krohn to explore what goes into well-crafted LLMs, what makes Transformers so powerful, and how to succeed as a data scientist in this new age of generative AI.

This episode is brought to you by Intel and HPE Ezmeral Software Solutions (https://hpe.com/ezmeral/chatwithyourdata), and by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information.

In this episode you will learn:
• Supply and demand in AI recruitment [08:30]
• Kirill and Hadelin's new course on LLMs, “Large Language Models (LLMs), Transformers & GPT A-Z” [15:37]
• The learning difficulty in understanding LLMs [19:46]
• The basics of LLMs [22:00]
• The five building blocks of transformer architecture [36:29]
- 1: Input embedding [44:10]
- 2: Positional encoding [50:46]
- 3: Attention mechanism [54:04]
- 4: Feedforward neural network [1:16:17]
- 5: Linear transformation and softmax [1:19:16]
• Inference vs training time [1:29:12]
• Why transformers are so powerful [1:49:22]

Additional materials: www.superdatascience.com/747
Released:
Jan 9, 2024
Format:
Podcast episode

Titles in the series (77)

The Super Data Science podcast with Jon Krohn brings you the latest and most important machine learning, artificial intelligence, and broader data-world topics from across both academia and industry. As the quantity of data on our planet doubles every couple of years and this trend is set to continue for decades to come, there's an unprecedented opportunity for you to make an enormous impact in your lifetime. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, and commercialization − everything you need to crush it with data science.