Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

Build Your Own Data Pipeline - Andreas Kretz

Build Your Own Data Pipeline - Andreas Kretz

FromDataTalks.Club


Build Your Own Data Pipeline - Andreas Kretz

FromDataTalks.Club

ratings:
Length:
62 minutes
Released:
Jul 2, 2021
Format:
Podcast episode

Description

We talked about:

Andreas’s background
Why data engineering is becoming more popular
Who to hire first – a data engineer or a data scientist?
How can I, as a data scientist, learn to build pipelines?
Don’t use too many tools
What is a data pipeline and why do we need it?
What is ingestion?
Can just one person build a data pipeline?
Approaches to building data pipelines for data scientists
Processing frameworks
Common setup for data pipelines — car price prediction
Productionizing the model with the help of a data pipeline
Scheduling
Orchestration
Start simple
Learning DevOps to implement data pipelines
How to choose the right tool
Are Hadoop, Docker, Cloud necessary for a first job/internship?
Is Hadoop still relevant or necessary?
Data engineering academy
How to pick up Cloud skills
Avoid huge datasets when learning
Convincing your employer to do data science
How to find Andreas


Links:

LinkedIn: https://www.linkedin.com/in/andreas-kretz
Data engieering cookbook: https://cookbook.learndataengineering.com/
Course: https://learndataengineering.com/


Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
Released:
Jul 2, 2021
Format:
Podcast episode

Titles in the series (100)

DataTalks.Club - the place to talk about data!