Hi everyone!
Today we started module 5: batch processing
Materials: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/05-batch
Homework: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/05-batch/homework.md
Have fun!
The form for submitting homework 4 will remain open for some time
Today we started module 5: batch processing
Materials: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/05-batch
Homework: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/05-batch/homework.md
Have fun!
The form for submitting homework 4 will remain open for some time
GitHub
data-engineering-zoomcamp/05-batch at main · DataTalksClub/data-engineering-zoomcamp
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering. - DataTalksClub/data-engineering-zoomcamp
Hi everyone!
We will record module 6 on streaming live. It'll be tomorrow (Monday) at 17:00 CET. We'll send a reminder one hour before the start and share the link here too.
See you tomorrow!
We will record module 6 on streaming live. It'll be tomorrow (Monday) at 17:00 CET. We'll send a reminder one hour before the start and share the link here too.
See you tomorrow!
Hi everyone!
We accidentally discovered that for Q3 homework 5 different versions of Spark give different answers
We're still trying to figure out why it's happening, but if you came across this issue, select the closest option - it will still be the correct one
We accidentally discovered that for Q3 homework 5 different versions of Spark give different answers
We're still trying to figure out why it's happening, but if you came across this issue, select the closest option - it will still be the correct one
We're starting a stream about streaming in 1 hour. May the stream be with you!
Stream about streaming with Zach: https://www.youtube.com/watch?v=P2loELMUUeI
(Available for replay later)
(Available for replay later)
YouTube
Data Engineering Zoomcamp 2025 - Streaming - with Zach Wilson
Links:
- Repo: https://github.com/EcZachly/flink-training
- Course: https://github.com/DataTalksClub/data-engineering-zoomcamp
0:00 - Introduction to the Workshop
0:33 - Overview of the Session and Goals
1:04 - Tools and Technologies Used: Docker, Flink…
- Repo: https://github.com/EcZachly/flink-training
- Course: https://github.com/DataTalksClub/data-engineering-zoomcamp
0:00 - Introduction to the Workshop
0:33 - Overview of the Session and Goals
1:04 - Tools and Technologies Used: Docker, Flink…
We're still preparing homework for module 6, so you can continue working on module 5 in the meantime. We leave the homework form open for some time
Hi everyone!
The content and the homework for module 6 are ready
You can start working on the module
Module: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/06-streaming
Homework: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/06-streaming/homework.md
The homework is based on the PyFlink stream that Zach did, so you can treat the rest of the videos in module 6 as optional
Have fun and let us know in Slack if you have any problems
The content and the homework for module 6 are ready
You can start working on the module
Module: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main/06-streaming
Homework: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/06-streaming/homework.md
The homework is based on the PyFlink stream that Zach did, so you can treat the rest of the videos in module 6 as optional
Have fun and let us know in Slack if you have any problems
GitHub
data-engineering-zoomcamp/06-streaming at main · DataTalksClub/data-engineering-zoomcamp
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering. - DataTalksClub/data-engineering-zoomcamp
If you're looking for a project idea and interested in Blockchain, check this:
https://bush-thrill-4c5.notion.site/Solana-on-chain-analytics-competition-1aa9ca5fcbe7806dbee4eb96570c10a6?pvs=4
Our friend Dmitry Dremov is organizing a series of analytics competition on Solana data
You can check the past and ongoing competitions to learn more about the dataset or play with the data yourself and see if it's interesting for you to make a project about it
https://bush-thrill-4c5.notion.site/Solana-on-chain-analytics-competition-1aa9ca5fcbe7806dbee4eb96570c10a6?pvs=4
Our friend Dmitry Dremov is organizing a series of analytics competition on Solana data
You can check the past and ongoing competitions to learn more about the dataset or play with the data yourself and see if it's interesting for you to make a project about it
Hi everyone!
We hope you're enjoying the course! Now it's time to put everything we learned into practice
We start working on our projects
Here you can find more information about it: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/projects/README.md
You'll find the links for submitting your projects in the course management platform (and also here: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/project.md)
We hope you're enjoying the course! Now it's time to put everything we learned into practice
We start working on our projects
Here you can find more information about it: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/projects/README.md
You'll find the links for submitting your projects in the course management platform (and also here: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/cohorts/2025/project.md)
GitHub
data-engineering-zoomcamp/projects/README.md at main · DataTalksClub/data-engineering-zoomcamp
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering. - DataTalksClub/data-engineering-zoomcamp
We hope you're enjoying the project!
A reminder that if you use dlt in your project, you can get one of their T-shirts!
A reminder that if you use dlt in your project, you can get one of their T-shirts!
Hi everyone! Now it's time to learn from your peers and evaluate their projects!
If you submitted a project for attempt 1, you will find your assignments here:
https://courses.datatalks.club/de-zoomcamp-2025/project/project1/eval
Note that you need to be logged in
Happy learning!
And by the way, the form for submitting project attempt 2 is now open
If you submitted a project for attempt 1, you will find your assignments here:
https://courses.datatalks.club/de-zoomcamp-2025/project/project1/eval
Note that you need to be logged in
Happy learning!
And by the way, the form for submitting project attempt 2 is now open
Don't forget to evaluate your peers! If you don't do it, you'll fail your project. We will close the form tomorrow
Project attempt #1 is over!
Congratulations to 133 people who made it!
You can see the full list of projects (ranked by the score) here: https://courses.datatalks.club/de-zoomcamp-2025/project/project1/list
If you failed, you can update your project and submit it one more time for attempt 2
Congratulations to 133 people who made it!
You can see the full list of projects (ranked by the score) here: https://courses.datatalks.club/de-zoomcamp-2025/project/project1/list
If you failed, you can update your project and submit it one more time for attempt 2
We're extending the deadline for submitting attempt 2 for a couple of days
You have time till Friday
You have time till Friday
Another important thing: don't forget to update your certificate name. If you don't do it, you will have a randomly generated name on your certificate
You can do it here: https://courses.datatalks.club/de-zoomcamp-2025/enrollment
You can do it here: https://courses.datatalks.club/de-zoomcamp-2025/enrollment
We're closing the form for submitting project attempt 2 around 15:00 Berlin time. Hurry up!
If you used dlt in your project, please fill in this form:
https://docs.google.com/forms/d/e/1FAIpQLSd-KmfYXp179pGFj5SHALipC_t3raNV6b1B20Je5aILQVBY3Q/viewform
Remember - they are giving us 5 T-shirts, so you can get one of them
https://docs.google.com/forms/d/e/1FAIpQLSd-KmfYXp179pGFj5SHALipC_t3raNV6b1B20Je5aILQVBY3Q/viewform
Remember - they are giving us 5 T-shirts, so you can get one of them
Google Docs
dlt at Data Engineering Zoomcamp
If you used dlt in your project for our Data Engineering Zoomcamp, you can get a dltHub T-shirt!
We have 5 of them, so we will do a raffle to select the 5 winners
We have 5 of them, so we will do a raffle to select the 5 winners
If you passed the first attempt, you will find your certificate in your enrollment profile
Instructions: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/certificates.md
Congratulations to the 133 people who passed the project!
Instructions: https://github.com/DataTalksClub/data-engineering-zoomcamp/blob/main/certificates.md
Congratulations to the 133 people who passed the project!
GitHub
data-engineering-zoomcamp/certificates.md at main · DataTalksClub/data-engineering-zoomcamp
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering. - DataTalksClub/data-engineering-zoomcamp
Peer review assessments for project attempt 2
https://courses.datatalks.club/de-zoomcamp-2025/project/project2/eval
You need to be logged in to see them
Have fun learning from your peers!
https://courses.datatalks.club/de-zoomcamp-2025/project/project2/eval
You need to be logged in to see them
Have fun learning from your peers!
How many of you didn't manage to submit a project for attempt 2 but still want to do it?