Загрузка страницы

Word count example - From Google Cloud Storage to BigQuery with Apache Beam

Hi folks
thank you very much for the support. The first video was quite a success compared to my expectations, so here is the next one of the series.

In this video Word count example, we will learn how to create a custom pipeline in DataFlow using Apache Beam. We will count words from a text file using custom python code.

You can jump to the installation steps if you already have the environment set, but I advise you to follow from the write pipeline to understand the whole concept.

You have the code in GitHub: https://github.com/mrinaldi2/apache-beam-word-count-dataflow

At the end of the video, I proposed a little challenge. If you achieve to complete the challenge, please create a new branch and push it to GitHub so I can see it.

00:00 Introduction
04:20 How to install GCP SDK
08:20 Create a Python environment
11:40 Write a pipeline in apache beam
28:40 Run the pipeline DataFlowRunner
36:30 Change our pipeline to save the output on BQ

Enjoy, Like, Subscribe and Share
If you liked the video, want more, and are part of deciding what's next, please support us at https://www.patreon.com/cslearning?fan_landing=true.
https://www.instagram.com/cslearning86
https://www.twitter.com/cslearning3
https://www.facebook.com/profile.php?id=100064230603380
Follow my blog on https://cslearning.blog/
Thank you

Видео Word count example - From Google Cloud Storage to BigQuery with Apache Beam канала CSLearning
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
17 января 2021 г. 1:11:16
00:45:29
Яндекс.Метрика