Загрузка...

I Built My First ETL Pipeline as a Data Analyst (Using GitHub Trending Repos)

In my last video, I talked about how I’m learning data engineering as a data analyst. This is the next step in that journey.

In this video, I tried building my first real ETL pipeline from scratch using Python. The pipeline extracts trending GitHub repositories from the last 30 days, transforms and cleans the data, then saves everything into a CSV file.

I also break down what ETL actually means in a beginner-friendly way because honestly, it sounded way more complicated to me before I started building projects around it.

This project covers:

* Extracting data from an API
* Transforming and cleaning raw data
* Saving structured data as CSV
* Beginner data engineering concepts
* Real-world ETL workflow

I’m learning publicly as I go, so if you’re also transitioning into data engineering, hopefully this helps you understand the concepts faster than just watching theory videos.

More projects coming soon as I continue following my 12-month data engineering roadmap.

Видео I Built My First ETL Pipeline as a Data Analyst (Using GitHub Trending Repos) канала Ibrahim's Notebook
Яндекс.Метрика
Все заметки Новая заметка Страницу в заметки
Страницу в закладки Мои закладки
На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.
О CookiesНапомнить позжеПринять