Dask in 15 Minutes | Machine Learning & Data Science Open-source Spotlight #5

Should you use Dask or PySpark for Big Data? 🤔

Dask is a flexible library for parallel computing in Python.
In this video I give a tutorial on using Dask for parallel computing, handling Big Data, and integrating with Deep Learning frameworks.
I also compare Dask to PySpark and list the advantages I see in making Dask your primary tool for Big Data handling.

Link to Notebook:
https://nbviewer.jupyter.org/github/danbochman/Open-Source-Spotlight/blob/master/Dask/Dask.ipynb

With these weekly "Machine Learning & Data Science Open Source Spotlight" videos, my goal is to introduce game-changing libraries that I believe many people can benefit from.

I would love to hear your feedback!
Did this video teach you something new?
Are there any open-source libraries you think deserve a spotlight?
Let me know in the comments! 👇🏻

Video "Dask in 15 Minutes | Machine Learning & Data Science Open-source Spotlight #5" from the channel Dan Bochman