Reading CSV, Parquet, and ORC with cuDF, Dask_cuDF, Pandas | Performance Benchmarking
In big data processing, frequent reading and writing of files can lead to significant performance drops when hundreds of large files are loaded simultaneously. Various libraries can help us quickly process these files, such as cuDF for performing processing on GPU, and Pandas for CPU. Multiple files can be processed simultaneously using Dask_cuDF. Let's see what performance the best among these.
Видео Reading CSV, Parquet, and ORC with cuDF, Dask_cuDF, Pandas | Performance Benchmarking канала MLWorks
Видео Reading CSV, Parquet, and ORC with cuDF, Dask_cuDF, Pandas | Performance Benchmarking канала MLWorks
Комментарии отсутствуют
Информация о видео
6 января 2024 г. 23:15:03
00:07:50
Другие видео канала