Apache Spark RDD operations : Transformations and Actions
RDD operations
==============
There are 2 operations that can be applied on RDD. One is transformation.
1) Transformation
===============
Transformation is what you do to an RDD to get another resultant RDD.
The example would be to apply functions like filter, union , that would then create another resultant RDD.
FILTER is a transformation that when applied on an RDD, will isolate certain elements and create a new RDD.
This combining of elements from 2 RDD be done using UNION transformation. UNION is a multi-RDD transformation, which means it acts on more than one RDD.
2) Actions
=========
Actions are second type of operations in RDD. Actions return a result to the driver program , or write it in a storage and kick off a computation. some examples are count , first, collect, take
count action can be used to get the number of elements in an RDD.
----------
first action can be used to retrieve the first element in the RDD.
-------
take action can be used to retrieve n elements out of the RDD.
--------
collect action can be used to retrieve the complete list of elements
-----------
from the RDD.
Видео Apache Spark RDD operations : Transformations and Actions канала BigDataElearning
==============
There are 2 operations that can be applied on RDD. One is transformation.
1) Transformation
===============
Transformation is what you do to an RDD to get another resultant RDD.
The example would be to apply functions like filter, union , that would then create another resultant RDD.
FILTER is a transformation that when applied on an RDD, will isolate certain elements and create a new RDD.
This combining of elements from 2 RDD be done using UNION transformation. UNION is a multi-RDD transformation, which means it acts on more than one RDD.
2) Actions
=========
Actions are second type of operations in RDD. Actions return a result to the driver program , or write it in a storage and kick off a computation. some examples are count , first, collect, take
count action can be used to get the number of elements in an RDD.
----------
first action can be used to retrieve the first element in the RDD.
-------
take action can be used to retrieve n elements out of the RDD.
--------
collect action can be used to retrieve the complete list of elements
-----------
from the RDD.
Видео Apache Spark RDD operations : Transformations and Actions канала BigDataElearning
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![Apache Spark : Commonly used Transformations : Map, Filter, Flatmap Transformations](https://i.ytimg.com/vi/HS8Cx-l9Vhg/default.jpg)
![Apache Spark RDD Basics : What is RDD, How to create an RDD](https://i.ytimg.com/vi/NRo8TluH7KI/default.jpg)
![Apache Kafka in 6 minutes](https://i.ytimg.com/vi/Ch5VhJzaoaI/default.jpg)
![A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets - Jules Damji](https://i.ytimg.com/vi/Ofk7G3GD9jk/default.jpg)
![](https://i.ytimg.com/vi/V-BuLBO1n3g/default.jpg)
![Hadoop vs Spark | Which One to Choose? | Hadoop Training | Spark Training | Edureka](https://i.ytimg.com/vi/xDpvyu0w0C8/default.jpg)
![Spark Performance Tuning | Performance Optimization | Interview Question](https://i.ytimg.com/vi/Kb08RTmjnkw/default.jpg)
![RDDs: Transformation and Actions](https://i.ytimg.com/vi/JjuKVv8SiLg/default.jpg)
![Fine Tuning and Enhancing Performance of Apache Spark Jobs](https://i.ytimg.com/vi/WSplTjBKijU/default.jpg)
![Apache Spark Architecture | Spark Cluster Architecture Explained | Spark Training | Edureka](https://i.ytimg.com/vi/jffQhcweGwY/default.jpg)
![Wide vs Narrow Dependencies](https://i.ytimg.com/vi/LDdA1RW_6xo/default.jpg)
![Apache Spark Architecture : Run Time Architecture of Spark Application](https://i.ytimg.com/vi/rJFg2i_auAg/default.jpg)
![Spark Architecture](https://i.ytimg.com/vi/wzy0oluoyN8/default.jpg)
![Broadcast vs Accumulator Variable - Broadcast Join & Counters - Apache Spark Tutorial For Beginners](https://i.ytimg.com/vi/AyfuUQtfWFY/default.jpg)
![Part 2 - Spark SQL - Apache Spark Crash Course Mini-series](https://i.ytimg.com/vi/FcAiK2VtPfA/default.jpg)
![Hadoop vs Spark | Hadoop And Spark Difference | Hadoop And Spark Training | Simplilearn](https://i.ytimg.com/vi/2PVzOHA3ktE/default.jpg)
![1.3 Apache Spark Architecture | Spark Execution Model | Spark tutorial](https://i.ytimg.com/vi/TmoaOFK1iEM/default.jpg)
![Shuffling: What it is and why it's important](https://i.ytimg.com/vi/kbQmZiT1gnA/default.jpg)
![Wide vs Narrow Transformation | Spark Tutorial | Interview Question](https://i.ytimg.com/vi/DAApXuIy_D8/default.jpg)
![How To Select, Rename, Transform and Manipulate Columns of a Spark DataFrame | PySpark Tutorial](https://i.ytimg.com/vi/GjIV7o-Y2bQ/default.jpg)