Загрузка страницы

combine different datasets into one | relational data | R for data science | left_join function in R

Hello everyone,

This video is based on R for Data Science chapter 13 Relational data. It is very common to have messy data in real life, and it is often true that the data for your analysis is from different resources. In this video, you will learn how to combine data tables and use them for analysis, through a few real-life datasets and two small data examples.

You may read chapter 13 through this link: https://r4ds.had.co.nz/

0:00 a quick intro
5:12 combine flights data with other information
such as weather, planes, and airlines
12:26 an easy and quick explanation using drawing to explain left_join
14:10 understand the joining functions using two smaller datasets

The R code in this video is available here: https://github.com/yz-DataScience/R-for-data-science/blob/main/R%20code%20for%20relational%20data.R

You may find the playlist on R for data science book club here ( I have been recording the videos according to a popular book): https://youtube.com/playlist?list=PLKNR1HvYSio7IaKu2vGkNJ1_jGNYiJPmT

You may set the video speed to 1.25 or 1.5 to make the video go faster by clicking the setting from the bottom right of the video, and set the speed to 1.25 or 1.5.

You need to install R and RStudio before you use RStudio. Follow this instruction here:
https://github.com/yz-DataScience/R-for-data-science/blob/main/R%20and%20RStudio%20Installation%20file.pdf

Видео combine different datasets into one | relational data | R for data science | left_join function in R канала Data Science with Yan
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
14 октября 2021 г. 19:41:09
00:17:53
Другие видео канала
data visualization| ggplot2| dplyr| data manipulation| Bar plot with error bars using Rdata visualization| ggplot2| dplyr| data manipulation| Bar plot with error bars using Rcheck for normal conditions | normality test | histogram | qq plotcheck for normal conditions | normality test | histogram | qq plotOne step boxplot considering two factors | customize your plot color paletteOne step boxplot considering two factors | customize your plot color palettepolynomial regression using R | non-linear regression | curved regressionpolynomial regression using R | non-linear regression | curved regressionggplot2: data visualization using boxplotsggplot2: data visualization using boxplotsUsing R to draw US maps | regions | Selected States in the United States PART 1Using R to draw US maps | regions | Selected States in the United States PART 1A quick intro to R Markdown: a convenient tool for creating a well-organized document with R outputA quick intro to R Markdown: a convenient tool for creating a well-organized document with R outputEDA exploratory data analysis part 1 distributions of one variable | R for data science book clubEDA exploratory data analysis part 1 distributions of one variable | R for data science book clubWorkflow R basics | Frustration is typical| Practice is Key!| R for data science book clubWorkflow R basics | Frustration is typical| Practice is Key!| R for data science book clubFlights dataset| data manipulation | deal with date/week of the day | calendar heat map| R tidyverseFlights dataset| data manipulation | deal with date/week of the day | calendar heat map| R tidyverseR markdown: you only need this video to create a R markdown file!R markdown: you only need this video to create a R markdown file!EDA part 2| ultimate guide to visualize covariations on two variables | R for data science book clubEDA part 2| ultimate guide to visualize covariations on two variables | R for data science book clubR for data science Book Club | An Introduction and announcement video | Subscribe to follow alongR for data science Book Club | An Introduction and announcement video | Subscribe to follow alongData manipulation with dplyr Part 2 | R for data science book club| chapter 5 data transformationData manipulation with dplyr Part 2 | R for data science book club| chapter 5 data transformationR for data science book club chapter 1 | a quick introduction of the book| whole processR for data science book club chapter 1 | a quick introduction of the book| whole processModeling using R | simple linear regression| correlations, visualizations, fit a model lm() functionModeling using R | simple linear regression| correlations, visualizations, fit a model lm() functionData Manipulation using R dplyr package | filter function to subset data based on conditionsData Manipulation using R dplyr package | filter function to subset data based on conditionsna.omit: get rid of missing values | ggpairs: a quick visualization function of a full dataset in Rna.omit: get rid of missing values | ggpairs: a quick visualization function of a full dataset in RSplit strings in R | change variables from characters to numeric | strisplit( )Split strings in R | change variables from characters to numeric | strisplit( )data manipulation with dplyr | R for data science book club| chapter 5 data transformation | Part 1data manipulation with dplyr | R for data science book club| chapter 5 data transformation | Part 1
Яндекс.Метрика