Two ways to impute missing values for a categorical feature
Need to impute missing values for a categorical feature? Two options:
1. Impute the most frequent value
2. Impute the value "missing", which treats it as a separate category
👉 New tips every TUESDAY and THURSDAY! 👈
🎥 Watch all tips: https://www.youtube.com/playlist?list=PL5-da3qGB5ID7YYAqireYEew2mWVvgmj6
🗒️ Code for all tips: https://github.com/justmarkham/scikit-learn-tips
💌 Get tips via email: https://scikit-learn.tips
=== WANT TO GET BETTER AT MACHINE LEARNING? ===
1) LEARN THE FUNDAMENTALS in my intro course (free!): https://courses.dataschool.io/introduction-to-machine-learning-with-scikit-learn
2) BUILD YOUR ML CONFIDENCE in my intermediate course: https://courses.dataschool.io/building-an-effective-machine-learning-workflow-with-scikit-learn
3) LET'S CONNECT!
- Newsletter: https://www.dataschool.io/subscribe/
- Twitter: https://twitter.com/justmarkham
- Facebook: https://www.facebook.com/DataScienceSchool/
- LinkedIn: https://www.linkedin.com/in/justmarkham/
Видео Two ways to impute missing values for a categorical feature канала Data School
1. Impute the most frequent value
2. Impute the value "missing", which treats it as a separate category
👉 New tips every TUESDAY and THURSDAY! 👈
🎥 Watch all tips: https://www.youtube.com/playlist?list=PL5-da3qGB5ID7YYAqireYEew2mWVvgmj6
🗒️ Code for all tips: https://github.com/justmarkham/scikit-learn-tips
💌 Get tips via email: https://scikit-learn.tips
=== WANT TO GET BETTER AT MACHINE LEARNING? ===
1) LEARN THE FUNDAMENTALS in my intro course (free!): https://courses.dataschool.io/introduction-to-machine-learning-with-scikit-learn
2) BUILD YOUR ML CONFIDENCE in my intermediate course: https://courses.dataschool.io/building-an-effective-machine-learning-workflow-with-scikit-learn
3) LET'S CONNECT!
- Newsletter: https://www.dataschool.io/subscribe/
- Twitter: https://twitter.com/justmarkham
- Facebook: https://www.facebook.com/DataScienceSchool/
- LinkedIn: https://www.linkedin.com/in/justmarkham/
Видео Two ways to impute missing values for a categorical feature канала Data School
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Examine the intermediate steps in a PipelineDrop the first category from binary features (only) with OneHotEncoderGuess the MYSTERY course 🎁Course outline: "Master Machine Learning with scikit-learn"Join Data School Insiders on Patreon!Load a toy dataset into a DataFrameHow to read the scikit-learn documentationHistGradientBoostingClassifier natively supports missing valuesHow to save a scikit-learn Pipeline with custom transformersShuffle your dataset when using cross_val_scoreAdd a missing indicator to encode "missingness" as a featureThree reasons not to use drop='first' with OneHotEncoderDon't use .values when passing a pandas object to scikit-learnEstimators only print parameters that have been changedAdditional Resources for Learning Git and GitHubWhat's your favorite pandas trick??Cost-sensitive learning in scikit-learnMost parameters should be passed as keyword argumentsPassthrough some columns and drop others in a ColumnTransformerTune the parameters of a VotingClassifer or VotingRegressorMachine Learning with Text in scikit-learn (PyData DC 2016)