Jaroslaw Szymczak - Gradient Boosting in Practice: a deep dive into xgboost
From variety of classification and regression methods, gradient boosting, and in particular its variation in xgboost implementation, is one of the most convenient to use. Out of the box you can use it as easily as random forest. Due to its nature, when used with decision trees, you don't need to worry about co-linearities or missing values. No more worrying about normalization, standardization nor any other monotonic transformations on your data. Overfitting prevention with watchlists. Written efficiently in C++ with Python and R bindings and scikit-learn like interface. In this talk we will go deep into how and why xgboost works, why it is present in so many winning Kaggle solutions, what is the meaning of its parameters, how to tune them and how to use it in practice.
Jaroslaw is a Machine Learning Scientist in OLX Tech Hub Berlin. He has background in analytics and predictive models creation for finance institutions, FMCG and Telecom companies. Currently he is specializing in applying machine learning to detection of unwanted content on OLX classifieds sites across the globe.
---------------------
www.pydata.org
PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome!
00:10 Help us add time stamps or captions to this video! See the description for details.
Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps
Видео Jaroslaw Szymczak - Gradient Boosting in Practice: a deep dive into xgboost канала PyData
Jaroslaw is a Machine Learning Scientist in OLX Tech Hub Berlin. He has background in analytics and predictive models creation for finance institutions, FMCG and Telecom companies. Currently he is specializing in applying machine learning to detection of unwanted content on OLX classifieds sites across the globe.
---------------------
www.pydata.org
PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome!
00:10 Help us add time stamps or captions to this video! See the description for details.
Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps
Видео Jaroslaw Szymczak - Gradient Boosting in Practice: a deep dive into xgboost канала PyData
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
AdaBoost, Clearly Explained17. Learning: BoostingAnna Veronika Dorogush: Mastering gradient boosting with CatBoost | PyData London 2019Can one do better than XGBoost? - Mateusz SusikTrevor Hastie - Gradient Boosting Machine LearningGradient Boosting : Data Science's Silver BulletVincent Warmerdam: Winning with Simple, even Linear, Models | PyData London 2018CatBoost - the new generation of gradient boosting - Anna Veronika DorogushData Science Portfolio Tips | Discussion with Ken Jee, Krish Naik, Codebasics and Data ProfessorIntroduction To Gradient Boosting algorithm (simplistic n graphical) - Machine LearningBoosting - EXPLAINED!10 Tree Models and Ensembles: Decision Trees, Boosting, Bagging, Gradient Boosting (MLVU2018)Train, Evaluate, Repeat: Building a Credit Card Fraud Detection System - Leela Senthil NathanIntro to XGBoost Models (decision-tree-based ensemble ML algorithms)7.5 Gradient Boosting (L07: Ensemble Methods)Visual Guide to Gradient Boosted Trees (xgboost)XGBoost Made Easy | Extreme Gradient Boosting | AWS SageMakerGradient Boost Machine Learning|How Gradient boost work in Machine LearningTime Series Forecasting with Xgboost