Загрузка страницы

Jaroslaw Szymczak - Gradient Boosting in Practice: a deep dive into xgboost

From variety of classification and regression methods, gradient boosting, and in particular its variation in xgboost implementation, is one of the most convenient to use. Out of the box you can use it as easily as random forest. Due to its nature, when used with decision trees, you don't need to worry about co-linearities or missing values. No more worrying about normalization, standardization nor any other monotonic transformations on your data. Overfitting prevention with watchlists. Written efficiently in C++ with Python and R bindings and scikit-learn like interface. In this talk we will go deep into how and why xgboost works, why it is present in so many winning Kaggle solutions, what is the meaning of its parameters, how to tune them and how to use it in practice.

Jaroslaw is a Machine Learning Scientist in OLX Tech Hub Berlin. He has background in analytics and predictive models creation for finance institutions, FMCG and Telecom companies. Currently he is specializing in applying machine learning to detection of unwanted content on OLX classifieds sites across the globe.

---------------------

www.pydata.org

PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.

PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome!
00:10 Help us add time stamps or captions to this video! See the description for details.

Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps

Видео Jaroslaw Szymczak - Gradient Boosting in Practice: a deep dive into xgboost канала PyData
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
13 января 2018 г. 0:07:43
00:43:49
Яндекс.Метрика