Загрузка страницы

Scaling Factorization Machines on Spark Using Parameter Servers (Nick Pentreath)

Factorization machines are a relatively new class of model, that are extremely powerful as they are able to efficiently capture arbitrary order interactions between features. FMs are becoming increasingly popular in settings with large amounts of sparse data, including recommender systems and online advertising. Furthermore, with appropriate feature engineering, they can mimic most commonly used factorization-based models for collaborative filtering. However, one drawback of FMs is that, even though they are relatively efficient to train, they can still be difficult to scale to very large feature dimensions. This talk will explore scaling up FMs on Spark, using the Glint parameter server built on Akka. Rather than a general exploration of parameter server architectures, the focus will be on specific technical aspects of training factorization machines, with code examples and performance analysis and comparisons. It will also cover integration with Spark DataFrames and ML pipelines for feature engineering and cross-validation. Example code will be available as open source.

Видео Scaling Factorization Machines on Spark Using Parameter Servers (Nick Pentreath) канала Spark Summit
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
4 ноября 2016 г. 3:29:13
00:28:31
Другие видео канала
Glint: An Asynchronous Parameter Server for Spark (Rolf Jagerman)Glint: An Asynchronous Parameter Server for Spark (Rolf Jagerman)IoT and the Autonomous Vehicle in the Clouds: Spark Summit East  talk by Jay White BearIoT and the Autonomous Vehicle in the Clouds: Spark Summit East talk by Jay White BearAnalysis Andromeda Galaxy Data Using Spark: Spark Summit East talk by Jose NandezAnalysis Andromeda Galaxy Data Using Spark: Spark Summit East talk by Jose NandezThe Fast Path to Building Operational Applications with Spark: talk by Nikita ShamgunovThe Fast Path to Building Operational Applications with Spark: talk by Nikita ShamgunovNew Directions for Spark in 2015- Matei Zaharia (Databricks)New Directions for Spark in 2015- Matei Zaharia (Databricks)Software Above the Level of a Single Device  The Implications  - Tim O'Reilly (O'Reilly Media)Software Above the Level of a Single Device The Implications - Tim O'Reilly (O'Reilly Media)Keynote - Arun Murthy (Hortonworks)Keynote - Arun Murthy (Hortonworks)Scalable Deep Learning Platform On Spark In BaiduScalable Deep Learning Platform On Spark In BaiduExtending Word2Vec for Performance and Semi Supervised Learning - Michael Malak (Oracle)Extending Word2Vec for Performance and Semi Supervised Learning - Michael Malak (Oracle)5 Reasons Enterprise Adoption Of Spark Is Unstoppable5 Reasons Enterprise Adoption Of Spark Is UnstoppableSpark Summit 2013 - Big Data Research in the AMPLab - Mike FranklinSpark Summit 2013 - Big Data Research in the AMPLab - Mike FranklinDelivering Insights from 5PB of Product Logs at Pure Storage: Spark Summit East talk by Brian GoldDelivering Insights from 5PB of Product Logs at Pure Storage: Spark Summit East talk by Brian GoldPedal to the Metal: Accelerating Apache Spark with Innovations in Silicon TechnologyPedal to the Metal: Accelerating Apache Spark with Innovations in Silicon TechnologyProduction Spark and Tachyon use CasesProduction Spark and Tachyon use CasesSpark as a Platform to Support Multi-Tenancy and Many Kinds of Data Applications - Kelvin Chu (Uber)Spark as a Platform to Support Multi-Tenancy and Many Kinds of Data Applications - Kelvin Chu (Uber)Perspectives on Big Data & Analytics - Doug Wolfe (Central Intelligence Agency)Perspectives on Big Data & Analytics - Doug Wolfe (Central Intelligence Agency)Spark'ing an Anti Money Laundering Revolution- Katie Levans; Koert Kuipers (Tresata)Spark'ing an Anti Money Laundering Revolution- Katie Levans; Koert Kuipers (Tresata)Towards Modularizing Spark Machine Learning Jobs- Lance Co Ting Keh (Box)Towards Modularizing Spark Machine Learning Jobs- Lance Co Ting Keh (Box)Distributed Heterogeneous Mixture Learning On SparkDistributed Heterogeneous Mixture Learning On SparkA More Scalable Way of Making Recommendations with MLlib - Xiangrui Meng (Databricks)A More Scalable Way of Making Recommendations with MLlib - Xiangrui Meng (Databricks)Fireside Chat  -Justin Langseth (Zoomdata)Fireside Chat -Justin Langseth (Zoomdata)
Яндекс.Метрика