Building Machine Learning Models with Strict Privacy Boundaries
Speaker: Renaud Bourassa, Staff Software Engineer at Slack
Slides: https://www.slideshare.net/SessionsEvents/renaud-bourassa-building-machine-learning-models-with-strict-privacy-boundaries
"Every day, millions of people rely on Slack to get the information they need to do their jobs. To make their working lives more productive, we built a number of machine learning models to help users make sense of the data flowing through Slack. Although these models vary in structure and objective, they all share one common characteristic: they must deal with strict privacy boundaries inherent to the underlying dataset.
By policy, users can only be exposed to data that was publicly shared in their own Slack team. These restrictions must carry over into the machine learning models we build: not only must the models refrain from outputting data from foreign teams, patterns in foreign teams’ data must not be inferable from the usage of these models.
In this talk, I will discuss how Slack’s dataset differs from many traditional machine learning datasets. I will also present some techniques we developed to leverage our entire dataset to improve the performance of our models without jeopardizing the privacy boundaries we guarantee to our customers."
Видео Building Machine Learning Models with Strict Privacy Boundaries канала MLconf
Slides: https://www.slideshare.net/SessionsEvents/renaud-bourassa-building-machine-learning-models-with-strict-privacy-boundaries
"Every day, millions of people rely on Slack to get the information they need to do their jobs. To make their working lives more productive, we built a number of machine learning models to help users make sense of the data flowing through Slack. Although these models vary in structure and objective, they all share one common characteristic: they must deal with strict privacy boundaries inherent to the underlying dataset.
By policy, users can only be exposed to data that was publicly shared in their own Slack team. These restrictions must carry over into the machine learning models we build: not only must the models refrain from outputting data from foreign teams, patterns in foreign teams’ data must not be inferable from the usage of these models.
In this talk, I will discuss how Slack’s dataset differs from many traditional machine learning datasets. I will also present some techniques we developed to leverage our entire dataset to improve the performance of our models without jeopardizing the privacy boundaries we guarantee to our customers."
Видео Building Machine Learning Models with Strict Privacy Boundaries канала MLconf
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital](https://i.ytimg.com/vi/8rQ9g03yJ8E/default.jpg)
![Anima Anadkumar, Principal Scientist, Amazon Web Services, Endowed Professor, CalTech](https://i.ytimg.com/vi/RRy-3VXA0nw/default.jpg)
![Manipulating and Measuring Model Interpretability](https://i.ytimg.com/vi/hHAW1ug2qlE/default.jpg)
![Jennifer Marsman, Principal Developer Evangelist, Microsoft @ MLconf NYC](https://i.ytimg.com/vi/T8FaWkqzK0A/default.jpg)
![MLconf Online 2020: DevOps for Data Science With Kubernetes by Sophie Watson](https://i.ytimg.com/vi/9TqHilvnUuM/default.jpg)
![Sven Kreiss, Lead Data Scientist, Wildcard @ MLconf ATL](https://i.ytimg.com/vi/09kpP-w4DLI/default.jpg)
![Virginia Smith - A General Framework for Communication-Efficient Distributed... - MLconf SF 2016](https://i.ytimg.com/vi/vuGiNJoq8NQ/default.jpg)
![Jeremy Stanley, EVP/Data Scientist, Sailthru @ MLconf NYC](https://i.ytimg.com/vi/vEemVVLGo6E/default.jpg)
![Sanjeev Satheesh, The Story of End to End Models in Deep Learning at The AI Conference 2017](https://i.ytimg.com/vi/h3Y3Gohn1HI/default.jpg)
![MLconf Online 2020: Data Science is Key to Achieving Energy Access in Africa Madeleine Gleave](https://i.ytimg.com/vi/i9FXqOeFpwY/default.jpg)
![Subutai Ahmad, VP of Research, Numenta @ MLconf SF](https://i.ytimg.com/vi/SxtsCrTHz-4/default.jpg)
![Justin Basilico, Senior Researcher Engineer in Recommendation Systems, Netlix @ MLconf ATL](https://i.ytimg.com/vi/doWgbo-c9sM/default.jpg)
![Sergei Vassilvitskii, Research Scientist, Google @ MLconf NYC](https://i.ytimg.com/vi/rtXeauFFCE4/default.jpg)
![MLconf Online 2020: Mathematical Approaches to Clustering by Joseph Ross](https://i.ytimg.com/vi/ziZ2JfXDAd4/default.jpg)
![Byron Galbraith, Chief Data Scientist, Talla, NYC 2017](https://i.ytimg.com/vi/IHCtfiI8llA/default.jpg)
![MLconf NYC 2022: How to Detect and Interpret Data Drift in Production by Emeli Dral of Evidently AI](https://i.ytimg.com/vi/FnVi_-eq4yE/default.jpg)
![Optimized Image Classification on the Cheap](https://i.ytimg.com/vi/P5rU5LJfV5A/default.jpg)
![Carlos Guestrin, CEO of Dato Inc. @ MLconf SEA](https://i.ytimg.com/vi/gjSC5ZjLnII/default.jpg)
![Johann Schleier Smith, Co Founder and CTO, ifwe @ MLconf SF](https://i.ytimg.com/vi/t6eAdPof9yQ/default.jpg)
![Using a Bayesian Neural Network in the Detection of Exoplanets](https://i.ytimg.com/vi/u42czORKkt8/default.jpg)