Building Reproducible ML Models by Archa Jain
This talk draws from software engineering principles to introduce an end-to-end data science workflow that makes it easy to reproduce given results. In a 2016 study conducted by Nature [1], it was found that more than 70% of researchers surveyed were unable to reproduce research published by others, and more than 50% were unable to reproduce their own results. In general, the lack of reproducibility in research holds back other groups from building on existing results, holding back overall progress. Even in industry, it is often extremely hard for data scientists to share analysis methods, and to build on the work done by others in the company. In general, there is a belief that reproducibility is achieved by sharing raw code and data, and while this is a step in the right direction, there are often confounding environmental variables, especially in more complex models and analyses, that still make the results inconsistent. In this talk, I will provide a workflow rooted in Git and Jupyter that makes it easy to share research results, both in academia as well as within a company. I will also share some general principles and tips derived from software engineering that will allow anyone to tailor their work techniques to make their analysis more consistent and sharable. [1] https://www.nature.com/news/1-500-scientists-lift-the-lid-on-reproducibility-1.19970
Видео Building Reproducible ML Models by Archa Jain канала With The Best
Видео Building Reproducible ML Models by Archa Jain канала With The Best
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Tony Shan - IoT PatternsTrading stock, bond, and commodity futures on the blockchain by Jim BaiAaron Rinehart - ChaoSlingr: Introducing Security based Chaos Testing for CybersecurityWTB 2017Rakesh Dodeja - Containers to Deploy IoT Micro Services at the Fog and Edge NodesLeveraging Deep Learning and Video Analysis in Law Enforcement by Sanchit AroraRon Evans - Putting Eyes on the IoT: Advanced Computer Vision Using GolangJohn Walicki - IoT End to End - Global IoT Devfest With The Best 2017Gabriela de Queiroz - Statistics for Data Science: What You Should Know and WhyJohn Walicki - IoT End to End - Global IoT DevFest II November 7-8, 2017Using Machine Learning to Support Human Creativity by Rebecca FiebrinkFlorian Tramer - Ensemble Adversarial Training - Cybersecurity With The Best 2017Jennifer Williams - Enabling a Densely-Scalable Low-Power WSNs - Global IoT DevFest IIAngela Bassa - Machine Learning in Production: integrating with the software stack #AIWTBRon Evans - IoT Using Go and GoBot - Global IoT Devfest With The Best 2017Richard Socher - The Natural Language Decathlon: Multitask Learning as Question AnsweringAnthony Charbonnier, Startups and Programs : Sigfox at #IOTWTBNatalia Mykhaylova - Deriving Insights from Low-Cost Sensor NetworksFine-Grained Object Recognition in the Wild: A Multi-Task Domain Adaptation Approach by Timnit GebruKan Hayashi - Intel System Studio Use Cases for Digital Security and Surveillance SolutionsNoah Zentis - Rust - the Fast and Secure Language for Critical IoT Applications