Will Landau - Reproducible Computation at Scale in R with Targets [Remote]
Talk delivered December 1, 2020. Visit https://nyhackr.org/ to learn more and follow https://twitter.com/nyhackr. This meetup is the kickoff to the second 2020 R week with The R Conference for Government & Public Sector on December 2-4. Much like our recent NYR, this is virtual, so anyone around the world can attend. Visit https://rstats.ai/gov/ to learn more and use code nyhackr for a 20% discount on tickets, including workshops.
Following up our earlier meetup about drake, we have its creator talking about its successor, targets.
About the Talk:
Ambitious workflows in R, such as machine learning analyses, can be difficult to manage. A single round of computation can take several hours to complete, and routine updates to the code and data tend to invalidate hard-earned results. You can enhance the maintainability, hygiene, speed, scale, and reproducibility of such projects with the targets R package. targets resolves the dependency structure of your analysis pipeline, skips tasks that are already up to date, executes the rest with optional distributed computing, and manages data storage for you. It surpasses the permanent limitations of its predecessor, drake, and provides increased efficiency and a smoother user experience. This talk demonstrates how to create and maintain a Bayesian model validation project using targets-powered automation.
About Will:
Will Landau works at Eli Lilly and Company, where he develops capabilities for clinical statisticians, and he is the creator and maintainer of the targets and drake R packages. Will earned his PhD in Statistics at Iowa State University in 2016, where his dissertation research applied Bayesian methods, hierarchical models, and GPU computing to the analysis of RNA-seq data.
Where to find Will:
https://twitter.com/wmlandau
https://wlandau.github.io/
https://www.linkedin.com/in/wlandau/
Thank you EcoHealth Alliance (https://www.ecohealthalliance.org/) for providing the Zoom link.
Видео Will Landau - Reproducible Computation at Scale in R with Targets [Remote] канала Lander Analytics
Following up our earlier meetup about drake, we have its creator talking about its successor, targets.
About the Talk:
Ambitious workflows in R, such as machine learning analyses, can be difficult to manage. A single round of computation can take several hours to complete, and routine updates to the code and data tend to invalidate hard-earned results. You can enhance the maintainability, hygiene, speed, scale, and reproducibility of such projects with the targets R package. targets resolves the dependency structure of your analysis pipeline, skips tasks that are already up to date, executes the rest with optional distributed computing, and manages data storage for you. It surpasses the permanent limitations of its predecessor, drake, and provides increased efficiency and a smoother user experience. This talk demonstrates how to create and maintain a Bayesian model validation project using targets-powered automation.
About Will:
Will Landau works at Eli Lilly and Company, where he develops capabilities for clinical statisticians, and he is the creator and maintainer of the targets and drake R packages. Will earned his PhD in Statistics at Iowa State University in 2016, where his dissertation research applied Bayesian methods, hierarchical models, and GPU computing to the analysis of RNA-seq data.
Where to find Will:
https://twitter.com/wmlandau
https://wlandau.github.io/
https://www.linkedin.com/in/wlandau/
Thank you EcoHealth Alliance (https://www.ecohealthalliance.org/) for providing the Zoom link.
Видео Will Landau - Reproducible Computation at Scale in R with Targets [Remote] канала Lander Analytics
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Deep Learning vs Machine Learning in RMarlorie Hughes - Dashboarding Like a BossPanel Data (Fixed Effects, Random Effects) - R for Economists Moderate 9Web Apps in R: Building your First Web Application in R | Shiny Tutorial Ep 1targets: IntroductionAndrew Gelman - Solve All Your Statistics Problems Using P-ValuesVerifying Authenticity With shasum -a 256Alex Gold - Reliably Reproducible R-TifactsDavid Robinson - The Procrastinator’s Guide to a Data Science CareerTidy Spreadsheets in Medical Research - Peter Higgins - R/Medicine Workshop - 2021-02-23Data Share - source function in R [r tutorial]Kevin Ushey | renv: Project Environments for R | RStudio (2020)David Robinson - Ten Tremendous Tricks in the TidyverseR language tip: Test your code with testthatAnomalies, Breaks, and Outlier Detection in Time Series (in R for Data Science User Group, Feb 2021)Cracking into the Meat of R with lobstr: Console Visualisations that Explain How Stuff WorksEfficient data analysis and reporting: DRY workflows in RMalcolm Barrett - Zen and the Art of R Package DevelopmentA bioinformatics tutorial on Bacterial Genome Analysis with Real World Data and commandsNoam Ross - Nonlinear Models in R: The Wonderful World of mgcv