Building an ML Experimentation Platform for Easy Reproducibility | Treeverse
ABOUT THE TALK:
Quality ML at scale is only possible when we can reproduce a specific iteration of the ML experiment–and this is where data is key.
In this talk, you will learn how to use a data versioning engine to intuitively and easily version your ML experiments and reproduce any specific iteration of the experiment.
This talk will demo through a live code example:
-Creating a basic ML experimentation framework with lakeFS (on Jupyter notebook)
-Reproducing ML components from a specific iteration of an experiment
Building intuitive, zero-maintenance experiments infrastructure
-All with common data engineering stacks & open source tooling.
ABOUT THE SPEAKER:
Vino Duraisamy is a developer advocate at lakeFS, an open-source platform that delivers git-like experience to object store based data lakes.
She has previously worked at NetApp (on data management applications for NetApp data centers), on data teams of Nike and Apple, where she worked mainly on batch processing workloads as a data engineer, built custom NLP models as an ML engineer and even touched upon MLOps a bit for model deployments.
ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers.
Make sure to subscribe to our channel for the most up-to-date talks from technical professionals on data related topics including data infrastructure, data engineering, ML systems, analytics and AI from top startups and tech companies.
FOLLOW DATA COUNCIL:
Twitter: https://twitter.com/DataCouncilAI
LinkedIn: https://www.linkedin.com/company/datacouncil-ai/
Видео Building an ML Experimentation Platform for Easy Reproducibility | Treeverse канала Data Council
Quality ML at scale is only possible when we can reproduce a specific iteration of the ML experiment–and this is where data is key.
In this talk, you will learn how to use a data versioning engine to intuitively and easily version your ML experiments and reproduce any specific iteration of the experiment.
This talk will demo through a live code example:
-Creating a basic ML experimentation framework with lakeFS (on Jupyter notebook)
-Reproducing ML components from a specific iteration of an experiment
Building intuitive, zero-maintenance experiments infrastructure
-All with common data engineering stacks & open source tooling.
ABOUT THE SPEAKER:
Vino Duraisamy is a developer advocate at lakeFS, an open-source platform that delivers git-like experience to object store based data lakes.
She has previously worked at NetApp (on data management applications for NetApp data centers), on data teams of Nike and Apple, where she worked mainly on batch processing workloads as a data engineer, built custom NLP models as an ML engineer and even touched upon MLOps a bit for model deployments.
ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers.
Make sure to subscribe to our channel for the most up-to-date talks from technical professionals on data related topics including data infrastructure, data engineering, ML systems, analytics and AI from top startups and tech companies.
FOLLOW DATA COUNCIL:
Twitter: https://twitter.com/DataCouncilAI
LinkedIn: https://www.linkedin.com/company/datacouncil-ai/
Видео Building an ML Experimentation Platform for Easy Reproducibility | Treeverse канала Data Council
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![DC_THURS : dbt w/ Drew Banin](https://i.ytimg.com/vi/zc8wLzoAkVc/default.jpg)
![DevOps for Machine Learning & Other Half Truths Processes & Tools for the ML Lifecycle | DataRobot](https://i.ytimg.com/vi/z7m9B6vSVe8/default.jpg)
![Data Discovery Getting More From Your Metadata](https://i.ytimg.com/vi/WDi3rEe_Eow/default.jpg)
![Technical Founders Panel](https://i.ytimg.com/vi/mRgyDCtL6-k/default.jpg)
![Feed The Alligators With the Lights On: How Data Engineers Can See Who Really Uses Data | Stemma](https://i.ytimg.com/vi/4WO3klWEhiI/default.jpg)
![Architecting a Low-Latency Schemaless SQL Engine | Rockset](https://i.ytimg.com/vi/D3OUbQMxmcI/default.jpg)
![Building High Performance Recommender Systems with Feature Stores | Tecton](https://i.ytimg.com/vi/F7-7349p0Ok/default.jpg)
![Office Hours with Stitch Fix Data Platform](https://i.ytimg.com/vi/IabnpQAGkRo/default.jpg)
![DC_THURS on Trino](https://i.ytimg.com/vi/qGvZhwJWAaw/default.jpg)
![Enterprise Data Science Comes of Age | Anaconda](https://i.ytimg.com/vi/VZ3LLPKYjVE/default.jpg)
![Using Machine Learning and Observability Together to Reduce Incident Impact | DigitalOcean](https://i.ytimg.com/vi/QxocoT6Aeuo/default.jpg)
![Making Friends with Generative Models | Tonic](https://i.ytimg.com/vi/7WdMOfoBDpk/default.jpg)
![The Right Way to Track Mobile Data](https://i.ytimg.com/vi/qGgWe9GBUNk/default.jpg)
![DC_THURS on Feature Engineering](https://i.ytimg.com/vi/ewVwxuDizUQ/default.jpg)
![Scaling Uber's Metric System from Elasticsearch to Pinot | Uber](https://i.ytimg.com/vi/u82r_eqUaiI/default.jpg)
![Rikai: A New Data Format for Analytics on Unstructured Data at Scale](https://i.ytimg.com/vi/FVYOLcKNmsM/default.jpg)
![DC_THURS on DataHub w/ Shirshanka Das (Acryl Data)](https://i.ytimg.com/vi/lBbrilDAFMs/default.jpg)
![The Road to Exceptional Data Correctness](https://i.ytimg.com/vi/Ii2S_prglbc/default.jpg)
![How Vercel Builds Dozens of Metrics from One Heterogenous Table](https://i.ytimg.com/vi/n3KUORtd5J4/default.jpg)
![DC_THURS w/ Patrick Thompson, CEO of Iteratively](https://i.ytimg.com/vi/-6zTaAtaLzM/default.jpg)