Automatically Fix Data Issues & Label Errors in Most ML Datasets | Cleanlab
ABOUT THE TALK:
In this talk, we discuss cleanlab open-source (github.com/cleanlab/cleanlab) and Cleanlab Studio (https://cleanlab.ai/studio). Cleanlab open-source is a fast-growing python framework for data-centric AI that automatically detects issues in ML datasets. Cleanlab Studio is a no-code web interface used by universities and fortune 500 companies for dataset issue detection and fixing. Cleanlab algorithms have theoretical support for improved accuracy on real-world, messy data.
ABOUT THE SPEAKER:
Curtis Northcutt is an American computer scientist and entrepreneur focusing on machine learning and AI to empower people. He is the CEO and co-founder of Cleanlab, an AI software company that improves machine learning model performance by automatically fixing data and label issues in real-world, messy datasets. Curtis completed his PhD at MIT where he invented Cleanlab’s algorithms for automatically finding and fixing label issues in any dataset.
ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers.
Make sure to subscribe to our channel for the most up-to-date talks from technical professionals on data related topics including data infrastructure, data engineering, ML systems, analytics and AI from top startups and tech companies.
FOLLOW DATA COUNCIL:
Twitter: https://twitter.com/DataCouncilAI
LinkedIn: https://www.linkedin.com/company/datacouncil-ai/
Видео Automatically Fix Data Issues & Label Errors in Most ML Datasets | Cleanlab канала Data Council
In this talk, we discuss cleanlab open-source (github.com/cleanlab/cleanlab) and Cleanlab Studio (https://cleanlab.ai/studio). Cleanlab open-source is a fast-growing python framework for data-centric AI that automatically detects issues in ML datasets. Cleanlab Studio is a no-code web interface used by universities and fortune 500 companies for dataset issue detection and fixing. Cleanlab algorithms have theoretical support for improved accuracy on real-world, messy data.
ABOUT THE SPEAKER:
Curtis Northcutt is an American computer scientist and entrepreneur focusing on machine learning and AI to empower people. He is the CEO and co-founder of Cleanlab, an AI software company that improves machine learning model performance by automatically fixing data and label issues in real-world, messy datasets. Curtis completed his PhD at MIT where he invented Cleanlab’s algorithms for automatically finding and fixing label issues in any dataset.
ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers.
Make sure to subscribe to our channel for the most up-to-date talks from technical professionals on data related topics including data infrastructure, data engineering, ML systems, analytics and AI from top startups and tech companies.
FOLLOW DATA COUNCIL:
Twitter: https://twitter.com/DataCouncilAI
LinkedIn: https://www.linkedin.com/company/datacouncil-ai/
Видео Automatically Fix Data Issues & Label Errors in Most ML Datasets | Cleanlab канала Data Council
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Hot or Not: Latest Trends & Buzzwords in DataThe Road to Exceptional Data CorrectnessHow to End the Long tail of Most Data Requests | NarratorWhat I Don't Want to Exist in the Data World in 5 Years | Seattle Data GuyData Contracts in the Modern Data Stack | WhatnotIncident Management for Data People | BigeyeContinuous Data Pipeline for Real time Benchmarking & Data Set Augmentation | TeleskopeData Product Success: Aligning with Data's Core Purpose | EnteraData Products Aren't Just for Data Teams! LightdashInnovating on Software Development | Fast AIHierarchical Forecasting in Python | NixtlaThe Story of DevRel at Snowflake - How We Got Here | SnowflakeAI The Future is NowHow to Be a 10x Analyst | HyperqueryWhat it Takes to Support the World's Most Popular Open Source Communities | NumFOCUSHow Vercel Builds Dozens of Metrics from One Heterogenous TableGenerative AI for Search | TonitaFrom 1 to IPO: Growing the Data Team and Data Culture at GitLabWhen to Move from Batch to Streaming and how to do it without hiring an entirely new team | BytewaxDesigning & Building Metric Trees