Загрузка страницы

2022 Adversarial Machine Learning Rising Star Award Presentation by Fatemehsadat Mireshghallah

Presentation of 2022 Adversarial Machine Learning Rising Star Award by Fatemehsadat Mireshghallah (https://cseweb.ucsd.edu/~fmireshg/)

More details about the workshop and AdvML Rising Star Award: https://sites.google.com/view/advml
___
Talk Title: How much can we trust large language models?

Abstract: Large language Models (LLMs, e.g., GPT-3, TNLG, T-5) are shown to have a remarkably high performance on standard benchmarks, due to their high parameter count, extremely large training datasets, and significant compute. Although the high parameter count in these models leads to more expressiveness, it can also lead to higher memorization, which, coupled with large unvetted, web-scraped datasets can cause multiple different negative societal and ethical impacts: leakage of private, sensitive information— i.e. LLMs are ‘leaky’, generation of biased text—i.e. LLMs are ‘sneaky, and generation hateful or stereotypical text— i.e. LLMs are ‘creepy’. In this talk, I will go over how the issues mentioned above affect the trustworthiness of LLMs, and zoom in on how we can measure the leakage and memorization of these models. Finally I will discuss what it would actually mean for large LLMs to be privacy preserving, and what are the future research directions on making large models trustworthy.

Видео 2022 Adversarial Machine Learning Rising Star Award Presentation by Fatemehsadat Mireshghallah канала TrustworthyAI
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
16 августа 2022 г. 5:02:34
00:30:35
Яндекс.Метрика