[DSBA] CS224n 2021 Study | #10 Transformers and Pretraining
발표자: 석박사통합과정 김재희
lecture 10 : Transformers and Pretraining
---
DSBA CS224n winter 2021 Study
스터디 정보: https://github.com/yookyungkho/DSBA_CS224N_2021
CS224n 강의 영상: https://youtu.be/j9AcEI98C0o
Reference
- CS224N Lecture 10 강의 : https://youtu.be/j9AcEI98C0o
- CS224N Lecture 10 슬라이드 : https://web.stanford.edu/class/archive/cs/cs224n/cs224n.1214/
- GPT-2 figure : https://jalammar.github.io/illustrated-gpt2/
- Wordpiece : https://ratsgo.github.io/nlpbook/docs/preprocess/bpe/
- Other review :
DSBA T5 : http://dsba.korea.ac.kr/seminar/?mod=document&uid=247
투빅스 BERT : https://velog.io/@tobigs-nlp/BERT-Pre-training-of-Deep-Bidirectional-Transformers-for-Language-Understanding
투빅스 GPT-1 : https://velog.io/@tobigs-nlp/Improving-Language-Understandingby-Generative-Pre-Training-GPT-1
Видео [DSBA] CS224n 2021 Study | #10 Transformers and Pretraining канала 고려대학교 산업경영공학부 DSBA 연구실
lecture 10 : Transformers and Pretraining
---
DSBA CS224n winter 2021 Study
스터디 정보: https://github.com/yookyungkho/DSBA_CS224N_2021
CS224n 강의 영상: https://youtu.be/j9AcEI98C0o
Reference
- CS224N Lecture 10 강의 : https://youtu.be/j9AcEI98C0o
- CS224N Lecture 10 슬라이드 : https://web.stanford.edu/class/archive/cs/cs224n/cs224n.1214/
- GPT-2 figure : https://jalammar.github.io/illustrated-gpt2/
- Wordpiece : https://ratsgo.github.io/nlpbook/docs/preprocess/bpe/
- Other review :
DSBA T5 : http://dsba.korea.ac.kr/seminar/?mod=document&uid=247
투빅스 BERT : https://velog.io/@tobigs-nlp/BERT-Pre-training-of-Deep-Bidirectional-Transformers-for-Language-Understanding
투빅스 GPT-1 : https://velog.io/@tobigs-nlp/Improving-Language-Understandingby-Generative-Pre-Training-GPT-1
Видео [DSBA] CS224n 2021 Study | #10 Transformers and Pretraining канала 고려대학교 산업경영공학부 DSBA 연구실
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![[Paper Review] Towards better understanding of self supervised representations](https://i.ytimg.com/vi/_kDXbslDf3w/default.jpg)
![[Paper Review] RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain..](https://i.ytimg.com/vi/h0miMLkDIbk/default.jpg)
![[Paper Review] Various Methods to develop Verbalizer in Prompt-based Learning (KPT, WARP)](https://i.ytimg.com/vi/p6DxEBbodYk/default.jpg)
![[Paper Review] C2-CRS: Coarse-to-Fine Contrastive Learning for CRS](https://i.ytimg.com/vi/aOWGFnnT_X8/default.jpg)
![[Paper Review] DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning](https://i.ytimg.com/vi/3pg-bq_QwFQ/default.jpg)
![[Paper Review] How Much Knowledge Can You Pack Into the Parameters of a Language Model?](https://i.ytimg.com/vi/kZObjiaeY4s/default.jpg)
![[Paper Review] Speech to Speech Translation](https://i.ytimg.com/vi/WNadpnlnxFA/default.jpg)
![[Paper Review]ON CONCEPT-BASED EXPLANATIONS IN DEEP NEURAL NETWORKS](https://i.ytimg.com/vi/-l8vqkCu91M/default.jpg)
![[Paper Review] Open Source LMs](https://i.ytimg.com/vi/TLisXrictso/default.jpg)
![[Paper Review] Masked Image Modeling](https://i.ytimg.com/vi/ifS7Mzbylk4/default.jpg)
![[Paper Review] WinCLIP: Zero-/few-shot anomaly classification and segmentation.](https://i.ytimg.com/vi/iFO-QhTMpes/default.jpg)
![[Paper Review] Towards Total Recall in Industrial Anomaly Detection](https://i.ytimg.com/vi/mEY4qjZcNsw/default.jpg)
![[Paper Review] LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation](https://i.ytimg.com/vi/rgWEM2-xqkw/default.jpg)
![[Paper Review] DeepTIMe: Deep Time-Index Meta-Learning for Non-Stationary Time-Series](https://i.ytimg.com/vi/-klr7a7jyus/default.jpg)
![[Paper Review] Non-Autoregressive Neural Machine Translation (Gu et al., ICLR 2018)](https://i.ytimg.com/vi/vLmfTvqBWio/default.jpg)
![[Paper Review] Asymmetric Student-Teacher Networks for Industrial Anomaly Detection](https://i.ytimg.com/vi/OnDzxsKbCQw/default.jpg)
![[Paper Review] BEIT: BERT Pre-Training of Image Transformers](https://i.ytimg.com/vi/uCWhUayAwOY/default.jpg)
![[Paper Review] Momentum Contrast for Unsupervised Visual Representation Learning](https://i.ytimg.com/vi/j9avVME3g54/default.jpg)
![[Paper Review] Community Detection in graphs](https://i.ytimg.com/vi/MmKlFbexFGo/default.jpg)
![[Paper Review] AER: Auto-Encoder with Regression for Time Series Anomaly Detection](https://i.ytimg.com/vi/ROqdDm8ilBQ/default.jpg)