[DL Reading Group] GLIDE: Guided Language to Image Diffusion for Generation and Editing
DeepLearningJP2016
21 slides
Jan 07, 2022
About This Presentation
2022/01/07
Deep Learning JP:
http://deeplearning.jp/seminar-2/
Slide Content
DEEP LEARNING JP [DL Papers]
GLIDE: Guided Language to Image Diffusion for Generation and Editing
Xin Zhang, Matsuo Lab
http://deeplearning.jp/
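Bibliographic Information
Title: GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models (arXiv)
Authors: Alex Nichol, Prafulla Dhariwal, Aditya Ramesh et al. (OpenAI), 20 Dec 2021
Overview
A diffusion model that generates realistic images from text.
The implementation uses two conditioning methods and incorporates several refinements.
It succeeds in generating high-quality images, and a smaller model has been released.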
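The two conditioning methods mentioned in the overview are, in the GLIDE paper, CLIP guidance and classifier-free guidance. The following is a minimal sketch of the classifier-free guidance combination rule only, assuming the model's noise predictions are plain lists of floats. The function name `classifier_free_guidance` and the toy values are hypothetical illustrations, not taken from the slides or from any released code.

```python
from typing import List, Sequence


def classifier_free_guidance(eps_cond: Sequence[float],
                             eps_uncond: Sequence[float],
                             scale: float) -> List[float]:
    """Combine text-conditional and unconditional noise predictions.

    Computes eps_hat = eps_uncond + scale * (eps_cond - eps_uncond).
    scale = 1 returns the conditional prediction unchanged;
    scale > 1 extrapolates further in the direction of the text condition.
    """
    if len(eps_cond) != len(eps_uncond):
        raise ValueError("predictions must have the same length")
    return [u + scale * (c - u) for c, u in zip(eps_cond, eps_uncond)]


# Toy numbers standing in for real model outputs (hypothetical values).
eps_text = [0.5, -1.0, 2.0]    # prediction given the text prompt
eps_empty = [0.0, -0.5, 1.0]   # prediction given an empty prompt
guided = classifier_free_guidance(eps_text, eps_empty, scale=3.0)
```

In a real sampler the two predictions would come from the same network evaluated with and without the caption, and the guided prediction would replace the conditional one at each denoising step.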
Introduction
DALL-E (dVAE)
StyleCLIP (StyleGAN)
CLIP + Generative Model
Safety Considerations & Limitations
The released small model was trained on a smaller, filtered dataset.
It fails to capture certain prompts that describe highly unusual objects or scenarios.
Impressions
Looking forward to research on video generation.
It works even if you are not good at drawing.
"an oil painting of happy new year"
"a cartoon of Mount Fuji"