SFSCON23 - Markus Pobitzer - Image Generation with Diffusion Models

SFScon 69 views 11 slides Dec 04, 2023
Slide 1
Slide 1 of 11
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11

About This Presentation

Recent machine learning developments saw a breakthrough in generating images. So-called Diffusion Models can create photo-realistic images from noise. With the help of an input text (prompt) we can guide the generation and produce matching images.

This technology opened new doors for creating digit...


Slide Content

“Standing on top of the highest mountain, looking down to the other peaks, alps.
An award-winning landscape photo of South Tyrol”

Image Generation with
Diffusion Models
How computers imagine our world
Markus Pobitzer

Overview
Diffusion Models

Stable Diffusion (SD)
(Text) Guidance Stable Diffusion Output
“Majestic royal ship on
a calm sea, oil
painting, …”

Inpainting with Stable Diffusion
Input Mask Output
Original imagefrom: https://unsplash.com/@overture_creations

Billions
ofImage-Text pairs
256
Nvidia A100 (40GB) GPUs
150000
GPU Hours
Training Stable Diffusion(v1)
Source: SD ModelCard, https://github.com/CompVis/stable-diffusion/blob/main/Stable_Diffusion_v1_Model_Card.md

•DALL-E
•Imagen
•Midjourney
•StableDiffusion
Image Generation Applications
unreleased
or
behindAPI
trainingcode
and
modelweights
available

Beyond Text Guidance
Input (Edges) Outputs

“Sign with SFSCON written on it” Real or Fake?
ChallengesShortcomings

Tryitout!
Generate theimageon thelefton your
devicethankstoGoogle Colab.
Scan theQR-Code orvisit:
github.com/Markus-Pobitzer/SFSCON-2023