Gen AI in Action: Real-World Applications of Image Generation Models
nithishrw
10 views
30 slides
Aug 30, 2025
Slide 1 of 30
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
About This Presentation
This talk was delivered in PyCon Poland 2025 (https://pl.pycon.org/2025/en/agenda/) on August 30, 2025.
Abstract: Explore AI-generated art with this talk on image generation models like diffusion models and GANs. Watch live demos on virtual cameras, fine-tuning for personalized images, inpainting, ...
This talk was delivered in PyCon Poland 2025 (https://pl.pycon.org/2025/en/agenda/) on August 30, 2025.
Abstract: Explore AI-generated art with this talk on image generation models like diffusion models and GANs. Watch live demos on virtual cameras, fine-tuning for personalized images, inpainting, and ControlNet. Learn practical applications, Python integration, and ethical considerations in the innovative world of AI-powered creativity.
13
Stable Diffusion Models
●Text to Image Generation
●Training
○Noising: Add random noise to
images
○Denoising: Learn to remove
noise
○Guided by Text Embeddings
○Latent Space Representation
14
Stable Diffusion Models
Inference
●Start with Random Noise
●Iteratively, reduce noise with
guidance from text
embeddings
●Image matches text input over
time
15
Let Us Build
16
Virtual Camera
●Location
●Weather
●Surroundings from
OpenStreetMap
●Inspired by Paragraphica
17
Fine Tuning Image Models
●Generate Personalized Images
●Few Example Images (5-20)
●Low-Rank Adaptation(LoRA)
18
Inpainting
●Replace parts of Image
●https://huggingface.co/spaces/
ameerazam08/FLUX.1-dev-Inp
ainting-Model-Beta-GPU
19
ControlNet
●Replace parts of Image
●Parts based on existing
content
●https://huggingface.co/spaces/
hysts/ControlNet-v1-1
20
Many More Applications
●Outpainting
●Upscaling
●Restore Old Images
●3D Characters
●Run Models Locally
○ComfyUI
○DiffusionBee
21
Key Takeaways
22
Observations on Generating Realistic Images
●Look at examples of prompts & images from the community
○https://prompthero.com/
●Good idea to use an LLM to refine the prompts
○https://ai.gock.net/flux
●More steps leads to better results, especially with Flux.1 models
●Running locally needs a lot of RAM
●Cloud is quite cheap
●With smaller models
○Text is still problematic
○Some features like hands