Zilliz - Overview of Generative models in ML

chloewilliams62 86 views 32 slides May 01, 2024
Slide 1
Slide 1 of 32
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32

About This Presentation

Learn the following topics:

What is Generative AI?
Use cases of generative AI
Large Language Models, Neural Networks, and Parameters
GAN (Generative Adversarial Network)
Diffusion models
Multimodal models


Slide Content

Overview of Generative models in ML
-Aishit Dharwal

Generative AI

Generative AI
A branch of AI that focuses on creating new data.
It utilizes machine learning algorithms to learn the
underlying patterns and relationships within a
dataset and then uses that knowledge to generate
entirely new content that resembles the training
data.

Generative AI
A branch of AI that focuses on creating new data.
It utilizes machine learning algorithms to learn the
underlying patterns and relationships within a
dataset and then uses that knowledge to generate
entirely new content that resembles the training
data.

●Image Generation: Creating photorealistic
images, editing existing photos, and
generating artistic variations.

●Image Generation: Creating photorealistic
images, editing existing photos, and
generating artistic variations.
●Text Generation: Writing different creative
text formats like poems, scripts, or even code
in a coherent and stylistic manner.

●Image Generation: Creating photorealistic
images, editing existing photos, and
generating artistic variations.
●Text Generation: Writing different creative
text formats like poems, scripts, or even code
in a coherent and stylistic manner.
●Video Generation: Producing realistic videos
from scratch or editing existing videos by
adding or removing elements.

●Image Generation: Creating photorealistic
images, editing existing photos, and
generating artistic variations.
●Text Generation: Writing different creative
text formats like poems, scripts, or even code
in a coherent and stylistic manner.
●Video Generation: Producing realistic videos
from scratch or editing existing videos by
adding or removing elements.
●Music Generation: Composing novel musical
pieces in various styles.

Oreilly Media

LLM

LLM
A large language model (LLM) is a deep learning algorithm
that can perform a variety of natural language processing
(NLP) tasks. Large language models use transformer
models and are trained using massive datasets — hence,
large. This enables them to recognize, translate, predict, or
generate text or other content.

LLM
A large language model (LLM) is a deep learning algorithm
that can perform a variety of natural language processing
(NLP) tasks. Large language models use transformer
models and are trained using massive datasets — hence,
large. This enables them to recognize, translate, predict, or
generate text or other content.
GPT 4 is one of the most versatile LLMs. It is multimodal - vision and text - can
write code as well

https://www.linkedin.com/pulse/demystifying-llms-quick-summary-andrej-karpat
hys-intro-lye-jia-jun-r70jc/
Next word prediction task
forces the model to learn a
lot about the world.

https://www.marktechpost.com/2023/10/15/this-artificial-intelligence-survey-research-provides-a-comprehensive-
overview-of-large-language-models-applied-to-the-healthcare-domain/

GAN - Generative Adversarial Network

GAN - Generative Adversarial Network
GANs can generate high-quality synthetic data, making
them valuable for tasks like image generation, data
augmentation, and more.

GAN - Generative Adversarial Network
GANs can generate high-quality synthetic data, making
them valuable for tasks like image generation, data
augmentation, and more.
In a GAN, two neural networks contest with each other in
the form of a zero-sum game, where one agent's gain is
another agent's loss.

https://towardsdatascience.com/decoding-the-basic-math-in-gan-simpl
ified-version-6fb6b079793

https://developers.google.com/machine-learning/gan/gan_structure

https://developers.google.com/machine-learning/gan/gan_structure

Diffusion Models

Diffusion Models
Diffusion models are generative models that can create new data samples by starting
with basic data and gradually transforming it into more complex and realistic data.
Used for tasks like image generation, data denoising, and probabilistic data
generation.

Diffusion Models
Diffusion models are generative models that can create new data samples by starting
with basic data and gradually transforming it into more complex and realistic data.
Used for tasks like image generation, data denoising, and probabilistic data
generation.
Inspired by non-equilibrium thermodynamics: Add randomness and reverse it
successfully!
Add noise to original image and learn how to reverse the process.

Diffusion Models
Diffusion models are generative models that can create new data samples by starting
with basic data and gradually transforming it into more complex and realistic data.
Used for tasks like image generation, data denoising, and probabilistic data
generation.
Inspired by non-equilibrium thermodynamics: Add randomness and reverse it
successfully!
Add noise to original image and learn how to reverse the process.
1.Stable Diffusion - uses latent diffusion model
2.DALL-E

https://en.wikipedia.org/wiki/Diffusion_model

Multimodal Models

Multimodal Models
A Multimodal Generative model draws outputs from a
combination of multiple data types to provide
responses as insights, content, and more.
Capable of processing information from different
modalities, including images, videos, and text.

Assembly AI

Assembly AI

Assembly AI

Feel free to send in your
questions in the
comments/DM on Linkedin
Tags