Multimodal RAG with Milvus and GPT-4o Webinar

chloewilliams62 250 views 18 slides Aug 29, 2024
Slide 1
Slide 1 of 18
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18

About This Presentation

We've seen an influx of powerful multimodal capabilities in many LLMs. In this tutorial, we'll vectorize a dataset of text and images into the same embedding space, store them in Milvus, retrieve all relevant data given an LLM query, and input multimodal data as context into GPT-4o.

​Topi...


Slide Content

1 | © Copyright 2024 Zilliz1 1| © Copyright 9/25/23 Zilliz1| © Copyright 9/25/23 Zilliz
Speaker
Jiang Chen
Ecosystem & Developer Experience

[email protected]
@jiangc1010

2 | © Copyright 2024 Zilliz2
Multimodal RAG with Milvus and GPT4o
Jiang Chen @ Zilliz

3 | © Copyright 2024 Zilliz3
01Multi-modal Embeddings
CONTENTS
02Multi-modal Search in Milvus
Demo of Multi-modal RAG with Milvus03

4 | © Copyright 2024 Zilliz4
Multi-modal Embeddings

5 | © Copyright 2024 Zilliz5
Information Retrieval IR) at Multi-modal Setting
●CLIP 2021

6 | © Copyright 2024 Zilliz6
●independently process image and text modalities
○A classifier at heart
Information Retrieval IR) at Cross-modality Setting

7 | © Copyright 2024 Zilliz7
Visualized BGE
●establishes the in-depth fusion of text and image data
●enables the preservation of the original performance of text
embedding
○as the text encoder is fully fixed while the visual tokens are incorporated

8 | © Copyright 2024 Zilliz8
Visualized BGE - training technique

9 | © Copyright 2024 Zilliz9
MagicLens

10 | © Copyright 2024 Zilliz10
MagicLens

11 | © Copyright 2024 Zilliz11
Multi-modal Search in Milvus

12 | © Copyright 2024 Zilliz12
Retrieval-Augmented Generation

13 | © Copyright 2024 Zilliz13

14 | © Copyright 2024 Zilliz14

15 | © Copyright 2024 Zilliz15
Data Model Design in Milvus

16 | © Copyright 2024 Zilliz16
Demo of Multi-modal RAG with
Milvus

17 | © Copyright 2024 Zilliz17
Useful Links
●CLIP https://arxiv.org/abs/2103.00020
●Visualized BGE https://arxiv.org/abs/2406.04292
●MagicLens: https://arxiv.org/abs/2403.19651
●Multimodal RAG with Milvus ?????? :
https://milvus.io/docs/multimodal_rag_with_milvus.md
●Image Search with Milvus
https://milvus.io/docs/image_similarity_search.md
●Multimodal Image Search online demo:
https://multimodal-demo.milvus.io/
●Hybrid Image and Text search with Multi-vector in Milvus:
https://github.com/yiwen92/Milvus_hybridsearch/blob/main/multi-
modal-demo/demo.ipynb

18 | © Copyright 2024 Zilliz18
T H A N K Y O U
@jiangc1010
Tags