Implement Agentic RAG Using Claude 3.5 Sonnet, LlamaIndex, and Milvus
chloewilliams62
515 views
22 slides
Aug 14, 2024
Slide 1 of 22
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
About This Presentation
In this talk, we review the cutting-edge techniques for implementing Agentic Retrieval-Augmented Generation (RAG) systems, leveraging the power of Claude 3.5 Sonnet, LlamaIndex, and Milvus. Retrieval-Augmented Generation has become a cornerstone in building intelligent systems that require both gen...
In this talk, we review the cutting-edge techniques for implementing Agentic Retrieval-Augmented Generation (RAG) systems, leveraging the power of Claude 3.5 Sonnet, LlamaIndex, and Milvus. Retrieval-Augmented Generation has become a cornerstone in building intelligent systems that require both generative and retrieval capabilities, enabling more accurate, context-aware, and dynamic responses. The session will cover practical implementation details, including how to set up the RAG pipeline, integrate the components seamlessly, and optimize the system for performance and scalability. We will also explore use cases and real-world applications of Agentic RAG, demonstrating its potential in enhancing AI-driven solutions.
Size: 1.32 MB
Language: en
Added: Aug 14, 2024
Slides: 22 pages
Slide Content
Build Agentic RAG Using
Claude 3.5 Sonnet,
LlamaIndex,
and Zilliz
Bill Zhang
System Evolving
2022, 2023, 2024
●LLM
●Compound AI Systems
●Agents
Shift from Models to Compound Systems
Compound AI Systems
User
Query
Generate
(LLM)
Answer
User
Query
Search
Query(LL
M)
Generate(
LLM)
Generate(
LLM)
Compound AI
System
Limitations of models
●Limited Data/Knowledge
●Hard to Adapt
Benefits of Compound Systems
●Modular
●Easy to Adapt
●E.g. RAG
○Very Specific
Control Logic
Programmatic
Autonomy
(LLM)
Fast Slow
- +
Weakness
Certain Questions where top-k retrieval will fail:
●Summarization Questions: Summarize a document
●Comparison Questions: Compare the performance of employee A and B
●Structured Analytics + Semantic Search: Analyze the risk factors of highest
performing vector database companies globally.
●General multi-part questions: Tell me A, and Tell me B, Make a table and
generate a conclusion.
What is Agentic RAG?
RAG is just a tool for Agents.
Agents from Phase I to X
●Routing: Given user query and set of choices, output subset of
choices to route to.
●Query Planning: Break down query into parallelizable
Sub-queries. Each subquery can be executed against any RAG
●Tool Use: Use an LLM to call an API, LLM will determine the
parameters of the API.
●ReAct: Reasoning + Acting with LLMs
○Sequential multi-part problem? (Make it Loop)
○Stateful (Add memory)
ReAct Agent Model
●Dynamic Query Planning
○Plan Ahead at each Step
○Parallelize execution as much as possible
LLMCompiler