Evaluating RAG pipelines built on unstructured data
chloewilliams62
118 views
11 slides
Sep 18, 2024
Slide 1 of 11
1
2
3
4
5
6
7
8
9
10
11
About This Presentation
This talk will cover different techniques for evaluating a RAG pipeline built on unstructured data. Standing up a basic RAG pipeline is becoming easier every day, however identifying weak points in your application or dataset remains a challenge. We'll review how you can use traditional assertio...
This talk will cover different techniques for evaluating a RAG pipeline built on unstructured data. Standing up a basic RAG pipeline is becoming easier every day, however identifying weak points in your application or dataset remains a challenge. We'll review how you can use traditional assertion-based evaluation techniques, LLM-as-a-Judge approaches, and embedding visualization tools to improve your pipeline using Arize Phoenix.
Size: 1.67 MB
Language: en
Added: Sep 18, 2024
Slides: 11 pages
Slide Content
Evaluating Agentic RAG Pipelines
September 2024
Hakan Tekgul
Arize AI
Solution Architect
span
span
retrieval span
span
Phoenix Library
Model Params
Eval LLM
Eval Template
Example: Retrieval
retrieval span
Span we want to evaluate
Output
User Query
Input
Documents
Eval Template
You are comparing a reference text to a question and trying to determine
if the reference text contains information relevant to answering the
question. Here is the data:
[BEGIN DATA]
************
[Question]: {query}
************
[Reference text]: {reference}
[END DATA]
Compare the Question above to the Reference text. Determine whether
the Reference text contains information that can answer the Question.