Google AI Hackathon: LLM based Evaluator for RAG

sujitpal, 317 views, 10 slides, Apr 25, 2024

About This Presentation

Slides accompanying the project submission video for the Google AI Hackathon. They describe an LCEL- and DSPy-based evaluation framework inspired by the RAGAS project.

Accompanying video URL: https://youtu.be/yOIU65chc98


Slide Content

LLM based Evaluator for RAG, by Mayank Bhaskar, Dave Campbell, and Sujit Pal

[Diagram labels: question, context, answer, ground truth; Retriever; LLM]

[Diagram labels: question, context, answer, ground truth; Retriever; LLM. Metrics: Faithfulness, Answer Relevance, Context Precision, Context Utilization, Context Relevance, Answer Correctness, Answer Similarity, Context Recall]
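To make the slide above a little more concrete, here is a minimal sketch of one evaluation record holding the four fields it names, plus a rank-aware Context Precision score. The formula follows the RAGAS-style definition (mean of precision@k over the ranks that hold a relevant chunk), which is an assumption here since the slides do not spell it out. The names `RagRecord` and `context_precision` are hypothetical, and the per-chunk relevance verdicts are passed in as booleans rather than obtained from an LLM.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RagRecord:
    """One evaluation example with the four fields named on the slide (hypothetical type)."""
    question: str
    context: List[str]   # retrieved chunks, in ranked order
    answer: str
    ground_truth: str


def context_precision(verdicts: List[bool]) -> float:
    """RAGAS-style Context Precision (assumed definition).

    verdicts[i] says whether the chunk at rank i + 1 was judged relevant.
    The score is the mean of precision@k over the ranks k that hold a
    relevant chunk, so relevant chunks ranked earlier score higher.
    """
    hits = 0      # relevant chunks seen so far
    total = 0.0   # running sum of precision@k at relevant ranks
    for k, relevant in enumerate(verdicts, start=1):
        if relevant:
            hits += 1
            total += hits / k
    return total / hits if hits else 0.0


record = RagRecord(
    question="What does RAG stand for?",
    context=["RAG means retrieval augmented generation.", "Unrelated text."],
    answer="Retrieval augmented generation.",
    ground_truth="Retrieval augmented generation",
)
# Verdicts supplied by hand for illustration: first chunk relevant, second not.
score = context_precision([True, False])
```

In an LLM-based evaluator the verdicts would come from a model judging each chunk against the question and ground truth; keeping that step outside the scoring function makes the arithmetic easy to check on its own.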

[Diagram labels: Metrics: Faithfulness, Answer Relevance, Context Precision, Context Utilization, Context Relevance, Answer Correctness, Answer Similarity, Context Recall; JSON Snapshot; LLM]

[Diagram labels: Metrics: Faithfulness, Answer Relevance, Context Precision, Context Utilization, Context Relevance, Answer Correctness, Answer Similarity, Context Recall; LCEL; JSON Snapshot; LLM]

[Diagram labels: Metrics: Faithfulness, Answer Relevance, Context Precision, Context Utilization, Context Relevance, Answer Correctness, Answer Similarity, Context Recall; LCEL; JSON Snapshot; LLM; the same eight metrics repeated with DSPy; predicted scores]

[Diagram labels: Metrics: Faithfulness, Answer Relevance, Context Precision, Context Utilization, Context Relevance, Answer Correctness, Answer Similarity, Context Recall; LCEL; JSON Snapshot; LLM; the same eight metrics repeated with DSPy]

[Diagram labels: Metrics: Faithfulness, Answer Relevance, Context Precision, Context Utilization, Context Relevance, Answer Correctness, Answer Similarity, Context Recall; LCEL; JSON Snapshot; LLM; the same eight metrics repeated with DSPy; Manual Evaluation Tool]

[Diagram labels: Metrics: Faithfulness, Answer Relevance, Context Precision, Context Utilization, Context Relevance, Answer Correctness, Answer Similarity, Context Recall; LCEL; JSON Snapshot; LLM; the same eight metrics repeated with DSPy; Manual Evaluation Tool; Synthetic Data; Predictive Models]

https://github.com/sujitpal/llm-rag-eval 🙏