AI/ML Infra Meetup | Preference Tuning and Fine Tuning LLMs

Alluxio 439 views 25 slides Aug 30, 2024
Slide 1
Slide 1 of 25
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25

About This Presentation

AI/ML Infra Meetup
Aug. 29, 2024
Organized by Alluxio

For more Alluxio Events: https://www.alluxio.io/events/

Speaker:
- Ankit Khare (Developer Relations, @OpenAI)

This session aims to provide practical insights for AI enthusiasts on effectively customizing and leveraging LLMs in various applica...


Slide Content

Fine-Tuning & Preference Tuning
LLMs

Ankit Khare
Developer Relations, OpenAI

Overview
•Mental Model of the landscape of optimizing LLMs
•Intuition on which methods to use when
•You should leave with some level of confidence that
LLM optimization is indeed hard.
•But, no pain no gain folks!!!

False Belief on optimization
•Process is linear X
•Process is non linear

Actual Flow

Steps in the flow

Prompt Engineering

Example

Mental Model - RAG and FT

RAG Intuition

RAG Eval

Fine Tuning Intuition

FT Success Story

Fun FT Story

Fun FT Story

Fun FT Story

FT Best Practices

FT + RAG

Preference Tuning

Preference Tuning

Preference Tuning

RLHF

DPO

Explore More!

Slide credits and references
●Huggingface
https://www.youtube.com/watch
?v=QXVCqtAZAn4

●OpenAI
https://platform.openai.com/docs
/guides/fine-tuning

Thank you!

Feel free to connect!