Leveraging Open-Source LLMs for Production

andreycheptsov 206 views 22 slides Sep 26, 2024
Slide 1
Slide 1 of 22
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22

About This Presentation

This talk explores the practical use of open-source LLMs in real-world applications, discussing their pros and cons, such as privacy benefits and cost-efficiency, alongside challenges like technical expertise and deployment techniques. It covers the economics of fine-tuning and deployment, infrastru...


Slide Content

Leveraging
Open-Source LLMs
for Production


Andrey Cheptsov, Founder at

InfoQ Dev Summit, Munich, September, 2024

Closed-source vs open-source
Looking back at
predictions

Closed-source vs open-source
Source: Maxime Labonne
An open-source model closes
the gap for the first time
MMLU Pro
(5-shot)

Benchmarks
Source: Meta
Llama 3.1
Instruct

Benchmarks
Source: Qwen
Qwen 2.5
Instruct

When to use open-source models
Closed-source vs open-source
●Control (1)
●Customization (3)
●Transparency (2)
●Ecosystem (5)
●Cost-effectiveness (4)

Llama 3.1 (no
optimization)
Hardware requirements
Add text
Inference Training
A100:80Gx8 x 2 nodes A100:80Gx8 x 6 nodes

Hardware requirements
How will you even run
this?

Source: HuggingFace
Quantization
Optimization techniques

Llama 3.1 (with
optimization)
Hardware requirements
Training Inference

Optimization techniques
Low Rank Adaptation
(LoRA)
Source: HuggingFace

Llama 3.1 (with
optimization)
Hardware requirements
Training Inference

Llama 3.1 (with
optimization)
Hardware requirements
TrainingInference

Development process
Source: Sebastian Raschka
Pre-training and
Post-training

Development process
Source: Sebastian Raschka
Pre-training and
Post-training

Development process
Supervised fine-tuning
(SFT)
Source: Chip Huen

Development process
Source: Sebastian Raschka
Pre-training and
Post-training

Development process
Source: Sebastian Raschka
Pre-training and
Post-training

Reinforcement Learning
from Human Feedback
(RLHF)
Source: Chip Huen
Development process

Development process
Source: Sebastian Raschka
Pre-training and
Post-training

Source: archive.org
RLHF vs DPO
Development process

●CUDA/ROCm/XLA
●vLLM/TGI/NIM
●TRL/Axolotl
Frameworks & tools
Development process