AI/ML Infra Meetup | Perspective on Deep Learning Framework
Alluxio
247 views
24 slides
May 24, 2024
Slide 1 of 24
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
About This Presentation
AI/ML Infra Meetup
May. 23, 2024
Organized by Alluxio
For more Alluxio Events: https://www.alluxio.io/events/
Speaker:
- Triston Cao (Senior Deep Learning Software Engineering Manager, @NVIDIA)
From Caffe to MXNet, to PyTorch, and more, Xiande Cao, Senior Deep Learning Software Engineer Manager,...
AI/ML Infra Meetup
May. 23, 2024
Organized by Alluxio
For more Alluxio Events: https://www.alluxio.io/events/
Speaker:
- Triston Cao (Senior Deep Learning Software Engineering Manager, @NVIDIA)
From Caffe to MXNet, to PyTorch, and more, Xiande Cao, Senior Deep Learning Software Engineer Manager, will share his perspective on the evolution of deep learning frameworks.
Size: 5.6 MB
Language: en
Added: May 24, 2024
Slides: 24 pages
Slide Content
Triston Cao, for Alluxio Meetup on May 23, 2024
PERSPECTIVE ON DEEP LEARNING FRAMEWORK
2
3
COMPUTATION GRAPH AND GRADIENT DECENT
Image credit to Deniz Yuret's Homepage: Alec Radford's animations for optimization
algorithms
21
INFERENCE WITH INT8
Ref: Achieving FP32 Accuracy for INT8 Inference Using Quantization Aware Training with NVIDIA TensorRT | NVIDIA Technical Blog
22
COMPILER BASED FRAMEWORK
https://tvm.apache.org/docs/tutorial/relay_quick_start.html
https://www.linkedin.com/pulse/exploring-jax-googles-high-performance-py
thon-library-nagilla-hwauc/
Thunder can optimize Pytorch module with
•torch.compile
•nvFuser
•cuDNN
•Apex
•TransformerEngine
•PyTorch eager
•Custom CUDA kernels through PyCUDA,
Numba, CuPy
•Custom kernels written in OpenAI Triton
https://github.com/Lightning-AI/lightning-thun
der
23
TAKE AWAYS
•Deep learning frameworks are large software projects
•NVIDIA keeps making libraries to server deep learning frameworks for GPU acceleration
•Training and inference have different challenges
•More stabilized by still fast evolving
•Compiler technology getting more integrated into the framework