“Implementing Transformer Neural Networks for Visual Perception on Embedded Devices,” a Presentation from VeriSilicon
embeddedvision
95 views
18 slides
Jun 26, 2024
Slide 1 of 18
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
About This Presentation
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/implementing-transformer-neural-networks-for-visual-perception-on-embedded-devices-a-presentation-from-verisilicon/
Shang-Hung Lin, Vice President of Neural Processing Products at VeriSilicon, presents the...
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/implementing-transformer-neural-networks-for-visual-perception-on-embedded-devices-a-presentation-from-verisilicon/
Shang-Hung Lin, Vice President of Neural Processing Products at VeriSilicon, presents the “Implementing Transformer Neural Networks for Visual Perception on Embedded Devices” tutorial at the May 2024 Embedded Vision Summit.
Transformers are a class of neural network models originally designed for natural language processing. Transformers are also powerful for visual perception due to their ability to model long-range dependencies and process multimodal data. Resource constraints form a central challenge when deploying transformers on embedded platforms. Transformers demand substantial memory for parameters and intermediate computations. Further, the computations involved in self-attention create challenging computation requirements. Energy efficiency adds another layer of complexity.
Mitigating these challenges requires a multifaceted approach. Optimization techniques like quantization ameliorate memory constraints. Pruning and sparsity techniques, removing less critical connections, alleviate computation demands. Knowledge distillation transfers knowledge from larger models to compact models. Lin also discusses hardware accelerators such as NPUs customized for transformer workloads, and software techniques for efficiently mapping transformer models to hardware accelerators.
Size: 1.61 MB
Language: en
Added: Jun 26, 2024
Slides: 18 pages
Slide Content
Implementing Transformer
Neural Networks for Visual
Perception on Embedded
Devices
Shang-Hung Lin
VP, NPU IP
VeriSilicon Inc.