Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving
ivanruchkin
60 views
4 slides
May 27, 2024
Slide 1 of 4
1
2
3
4
About This Presentation
Presented by Zhenjiang Mao at the Robot Trust for Symbiotic Societies (RTSS) Workshop, ICRA 2024. Also by Dong-You ("Sam") Jhong at UF AI Days 2024 and the NELMS IoT conference 2024.
Out-of-distribution (OOD) detection is essential in autonomous driving, to determine when learning-based c...
Presented by Zhenjiang Mao at the Robot Trust for Symbiotic Societies (RTSS) Workshop, ICRA 2024. Also by Dong-You ("Sam") Jhong at UF AI Days 2024 and the NELMS IoT conference 2024.
Out-of-distribution (OOD) detection is essential in autonomous driving, to determine when learning-based components encounter unexpected inputs. Traditional detectors typically use encoder models with fixed settings, thus lacking effective human interaction capabilities. With the rise of large foundation models, multimodal inputs offer the possibility of taking human language as a latent representation, thus enabling language-defined OOD detection. In this paper, we use the cosine similarity of image and text representations encoded by the multimodal model CLIP as a new representation to improve the transparency and controllability of latent encodings used for visual anomaly detection. We compare our approach with existing pre-trained encoders that can only produce latent representations that are meaningless from the user's standpoint. Our experiments on realistic driving data show that the language-based latent representation performs better than the traditional representation of the vision encoder and helps improve the detection performance when combined with standard representations.
Size: 888.07 KB
Language: en
Added: May 27, 2024
Slides: 4 pages
Slide Content
Language-Enhanced Latent Representations
for Out-of-Distribution Detection
in Autonomous Driving
Zhenjiang Mao, Dong-You Jhong, Ao Wang, Ivan Ruchkin
Department of Electrical and Computer Engineering
Robot Trust for Symbiotic Societies (RTSS) Workshop
05/13/24
1
Out-of-Distribution (OOD) Detection
2
Neural Networks (NN)
NN controller
NN safety monitor
Training data
In-distribution data
Classification:
Red Light
Unseen data
Out-of-distribution(OOD) data
Classification:
Green Light