RGB-X Model Development: Exploring Four Channel ML Workflows
chloewilliams62
147 views
20 slides
Sep 19, 2024
Slide 1 of 20
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
About This Presentation
Machine Learning is rapidly becoming multimodal. With many models in Computer Vision expanding to areas like vision and 3D, one area that has also quietly been advancing rapidly is RGB-X data, such as infrared, depth, or normals. In this talk we will cover some of the leading models in this explodin...
Machine Learning is rapidly becoming multimodal. With many models in Computer Vision expanding to areas like vision and 3D, one area that has also quietly been advancing rapidly is RGB-X data, such as infrared, depth, or normals. In this talk we will cover some of the leading models in this exploding field of Visual AI and show some best practices on how to work with these complex data formats!
Size: 2.53 MB
Language: en
Added: Sep 19, 2024
Slides: 20 pages
Slide Content
RGB-X Model Development
Exploring 4 Channel Model Development
Daniel Gural
Machine Learning & Developer
Relations - Voxel51
While Stagnation Fears Rise, Visual AI Marches On
NVDA
The Year of Scaling
> Connecting to cloud
> Securing GPUs
> Building out your stack
RGB-X Models
What is a RGB-X Model?
> A model designed to input or output four channel
images
> Utilized across all industries
Why Does This Work Matter?
> Take a step back
> Does it matter today to answer?
> Hold this question for later
How RGB-X Models Are Being Used Today
Tracking Across all Signals
> Extension of a traditional Object Detection Model
> Tracks objects across frames of any RGB-X image
Surveying Difficult Terrain
> Scan hard to access places
> Create a map of areas just by flying through a drone
> Can be performed all onboard
Sapiens and Beyond
How to Get Started
Revisiting the Why
> In a vacuum, none of these use cases move the needle
> So why is this work important?
Looking to the Future
Looking to the Future
Looking to the Future
What Does This Mean for Machine Learning?
1.
What Does This Mean for Machine Learning?
2.
What Does This Mean for Machine Learning?
3.
Try Your Hand at Monocular Depth Estimation
https://docs.voxel51.com/tutorials/monocular_depth_estimation.html