Day 00 - Introduction to machine learning with big data
ssusere5ddd6
11 views
18 slides
May 02, 2024
Slide 1 of 18
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
About This Presentation
Day 00 - Introduction to machine learning with big data
Size: 1.99 MB
Language: en
Added: May 02, 2024
Slides: 18 pages
Slide Content
Introduction
Machine Learning for
Big Data
Introductions
Machine Learning for Big Data
Robert Dempsey
•18 years in tech
•Degrees in Comp Sci & MBA
•Founded 3 businesses
•Founder, Data Wranglers DC
•Books
•Python Business Intelligence Cookbook (2015)
•Building Machine Learning Pipelines (2018)
•International speaker
•Instructor & consultant at District Data Labs
•I drink a LOT of coffee :)
Machine Learning for Big Data
Gimme Your Infos!
1.Your name
2.What your role is
3.Why you are attending this course
4.What you hope to get out of this course
Course Agenda
Machine Learning for Big Data
Course Setup
•8, half days
•Interactive lecture + hands-on lab
•Yes, there will be breaks
•Everyone gets a number!
Machine Learning for Big Data
Day One: Data Analytics with Hadoop
•Introduction to Distributed Computing
•The Age of Data Products
•Building Data Products at Scale
•Data Product Architectures
•Hadoop: An Operating System for Big Data
•Hadoop Architecture
•What’s In A Cluster?
•HDFS Caveats
Machine Learning for Big Data
Day Two: Data Analytics with Hadoop
•Setting Up for Big Data Analytics
•Building a Hadoop Cluster in AWS
•Building a Spark Cluster in AWS
•Configuring Your Local Environment
•Introduction to Spark
•Building Applications for Spark
•Writing Spark Applications
Machine Learning for Big Data
Day Three: Machine Learning on Big Data
•Machine Learning Overview
•Model Categories & Types of Output
•Operationalizing Machine Learning
•Threats to Machine Learning
Machine Learning for Big Data
Day Four: Machine Learning on Big Data
•Big Data Approaches
•Sampling and Fitting in Memory
•A Tour of Model Families
Machine Learning for Big Data
Day Five: Supervised Machine Learning
•Overview of Supervised Learning
•Regression Models (Algorithms)
•Model Evaluation
•Hands-on Lab: Regression at Scale
Machine Learning for Big Data
Day Six: Supervised Machine Learning
•Classification Models (Algorithms)
•Model Evaluation
Machine Learning for Big Data
Day Seven: Unsupervised Machine Learning
•Overview of Unsupervised Learning
•Distance Metrics
•Clustering Algorithms
Machine Learning for Big Data
Day Eight: Unsupervised Machine Learning
•Clustering
•Algorithms Review
•Evaluation
•Visualization
•Clustering at Scale