**Artificial Intelligence (AI)** is a transformative field of computer science dedicated to creating systems that can perform tasks typically requiring human intelligence. This includes everything from simple problem-solving to complex abilities like learning, reasoning...
### A single paragraph on AI
**Artificial Intelligence (AI)** is a transformative field of computer science dedicated to creating systems that can perform tasks typically requiring human intelligence. This includes everything from simple problem-solving to complex abilities like learning, reasoning, and language understanding. At its core, AI relies on powerful algorithms and vast amounts of data to identify patterns and make predictions. Key sub-fields, such as **Machine Learning** and **Deep Learning**, have revolutionized technology, enabling applications like personalized recommendations, self-driving cars, and virtual assistants. As AI continues to evolve, it promises to reshape industries and redefine our daily lives, while also presenting crucial ethical challenges related to data privacy, bias, and its impact on the workforce.
Size: 3.48 MB
Language: en
Added: Aug 27, 2025
Slides: 79 pages
Slide Content
Introduction to Computer Vision
PrinsipkecerdasanArtifisial
Dr. DyahArumingTyas.
1
What is Computer Vision ?
What is Computer Vision?
Make computers understand images and videos.
•What kind of scene?
•Where are the cars?
•How far is the building?
What is Computer Vision?
Make computers understand images and videos.
•What are they doing?
•Why is this happening?
•What is important?
•What will I see?
The goal of computer vision is to give
computers (super) human-level perception
typical perception pipeline
typical perception pipeline
typical perception pipeline
typical perception pipeline
Important note: In general, computer vision
does not work
Important note: In general, computer vision
does not work
(except in certain situation / condition)
Computer Vision
•Com
16
Computer
Artificia
nVisio
puter Vision
Machine
Learning
l Intelligence
Natural
Language
Processing
Robotics
Deep Learning
Computer Vision
•One of the fastest growing field of A.I.
• Computer Vision
17
Computer Vision
•Cisco: by 2016, >85% of Internet data is in form of pixels (multimedia)
• Computer Vision
19
Computer Vision
•YouTube :
every second,
about 10 hours of videos
are being uploaded
•Impossible to process
manually
http://www.everysecond.io/youtube
• Computer Vision
20
Age of Vision
21
• Computer Vision
Not This Vision
• Computer Vision
22
Computer Vision
•Vision data (raw pixel data) are one of the hardest data to harness
•Digital Dark Matter of The Internet
• Computer Vision
23
Recognition via Edge Detection
•Computer Vision
43
[John Canny, 1986]
[David Lowe, 1987] 1963
1959
CV
1966
1970
1986
AI Winter II
Recognition via Grouping
Normalized Cuts
[Shi & Malik, 1997]
CV AI Winter II
1959 1997
1963
1966
1970
1986
• Computer Vision
44
Recognition via Matching
Scale-Invariant Feature Transform (SIFT)
[David Lowe, 1999]
1963
1959
1966
1970
CV
1986
1997
AI Winter II
1999
• Computer Vision
45
Face Detection
[Viola & Jones, 2001]
CV AI Winter II
1959 1997
1963 1999
1966 2001
1970
1986
• Computer Vision
46
Recognition via Features and Parts
Deformable Part Model
[Felzenswalb, McAllester,
Ramanan, 2009]
1963
1959
1966
1970
CV
1986
1997
AI Winter II
1999
2001
2005
Histogram of Oriented
Gradients (HOG)
[Dalal & Triggs, 2005]
• Computer Vision
47
Popularity of
AI/ML in CV
There is a number of visual recognition problems that are related to image classification,
such as
object detection, image captioning, video classification
56
• Computer Vision
Classification
•Input
•Output
: Image,
there may be an object inside
: Class Label
Single object per image
•Evaluation metric:
Accuracy
•Also called Recognition, Identification
CAT
• Computer Vision
3/8/2023 89
Localization
•Input
•Output
: Image,
there IS an object inside
: Box in the image (�, �, �, ℎ)
Single object, specific
•Evaluation metric:
Intersection over Union
(�, �, �, ℎ)
• Computer Vision
3/8/2023 90
Classification + Localization
•Input: Image,
there may be an object inside
•Output : Class Label and Box (�, �, �, ℎ)
•Evaluation metric:
Intersection over Union
Accuracy
•Only one object, simpler than detection
•Single Object Detection
CAT
• Computer Vision
3/8/2023 91
(�, �, �, ℎ)
Object Detection
•Input: Image, there may be
one or more objects inside
•Output : Class Label and Box (�, �, �, ℎ)
for each object
•Evaluation metric:
Intersection over Union
Accuracy, MAP, AR
CAT , DOG , DUCK
�, �, �, ℎ , �, �, �, ℎ , �, �, �, ℎ
• Computer Vision
3/8/2023 92
Semantic Segmentation
•Input: Image, there may be
one or more objects inside
•Output : Pixel label
•Evaluation metric:
Intersection over Union
Jaccard Index, Dice,
Accuracy, MAP, AR
• Computer Vision
3/8/2023 93
Instance Segmentation
•Input: Image, there may be
one or more objects inside
•Output : Pixel label for each instance
•Evaluation metric:
Intersection over Union
Jaccard Index, Dice,
Accuracy, MAP, AR
• Computer Vision
3/8/2023 94
Summing it up
• Computer Vision
3/8/2023 95
•Classification / Recognition
•Localization
•Classification + Localization
•Object Detection
•Segmentation
•Instance Segmentation
: What is
: Where is
: What and where is
: What and where are
: Which label pixel is
: Which instance pixel is