HusseinMalikMammadli
37 views
33 slides
Mar 04, 2025
Slide 1 of 33
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
About This Presentation
"Misleading Statistical Facts " ,bizə Data Collection bias, p-dəyərlərinin sirrini və hətta "Black swan or black cat" hadisələrini anlamağımıza kömək edəcək.
Sitarə Ağayeva , Paşa Holdingde Data Scientist və Model Validator expert kimi çalışır.Sitara eyni...
"Misleading Statistical Facts " ,bizə Data Collection bias, p-dəyərlərinin sirrini və hətta "Black swan or black cat" hadisələrini anlamağımıza kömək edəcək.
Sitarə Ağayeva , Paşa Holdingde Data Scientist və Model Validator expert kimi çalışır.Sitara eyni zamanda data həvəskarlarının bir yerə toplandığı Challengers Deep Educational qurucusu və Women in Data Science icmasının səfirlərindən biridir.
Size: 863.58 KB
Language: en
Added: Mar 04, 2025
Slides: 33 pages
Slide Content
HOW DO THE
NUMBERS
FOOL US?
THINGS YOU PREFER NOT
TO THINKABOUT
SITARA AGHAYEVA
01 Modelling the
World
Are all theories true?
Modelling is
the way of finding
the nearest
representation of truth.
01
Not the actual truth itself.
02 Do we define
accurately?
Are operational
definitions important?
Xoşbaxtlığ, xoşbaxtlığ
02
Not the actual truth itself.
How you define happiness?
Examples:
02
But what is happiness?
Imagine reading a research
saying people who eat candies
are happier?
03 Data is source
for knowledge.
But is all data good?
Sampling matters.
03
Are we able to reach to
the truth?
Representativeness
Response error
Non-response error
Innate bias
Examples:
03
Are we able to reach to
the truth?
Survey about depression
Survey about access to internet
Survey about salary
Crime rates in Malta
We see what we want.
03
Are we willing to reach to
the truth?
Confirmation bias
Selection bias
Examples:
03
Are we willing to reach to
the truth?
People who leaves universities
become successful, because Bill
Gates did. (btw what is
success?)
Why does more
data distort us?
Answer in the last slide.
04 Finding a meaning
in randomness
Is P-value honest?
Less probable things are
more probable if you try
enough chances.
04
Monte-Carlo simulation
P-value hacking
04
Monte-Carlo simulation
Try enough experiments and
at least one of them will make
you right.
05 Correlation and
causation
Are we meant to be
together?
A car that racing you
every day: is s/he really for
you?
05
Ceteris Paribus
Solution:
05
Try to go other direction.
Look for ceteris paribus (just
stop the car to see s/he will go
on or not.)
Ceteris Paribus
" " Stop for a while to see who is
coming with you and who just
coincides with you.
06 Reporting issue
Have we made a great
progress?
I got 10.
06
Actual values are not
comparable.
out of 100.
Our company increased
sales by 100%.
06
And percents can be
mind blowing.
From 1 to 2.
Solution:
06
Require both actual and
percentage reviews.
Calculate different rations, do
not rely on one for performance.
Comprehensive reporting
07 Predictibility
Randomness drives us
mad.
What is randomness and
do computers make it?
07
Roll a dice to know.
Answer:
07
But why?
Randomness is the thing that we
cannot see the reason behind and
cannot predict.
Why does subjectivity
exist?
07
Is not there only
one truth?
Elephant analogy
Answer:
07
Why does more data
distort us?
We are not able to see everything.
and partial truth may lead us
false.
Examples:
07
Are we willing to reach to
the truth?
Imagine your versions in
different people’s mind: you are
the same person, but everyone
remembers you differently.
THANKS
A LOT!
SITARA AGHAYEVA
Devil’s Advocate
Nassim Nicholas Taleb - “Black Swan”
Jim Al Khalili - “Paradox”
Vaclav Smil - “Numbers don’t lie”
Literature