What is Hypothesis? ๏ต Hypothesis is a predictive statement, capable of being tested by scientific methods, that relates an independent variables to some dependent variable. ๏ต A hypothesis states what we are looking for and it is a proportion which can be put to a test to determine its validity e.g. Students who receive counseling will show a greater increase in creativity than students not receiving counseling 3
Characteristics of Hypothesis ๏ต Clear and precise. ๏ต Capable of being tested. ๏ต Stated relationship between variables. ๏ต limited in scope and must be specific. ๏ต Stated as far as possible in most simple terms so that the same is easily understand by all concerned. But one must remember that simplicity of hypothesis has nothing to do with its significance. ๏ต Consistent with most known facts. ๏ต Responsive to testing with in a reasonable time. One canโt spend a life time collecting data to test it. ๏ต Explain what it claims to explain; it should have empirical reference. 4
Null Hypothesis โข The Null hypothesis H represents a theory that has been put forward either because it is believed to be true. โข it is used as a basis for an argument and has not been proven. โข For example, in a clinical trial of a new drug, the null hypothesis might be that the new drug is no better, on average, than the current drug. We would write H : there is no difference between the two drugs on an average.
Null Hypothesis 5 ๏ต It is an assertion that we hold as true unless we have sufficient statistical evidence to conclude otherwise. ๏ต Null Hypothesis is denoted by ๐ป ๏ต If a population mean is equal to hypothesised mean then Null Hypothesis can be written as ๐ป : ๐ = ๐
Alternative Hypothesis โข โข โข The alternative hypothesis, H A , is a statement of what a statistical hypothesis test is set up to establish. For example, in the clinical trial of a new drug, the alternative hypothesis might be that the new drug has a different effect, on average, compared to that of the current drug. We would write H A : the two drugs have different effects, on average. or H A : the new drug is better than the current drug, on average. The result of a hypothesis test: โReject H in favor of H A โ OR โDo not reject H โ
Alternative Hypothesis ๏ต The Alternative hypothesis is negation of null hypothesis and is denoted by ๐ป ๐ If Null is given as ๐ป : ๐ = ๐ Then alternative Hypothesis can be written as ๐ป ๐ : ๐ โ ๐ ๐ป ๐ : ๐ > ๐ ๐ป ๐ : ๐ < ๐ 6
Level of significance and confiden c e ๏ต Significance means the percentage risk to reject a null hypothesis when it is true and it is denoted by ๐ผ . Generally taken as 1%, 5%, 10% ๏ต (1 โ ๐ผ) is the confidence interval in which the null hypothesis will exist when it is true. 7
Risk of rejecting a Null Hypothesis 8 when it is true Desi g nati o n Risk ๐ถ Confidence ๐ โ ๐ถ Description Sup e rcr i t i c a l 0.001 0.1% 0.999 99.9% More than $100 million (Large loss of life, e.g. nuclear disaster Critical 0.01 1% 0.99 99% Less than $100 million (A few lives lost) Important 0.05 5% 0.95 95% Less than $100 thousand (No lives lost, injuries occur) Moderate 0.10 10% 0.90 90% Less than $500 (No injuries occur)
Type I and Type II Error Situation Decision Accept Null Reject Null Null is true Correct Type I error ( ๐ผ ๐๐๐๐๐ ) Null is false Type II error ( ๐ฝ ๐๐๐๐๐ ) Correct 9
Type I and Type II Errors 1. Type I error refers to the situation when we reject the null hypothesis when it is true (H is wrongly rejected). e.g H : there is no difference between the two drugs on average. Type I error will occur if we conclude that the two drugs produce different effects when actually there isnโt a difference. Prob (Type I error) = significance level = ฮฑ 2. Type II error refers to the situation when we accept the null hypothesis when it is false. H : there is no difference between the two drugs on average. Type II error will occur if we conclude that the two drugs produce the same effect when actually there is a difference. Prob (Type II error) = ร
Type I Error A type I error , also known as an error of the first kind It occurs when the null hypothesis ( H ) is true, but is rejected. The rate of the type I error is called the size of the test. It is denoted by the Greek letter ฮฑ (alpha). It usually equals the significance level of a test . If type I error is fixed at 5 %, it means that there are about 5% chances in 100% that we will reject H when H is true.
Type II Error Type II error , also known as an error of the second kind It occurs when the null hypothesis is false, but due to error fails to be rejected. Type II error means accepting the hypothesis which should have been rejected . A Type II error is committed when we fail to believe a truth. T h e r ate of the t y pe II error is d e no t e d by t he G r eek l e t ter ฮฒ (beta) and related to the power of a test (which equals 1-ฮฒ ) .
In the tabular form two error can be presented as follows -
Two tailed test @ 5% Significance level Acceptance and Rejection regions in case of a Two tailed test ๐ ๐๐๐๐๐ก๐๐๐ ๐๐๐๐๐๐ /๐ ๐๐๐๐๐๐๐๐๐๐๐ ๐๐๐ฃ๐๐ (๐ผ = 0.025 ๐๐ 2.5%) ๐ ๐๐๐๐๐ก๐๐๐ ๐๐๐๐๐๐ /๐ ๐๐๐๐๐๐๐๐๐๐๐ ๐๐๐ฃ๐๐ (๐ผ = 0.025 ๐๐ 2.5%) Suitable When ๐ป : ๐ = ๐ ๐ป ๐ : ๐ โ ๐ ๐๐๐ก๐๐ ๐ด๐๐๐๐๐ก๐๐๐๐ ๐๐๐๐๐๐ ๐๐ ๐๐๐๐๐๐๐๐๐๐ ๐๐๐ฃ๐๐ (1 โ ๐ผ) = 95% ๐ป : ๐ = ๐ 10
Left tailed test @ 5% Significance level Acceptance and Rejection regions in case of a left tailed test ๐ป : ๐ = ๐ ๐๐๐ก๐๐ ๐ด๐๐๐๐๐ก๐๐๐๐ ๐๐๐๐๐๐ ๐๐ ๐๐๐๐๐๐๐๐๐๐ ๐๐๐ฃ๐๐ (1 โ ๐ผ) = 95% ๐ ๐๐๐๐๐ก๐๐๐ ๐๐๐๐๐๐ /๐ ๐๐๐๐๐๐๐๐๐๐๐ ๐๐๐ฃ๐๐ (๐ผ = 0.05 ๐๐ 5%) Suitable When ๐ป : ๐ = ๐ ๐ป ๐ : ๐ < ๐ 11
Right tailed test @ 5% Significance level Acceptance and Rejection regions in case of a Right tailed test Suitable When ๐ป : ๐ = ๐ ๐ป ๐ : ๐ > ๐ ๐๐๐ก๐๐ ๐ด๐๐๐๐๐ก๐๐๐๐ ๐๐๐๐๐๐ ๐๐ ๐๐๐๐๐๐๐๐๐๐ ๐๐๐ฃ๐๐ (1 โ ๐ผ) = 95% ๐ป : ๐ = ๐ ๐ ๐๐๐๐๐ก๐๐๐ ๐๐๐๐๐๐ /๐ ๐๐๐๐๐๐๐๐๐๐๐ ๐๐๐ฃ๐๐ (๐ผ = 0.05 ๐๐ 5%) 12
Procedure for Hypothesis Te s t i ng State the null (Ho)and alternate (Ha) Hypothesis State a significance level; 1%, 5%, 10% etc. Decide a test statistics; z-test, t- test, F-test. Calculate the value of test statistics Calculate the p- value at given significance level from the table Compare the p-value with calculated value P-value > Ca l cu l a t ed value P-value < Ca l cu l a t ed value Accept Ho Reject Ho 13
Hy p othesis Testing of Means Z-TEST AND T-TEST 14
Z-Test for testing means Test Condition ๏ต Population normal and infinite ๏ต Sample size large or small, ๏ต Population variance is known ๏ต Ha may be one-sided or two sided Test Statistics ๐ โ๐ ๐ป ๐ง = ๐ ๐ ๐ 15
Z-Test for testing means Test Condition ๏ต Population normal and finite, ๏ต Sample size large or small, ๏ต Population variance is known ๏ต Ha may be one-sided or two sided Test Statistics ๐ โ ๐ ๐ป ๐ง = ๐ ๐ ๐ ร ๐ โ ๐ ๐ โ 1 16
Z-Test for testing means Test Condition ๏ต Population is infinite and may not be normal, ๏ต Sample size is large, ๏ต Population variance is unknown ๏ต Ha may be one-sided or two sided Test Statistics ๐ โ๐ ๐ป ๐ง = ๐ ๐ ๐ 17
Z-Test for testing means Test Condition ๏ต Population is finite and may not be normal, ๏ต Sample size is large, ๏ต Population variance is unknown ๏ต Ha may be one-sided or two sided Test Statistics ๐ โ ๐ ๐ป ๐ง = ๐ ๐ ๐ ร ๐ โ ๐ ๐ โ 1 18
T-Test for testing means Test Condition ๏ต Population is infinite and normal, ๏ต Sample size is small, ๏ต Population variance is unknown ๏ต Ha may be one-sided or two sided Test Statistics ๐ โ๐ ๐ป ๐ก = ๐ ๐ ๐ ๐ค๐๐กโ ๐. ๐. = ๐ โ 1 ๐ ๐ = ๐ ๐ โ ๐ 2 (๐ โ 1) 19
T-Test for testing means Test Condition ๏ต Population is finite and normal, ๏ต Sample size is small, ๏ต Population variance is unknown ๏ต Ha may be one-sided or two sided Test Statistics ๐ค๐๐กโ ๐. ๐. = ๐ โ 1 ๐ ๐ = ๐ ๐ โ ๐ 2 (๐ โ 1) ๐ โ ๐ ๐ป ๐ก = ๐ ๐ ๐ ร ๐ โ ๐ ๐ โ 1 20
Hypo t he s is testing for difference between means Z-TEST, T-TEST 21
Z-Test for testing difference between means Test Condition ๏ต Populations are normal ๏ต Samples happen to be large, ๏ต Population variances are known ๏ต Ha may be one-sided or two sided Test Statistics ๐ โ ๐ ๐ง = 1 2 ๐1 + ๐2 ๐ 2 ๐ 2 ๐ 1 ๐ 2 22
Z-Test for testing difference between means Test Condition ๏ต Populations are normal ๏ต Samples happen to be large, ๏ต Presumed to have been drawn from the same population ๏ต Population variances are known ๏ต Ha may be one-sided or two sided Test Statistics ๐ง = ๐ 1 โ ๐ 2 ๐ ๐ 2 + 1 1 ๐ 1 ๐ 2 23
T-Test for testing difference between means Test Condition ๏ต Samples happen to be small, ๏ต Presumed to have been drawn from the same population ๏ต Population variances are unknown but assumed to be equal ๏ต Ha may be one-sided or two sided Test Statistics ๐ก = ๐ 1 โ ๐ 2 ๐ 1 ๐ 1 โ 1 ๐ 2 + ๐ 2 โ 1 ๐ 2 ๐ 1 + ๐ 2 โ 2 ๐ 2 ร + 1 1 ๐ 1 ๐ 2 ๐ค๐๐กโ ๐. ๐. = (๐ 1 + ๐ 2 โ 2) 24
Hypothesis Testing for comparing two related samples PAIRED T-TEST 25
Paired T-Test for comparing two related samples Test Condition ๏ต Samples happens to be small ๏ต Variances of the two populations need not be equal ๏ต Populations are normal ๏ต Ha may be one sided or two sided Test Statistics ๐ท โ ๐ก = ๐ ๐๐๐๐. ๐ ๐ค๐๐กโ (๐ โ 1) ๐. ๐. ๐ท = Mean of differences ๐ ๐๐๐๐. = Standard deviation of differences ๐ = ๐๐ข๐๐๐๐ ๐๐ ๐๐๐ก๐โ๐๐ ๐๐๐๐๐ 26
Hypothesis Testing of propor t ions Z- TEST 27
Z-test for testing of proportions Test Condition ๏ต Use in case of qualitative data ๏ต Sampling distribution may take the form of binomial probability distribution ๏ต Ha may be one sided or two sided ๏ต ๐๐๐๐ = ๐. ๐ ๏ต ๐๐ก๐๐๐๐๐๐ ๐๐๐ฃ๐๐๐ก๐๐๐ = ๐. ๐. ๐ Test statistics ๐ง = ๐ โ ๐ ๐. ๐ ๐ ๐ = ๐๐๐๐๐๐๐ก๐๐๐ ๐๐ ๐ ๐ข๐๐๐ ๐ 28
Hypothesis Testing for difference between pro p or t ions Z- TEST 29
Z-test for testing difference between proportions Test Condition ๏ต Sample drawn from two different populations ๏ต Test confirm, whether the difference between the proportion of success is significant ๏ต Ha may be one sided or two sided Test statistics ๐ง = ๐ 1 โ ๐ 2 ๐ 1 ๐ 1 + ๐ 2 ๐ 2 ๐ 1 ๐ 2 ๐ 1 = proportion of success in sample one ๐ 2 = proportion of success in sample two 30
Hypothesis testing of equality of variances of two normal populations F - TEST 31
F-Test for testing equality of variances of two normal populations Test conditions ๏ต The populations are normal ๏ต Samples have been drawn randomly ๏ต Observations are independent; and ๏ต There is no measurement error ๏ต Ha may be one sided or two sided Test statistics ๐น = ๐ 1 ๐ 2 ๐ 2 ๐ 2 ๐ค๐๐กโ ๐ 1 โ 1 and ๐ 2 โ 1 d. f. ๐ 1 ๐ 2 ๐ 1 is the sample estimate for ๐ 2 ๐ 2 ๐ 2 ๐ 2 is the sample estimate for ๐ 2 32
Limitations of the test of Hypothesis ๏ต Testing of hypothesis is not decision making itself; but help for decision making ๏ต Test does not explain the reasons as why the difference exist, it only indicate that the difference is due to fluctuations of sampling or because of other reasons but the tests do not tell about the reason causing the difference. ๏ต Tests are based on the probabilities and as such cannot be expressed with full certainty. ๏ต Statistical inferences based on the significance tests cannot be said to be entirely correct evidences concerning the truth of the hypothesis. 33