SlideShare a Scribd company logo
1 of 81
Download to read offline
B . S. Parajuli
HYPOTHESIS AND ITS TESTING PROCEDURES
B. S. Parajuli
Hypothesis
➒An assumption that we make about the population
parameter
➒A statistical hypothesis is a statement about population
usually concerned with the value of one or more
parameters of the population.
i. logically conjectured relationship
ii. Relationship between two or more variable
iii. expressed in the form of testable statement(s)
iv. Formal statement(s) / conjectured statement
v. Variables (independent and dependent)
ο‚’ The average monthly income of the people of this city is
Rs. 17,000. i.e. 𝝁 = Rs. 17,000
ο‚’ The proportion of people with cell phones in this rural
municipality is 0.62
i.e. P = 0.62
ο‚’ More than 20% of the adults watch children’s program
in the television.
ο‚’ Is the average waiting time for the customers of Smart
Supermarket at the checkouts greater than 15
minutes?
ο‚’ Is there any difference between the percentage of
internet users between Kathmandu and Pokhara ?
Purpose
will identify
independent
variables
influence upon
dependent
variable
investigate a
research
problem
enhances the
objectivity of
the study
understand
what data is
to be
collected
increase the
objectivity of
research.
concludes
what is true
or what is
false
Null Hypothesis: A statistical hypothesis or assumption made about the
population parameter to testing its validity for the possible acceptance is
called null hypothesis. It is also called hypothesis of no difference. It is
denoted by H0.
β€’ Always contains β€œ=” , β€œβ‰€β€ or β€œο‚³β€ sign
Example : The mean age of MBM students of SMC is equal to 23 years
(H0 ∢ 𝝁 = 23 years)
Alternative Hypothesis: Any hypothesis which is complementary to the
null hypothesis is called an alternative hypothesis and is denoted by H1.
ο‚— Always contains β€œβ‰ β€ , β€œ>” or β€œ<” sign
H1 ∢ 𝝁 β‰  23 years (TwoTailed)
H1 ∢ 𝝁 > 23 years (RightTailed)
H1 ∢ 𝝁 < 23 years(LeftTailed)
To determine whether the test is one-tailed or two tailed depends
upon the alternative hypothesis. If the alternative hypothesis is two
tailed we apply two-tailed test and if the alternative hypothesis is
one-tailed, we apply one-tailed test.
Two Tailed Test : A test of statistical hypothesis in which the alternati
ve hypothesis looks for the values are significantly different or not
but the direction of the difference(more or less) is not specified.
Key words like:
ο‚— differs from, is not equal to, differs significantly, same,
whether any difference, unbiased, whether equal to, not
equals to
One Tailed Test: A test of statistical hypothesis in which the alternative
hypothesis H1 looks for a definite direction of the difference (more or less)
with parameter is specified.
Thus for the comparative study of the population parameter identifying the
words such as:
less than, more than, greater than, no more than, taller than,
smaller than, better than , at least, at most , only, improved,
superior, inferior, below, above, high, low, increase, decrease,
better than, up to, majority, minority etc.
After identifying one-tailed test, the left tailed test and right
tailed tests can be identified in following way:
i. If sample statistic < Population parameter- left tailed test
ii. If sample statistic > Population parameter- Right tailed test
iii. If first sample statistic < Second sample statistic- Left tailed test
iv. If first sample statistic > Second sample statistic- Right tailed test
Level of Significance: The maximum size of type I error that we are
prepared to take risk is called the level of significance and is denoted by 𝛼.
Symbolically it is defined as
𝛼 = P(type I error)
= P (Rejecting a true null hypothesis)
If we allow Ξ± = 5%, we are likely to reject 5 cases in making decision 100
times under the same conditions.
Critical Region and Rejection Region:
The region where the null hypothesis is rejected is called critical region or
rejection region and the region for accepting true null hypothesis is called
acceptance region. If the statistic belongs to the critical region, H0 is rejected
and if the statistic belongs to the acceptance region H0 is accepted.
The critical region is established on the basis of the choice of level of
significance. If 𝛼 = 0.05(5%) then the total rejection area under the
normality curve will be 0.05(5%) and the acceptance area under the normal
probability curve will be 0.95(95%).
Critical / Significant Value:
The value which separates the acceptance region and the
rejection region is called critical value or significant value. It
depends upon
(i) The level of significance used and
(ii) The alternative hypothesis whether it is two tailed or one
tailed (left or right ) test.
Critical values
Rejection Region
/2
0
a
/2
a
Acceptance Region
Two Tailed Test
0.025 0.025
0.95
For large sample, the critical value of β€˜Z’ obtained from the statistical table
similarly for small sample the critical value of β€˜t’ obtained from the
statistical table.
When the computed test statistic lies in the acceptance region, it is
reasonable to accept the null hypothesis as it is believed to be probable true.
Acceptance
Region
0
a
Critical values
Rejection Region
0
a
Acceptance
Region
RightTailedTest LeftTailedTest
0.95
0.95
0.05
Blood
Pressure
Heridity
Regular
Exercise
Food
Habit
Weight
Age
Contd…
➒ Example
➒ H0: Food habit does not affect blood pressure.
➒ H1: Food habits affects blood pressure.
➒ Food habit: independent variable
➒ Blood pressure: dependent variable.
➒ H0: Age does not affect blood pressure.
➒ H1: As age increases blood pressure increases.
➒ age: independent variable
➒ Blood pressure: dependent variable.
➒ H0: Regular exercise does not affect blood pressure.
➒ H1: Regular exercise may decrease blood pressure.
➒ regular exercise: independent variable
➒ blood pressure: dependent variable.
Parametric and Non-parametric test
ο‚— ParametricTest:
i. Model specifies certain conditions about the parameters of the
populations from which the samples are drawn
ii. Implies that there is normal distribution
iii. The data used in this test are continuous
ο‚— Non- ParametricTest:
i. does not depend on the particular form of the basic frequency
function from which the samples are drawn
ii. do not implies that there is a normal distribution
iii. The data used in this test are frequency or ranks
iv. measuremental data: data obtained by actual measurement.
Parametric test Non-parametric test
1. Parametric tests specify certain
conditions about the parameters of
the population from which the
sample has been drawn
1. Non-parametric test do not specify
any condition about the population
parameter of the population from which
the sample has been drawn.
2. It is used in testing of hypothesis
and estimation of parameters.
2. It is used in testing of hypothesis and
but not estimation of parameters.
3. Parametric test are mostly applied
only to the data which are measured
in interval and ratio scale.
3. Non-parametric test are applied only
to the data which are measured in
nominal and ordinal data.
4. Parametric test are most powerful
test.
4. Non-parametric test are less powerful
test than parametric test.
5. It requires complicated sampling
technique.
5. It does not require complicated
sampling technique.
Criteria of Good Hypothesis Statement
β€’ Should be stated in declarative form
β€’ Should articulate (describe) a relationship between two or
more variables
β€’ Should be testable empirically
β€’ Useless if it cannot be tested empirically
β€’ Should be limited in scope
β€’ Should be clearly and precisely stated
β€’ Should no ambiguity (vegness) in the variables
β€’ Should state the conditions to apply.
β€’ Context and study units must be clear
β€’ Should reflect a guess at a solution
β€’ Should outcome to a problem based upon some knowledge,
previous research, or identified needs
β€’ Should be consistent with most known facts
Actual
situation
Decision from sample
Accept Hβ‚€ Reject Hβ‚€
Hβ‚€ is true Correct decision
No error
Probability = 1 – Ξ±
Wrong decision
Type I error
Probability = Ξ±
(Not so harmful and can
be corrected)
Hβ‚€ is false
(H₁ true)
Wrong decision
Type II error
Probability = Ξ²
(Degrees of impact of
Type II error is very
harmful and can’t be
corrected)
Correct decision
No error
Probability = 1 – Ξ²
1 – Ξ² = power of test
Types of Errors inTesting of Hypothesis
➒ If a medicine is administered to a few patients of a particular
disease to cure them and the medicine is curing the disease, but it
is claimed that it has no effect and hence it is discontinued. This
Type I error.
➒ If a medicine is administered to a few patients of a particular
disease to cure them and the medicine is not curing the disease,
but it is claimed that it have good effect and the treatment is
continued. This type II error.
Type I error : Rejecting the good lot of products
(Producer’s risk)
Type II error: Accepting the bad lot of products
(Consumer’s risk)
Steps in Hypothesis Testing
a. Critical Value approach:
Depends upon the size of
sample and nature of data
Use 𝛼 = 0.05 unless we are
given
Formulate the Null and Alternative
Hypothesis
Select the Right test statistic and
compute test statistic
Decide the level of significance
Determine Critical value
Decision: compare computed test
statistic with critical value
If Calculated Value ≀ Tabulated Value
Yes
No
It is not Significant Accept HO
It is significant Reject HO
b. p-value approach:
The p-value is the probability of getting the observed value of the
computed test statistic or a value with even greater evidence against
null hypothesis H0 if the null hypothesis is true.
In other words, the observed value of level of significance for the
testing of hypothesis assuming null hypothesis is true is called p-
value.
The p- value is the maximum rejection area under the normal
probability curve under the null hypothesis H0, when the some test
(or procedure) is applied.
➒ If p-value β‰₯ 𝛼 (level of significance) then accept null hypothesis
(H0).
➒ If p-value < 𝛼 (level of significance) then accept alternative
hypothesis (H1).
Mostly p-value can be obtained from different statistical software
based on the general output for the given problem.
Finding p-value:
(i) For Left Tailed Test:
p = P( Z < Zcal)
= 0.5 – P(0 < Z < Z cal)
(ii) For Right Tailed Test
p = P( Z >Zcal)
= 0.5 – P(0 < Z < Z cal)
(iii) For Two Tailed Test:
p = P( Z < Zcal) + P( Z >Zcal)
= 2 P(Z >Zcal)
= 2[0.5 – P(0 < Z < Z cal)] Z = 0
Zcal
Zcal
Zcal
Zcal
Z = 0
Z = 0
Two tailed test and Confidence interval
β€’ If the reference value specified in H0 lies outside the
interval (that is, is less than the lower bound or
greater than the upper bound), you can reject H0.
β€’ If the reference value specified in H0 lies within the
interval (that is, is not less than the lower bound or
greater than the upper bound), you fail to reject H0.
If hypothesis value 𝐻0: πœ‡ = πœ‡0lies within calculated
confidence interval estimate
[ ΰ΄€
𝑋 βˆ’ 𝑍 Ξ€
𝛼
2
𝑆. 𝐸. ( ΰ΄€
𝑋) < πœ‡0 < ΰ΄€
𝑋 βˆ’ 𝑍 Ξ€
𝛼
2
𝑆. 𝐸. ( ΰ΄€
𝑋)]
then null hypothesis is accepted otherwise rejected.
Z Test
26
27
β€’ Z-test is important parametric test and based on normal
probability distribution. When the samples are selected from
population of known parameter with sample size more than
30 , Z test is used. We consider that if sample size is more
than 30 then sample selected non-normal population is also
approximately normally distributed.
β€’ Z- test is defined as: 𝑍 =
𝑑 βˆ’ 𝐸(𝑑)
𝑆.𝐸.(𝑑)
β€’ 𝑇𝑒𝑠𝑑 π‘ π‘‘π‘Žπ‘‘π‘–π‘ π‘‘π‘–π‘ =
Statistic – Parameter
Standard error
Application of Z test:
(i) Test of significance of single mean
(ii) Test of significance of difference between two means
(iii) Test of significance of single proportion
(iv) Test of significance of difference between two proportions
Introduction
Critical value
Critical
values
Level of significance (Ξ±)
ZΞ± 1% 2% 5% 10%
Two tailed
test
ǀ𝑍𝛼ǀ = 2.58 ǀ𝑍𝛼ǀ = 2.33 ǀ𝑍𝛼ǀ = 1.96 ǀ𝑍𝛼ǀ = 1.645
Right tailed
test
ǀ𝑍𝛼ǀ = 2.33 ǀ𝑍𝛼ǀ = 2.05 ǀ𝑍𝛼ǀ = 1.645 ǀ𝑍𝛼ǀ = 1.28
Left tailed
test
ǀ𝑍𝛼ǀ
= βˆ’2.33
ǀ𝑍𝛼ǀ
= βˆ’2.05
ǀ𝑍𝛼ǀ =
βˆ’1.645
ǀ𝑍𝛼ǀ
= βˆ’1.28
28
Procedure
Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis.
H0: 𝝁 = 𝝁𝟎 i. e. there is no significant difference between the sample mean (𝑋) and the
population mean
H1: 𝝁 β‰  𝝁𝟎 (two tailed test), or H1 : 𝝁 > 𝝁𝟎 (right tailed test),or H1 : 𝝁 < 𝝁𝟎 (left tailed
test)
If sample mean( ΰ΄€
𝑋) > Population mean (πœ‡ ) : Right Tailed Test
If sample mean( ΰ΄€
𝑋) < Population mean (πœ‡ ) : Left Tailed Test
Step 2: Select test statistic: 𝒁 =
ΰ΄₯
𝑿 βˆ’ 𝝁
ΰ΅—
𝝈
𝒏
If 𝜎 is unknown for large samples ො
𝜎 = s
Step 3: Fix the level of significance (𝛼)
Step 4: Write the critical value (Z ) for 𝛼 % level of significance from the Z table.
Step 5: Make decision:
Critical value approach:
If Η€ZΗ€(calculated) ≀ ZΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted.
If Η€ZΗ€(calculated) > ZΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted.
P-value approach:
If p-value β‰₯ 𝛼 (level of significance) then accept null hypothesis (H0).
If p-value < 𝛼 (level of significance) then accept alternative hypothesis (H1). 29
Case I: Test of Significance of Single Mean
Example : A samples of 50 pieces of certain type of string was tested. The mean
breaking strength turned out to be 14.5 kgs. Test whether the sample is from a
batch of strings having a mean breaking strength of 15.6 kgs and standard
deviation of 2.2 kgs.
Here, Number of sample(n) = 50 Sample mean ( ΰ΄€
𝑋) = 14.5 kgs.
Population mean () = 15.6 kg Population standard deviation (𝜎) = 2.2 kgs
Formulation of Hypothesis
β€’ Null Hypothesis: Ho :  = 15.6 kgs i.e. the Population mean breaking strength of
the strings is 15.6 kg.
β€’ Alternative Hypothesis: H1 :  β‰ 15.6 kgs i.e. the mean breaking strength is not
equal to 15.6 kg. (two tailed test)
Test Statistics: Z =
ΰ΄₯
𝑿 βˆ’ 𝝁
ΰ΅—
𝝈
𝒏
=
πŸπŸ’.πŸ“βˆ’ πŸπŸ“.πŸ”
ΰ΅—
𝟐.𝟐
πŸ“πŸŽ
= = - 3.55
| zcal | = | - 3.55 | = 3.55
Level of significance (a) = 5%
Critical value: Tabulated value of z at 5% level of significance for two tailed test Ztab
= 1.96
Decision: Since Zcal ο€Ύ Ztab , Ho is rejected i.e. H1 is accepted.
Conclusion: Hence we conclude that the sample has not been drawn from the
batch with mean breaking strength of 15.6 kgs and standard deviation 2.2 kgs.
30
Example : In the past a blending process has produced an average of 5 Kg.
of waste material for every batch with standard deviation 5 Kg. From a
sample of 100 batches an average of 7 Kg. of waste per batch is obtained. At
5% level of significance is it reasonable to believe that the average has
increased?
Solution: with usual notations, n = 100, ΰ΄€
𝑋= 7 kg,  = 5 kg,  = 5 kg
Formulation of Hypothesis
β€’ H0:  = 5 kg, i.e. the average of waste material has not been increased
β€’ H1:  ο€Ύ 5 kg, i.e. the average has increased (Right tailed test)
Test Statistics: Z =
ΰ΄₯
𝑿 βˆ’ 𝝁
ΰ΅—
𝝈
𝒏
=
πŸ•βˆ’ πŸ“
ΰ΅—
πŸ“
𝟏𝟎𝟎
= 4 zcal = 4
Level of significance (a) = 5%
Critical value: Tabulated value of z at 5% level of significance for right tailed
test Ztab = 1.645
Decision: Since Zcal ο€Ύ Ztab , Ho is rejected i.e. H1 is accepted. Hence we conclude
that the average waste material has been increased.
31
Example : An insurance agent has claimed that the average age of the policy holders
who have insured through him is less than the average age for all the agents, which
is 30.55 years. A random sample of 100 policy holders who had insured through him
gave the following age distribution.
Calculate the arithmetic mean and standard deviation of this distribution and use
these values to test his claim at 5% level of significance.
Solution:
32
Class limits 16-20 21-25 26-30 31-35 36-40
Frequency 12 22 20 30 16
Age f x 𝒅′
=
π’™βˆ’πŸπŸ–
πŸ“
f𝒅′
fπ’…β€²πŸ
16-20 12 18 -2 -24 48
21-25 22 23 -1 -22 22
26-30 20 28 0 0 0
31-35 30 33 1 30 30
36-40 16 38 2 32 64
Ξ£fd’= 16 Ξ£fd’2=164
xΜ„ = A +
Ξ£fd’
n
Γ— h = 28+
16
100
Γ—5 = 28 + 0.08 = 28.8 years
𝑠 =
Οƒ 𝑓d’2
𝑛
βˆ’ (
Ξ£fd’
n
)2 Γ— h =
164
100
βˆ’ (
16
100
)2 Γ— 5 = 6.352 years
Setting of Hypothesis
H0: 𝝁 = 30.5 years i.e. the average age of the policy holders for all agents is 30.5
years.
H1: 𝝁 < 30.5 years (left tailed test) i.e. the average age of the policy holders
who have insured through him is less than the average age for all the agents.
Test Statistics: Under H0, the test statistic is Z =
ΰ΄₯
𝑿 βˆ’ 𝝁
ΰ΅—
𝝈
𝒏
=
ΰ΄₯
𝑿 βˆ’ 𝝁
ΰ΅—
𝒔
𝒏
=
πŸπŸ–.πŸ– βˆ’ πŸ‘πŸŽ.πŸ“
ΰ΅—
πŸ”.πŸ‘πŸ“πŸ‘
𝟏𝟎𝟎
= -2.68
| zcal | = | - 2.68 | = 2.68
Level of significance (a) = 5%
Critical value: Tabulated value of z at 5% level of significance for left tailed test
Ztab = -1.645 i.e. | ztab | = 1.645
Decision: Since | zcal |> | ztab | , So Ho is rejected i.e. H1 is accepted. Hence we
conclude that the average age of the policy holders who have insured through
him is less than the average age of the policy holders of all the agents.
33
Case II: Test of Significance of Difference between Two Means
34
Procedure
Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis.
H0:𝛍₁ = 𝛍₂ i. e. Two samples have been drawn from the sample parent population
H1:𝛍₁ β‰  𝛍₂(two tailed test), H1:μ₁ > ΞΌβ‚‚(right tailed test), H1:μ₁ < ΞΌβ‚‚(left tailed test)
If First sample mean(ΰ΄₯
X1) > Second sample mean(ΰ΄₯
X2) : Right Tailed Test
If First sample mean(ΰ΄₯
X1) < Second sample mean(ΰ΄₯
X2) : Left Tailed Test
Step 2: Select test statistic: Z =
ΰ΄₯
X1βˆ’ ΰ΄₯
X2
Οƒ1
2
n1
+
Οƒ2
2
n2
π‘œπ‘Ÿ Z =
(ΰ΄₯
X1βˆ’ΰ΄₯
X2)
s1
2
n1
+
s2
2
n2
π‘œπ‘ŸZ =
(ΰ΄₯
X1βˆ’ ΰ΄₯
X2)
𝜎2 1
n1
+
1
n2
Step 3: Fix the level of significance (𝜢)
Step 4: Write the critical value (Z ) for 𝛼 % level of significance from the Z table.
Step 5: Make decision:
Critical value approach:
If Η€ZΗ€(calculated) ≀ ZΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is
accepted.
If Η€ZΗ€(calculated) > ZΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is
accepted.
Step 6: Write Conclusion:
Example: An examination was given to 50 students at college A and 60
students at college B. At college A, the average marks of the students was
found to be 75 with a standard deviation of 9. At college B, the average
marks of the students was 79 with a standard deviation of 7. Is there a
significant difference between the average marks of the students at
these two colleges?
Solution: with usual notations,
College A College B
n1 = 50 n2 = 60
ΰ΄₯
X1= 75 ΰ΄₯
X2 =79
s1 = 9 s2 = 7
Setting of Hypothesis:
H0 : 𝛍₁ = 𝛍₂ i. e. there is no significant difference between the average
marks of the students at these two colleges
H1: 𝛍₁ β‰  𝛍₂ (Two tailed test) There is significant difference between the
average marks of the students at these two colleges
Test Statistic: Under H0 the test statistic is
Z =
(ΰ΄₯
X1βˆ’ΰ΄₯
X2)
s1
2
n1
+
s2
2
n2
=
(75βˆ’79)
92
50
+
72
60
= -2.56
Η€ZΗ€cal= 2.56
Level of significance(𝜢) = 5%
Critical value: Ztab= 1.96
Decision: Since Zcal > Ztab so H0 is rejected and H1 is accepted.
Conclusion: Hence we conclude that There is significant
difference between the average marks of the students at these
two colleges
36
Example: In a random sample of 500 the mean is found to be 20. In another
independent sample of 400 the mean is 15. Could the samples have been drawn from
the same population with standard deviation 4 ?
Solution: With usual notations,
Common Population S. D. (𝜎) = 4
H0:𝛍₁ = 𝛍₂ (Both samples have been
Drawn from the same population)
H1:𝛍₁ β‰  𝛍₂ (Two tailed test)
Test statistic: Z =
ΰ΄₯
X1βˆ’ ΰ΄₯
X2
𝜎2 1
n1
+
1
n2
contd…
Example : If 60 MA students are found to have a mean height of 63.60 inches and 50
MBS students have mean height of 69.51 inches, would you conclude that the
MBS students are taller than MA students? Assume the standard deviation of height
of post-graduate students to be 2.48 inches.
For MA: n1 = 60 , ΰ΄₯
X1= 63.60 For MBS n2 = 50 , ΰ΄₯
X2 = 69.51 common population s.d.
(𝜎) = 2.48
H0:𝛍₁ = 𝛍₂ (There is no difference between height of MA and MBS students)
H1:𝛍₁ < 𝛍₂ (MBS students are taller than MA students) Left Tailed Test
Test statistic: Z =
ΰ΄₯
X1βˆ’ ΰ΄₯
X2
𝜎2 1
n1
+
1
n2
contd…
37
First Sample Second Sample
n1 = 500 n2 = 400
ΰ΄₯
X1= 20 ΰ΄₯
X2 = 15
Example: Two random samples of Nepalese people taken from rural and urban region
gave the following data of income.
Test whether the average
monthly income of urban
region people is significantly
more than rural region
people.
Solution:
With usual notations,
Test Statistic: Under H0 the test statistic is
𝐙 =
(ΰ΄₯
π‘ΏπŸβˆ’ΰ΄₯
π‘ΏπŸ)
π’”πŸ
𝟐
π’πŸ
+
π’”πŸ
𝟐
π’πŸ
contd…..
38
Sample size Average
monthly
income
Standard
deviation
From Urban Region 100 1250 30
From Rural Region 150 800 50
Urban Region Rural Region
n1 = 100 n2 = 150
ΰ΄₯
X1= 1250 ΰ΄₯
X2 = 800
s1 = 30 s2 = 50
H0:𝛍₁ = 𝛍₂ (There is no significant difference
between the average income of the people of
urban and rural region)
H1:𝛍₁ > 𝛍₂ (The average monthly income of urban
region people is more than rural region people)
Right Tailed Test
Case III: Test of significance of Single Proportion
39
Procedure
Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis.
H0:P = 𝑃0i. e. the sample has been drawn from a population with proportion H1:P β‰ 
𝑃0(two tailed test), H1:P > 𝑃0(right tailed test), H1:𝑃 < 𝑃0(left tailed test)
If sample proportion(p) > Population Proportion(P) : Right Tailed Test
If Sample Proportion(p) < Population proportion(P) : Left tailed Test
Step 2: Select test statistic:𝑍 =
π·π‘–π‘“π‘“π‘’π‘Ÿπ‘’π‘›π‘π‘’
𝑆.𝐸.
=
𝑝 βˆ’ 𝑃
𝑃𝑄
𝑛
[Note: if we have sampling from a
finite population of size N, then 𝑆. 𝐸. 𝑝 =
π‘βˆ’π‘›
π‘βˆ’1
𝑃(1βˆ’π‘ƒ)
𝑛
=
π‘βˆ’π‘›
π‘βˆ’1
𝑃𝑄
𝑛
]
Step 3: Fix the level of significance (𝛼)
Step 4: Write the critical value (Z ) for 𝛼 % level of significance from the Z table.
Step 5: Make decision:
Critical value approach:
If Η€ZΗ€(calculated) ≀ ZΞ±(table or critical ) we say it is not significant (insignificant).So Hβ‚€ is
accepted.
If Η€ZΗ€(calculated) > ZΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is
accepted.
Step 6: Write Conclusion:
Example 1 : An airline must allocate available seating space between first
class passengers and economy class passengers. The null hypothesis is that
20% of the passengers fly first class but management recognizes the
possibility that the percentage could be more or less. A random sample of
400 passengers includes 70 passengers holding first class tickets. Can the
null hypothesis be rejected at the 10% level of significance?
Solution:
With usual notations, P = 0.20 , n = 400 , Q = 1- P = 1 – 0.20 = 0.80
sample proportion(p) =
π‘₯
𝑛
=
70
400
= 0.175
Setting of Hypothesis:
H0: P = 0.20 (20% of the passengers fly first class)
H1: P β‰  0.20 ( more or less than 20% passengers fly first class)
Test Statistics: Under H0 ,the test statistic is Z =
𝑝 βˆ’ 𝑃
𝑃𝑄
𝑛
=
0.175βˆ’0.20
0.20 Γ—0.80
400
= - 1.25
Zcal =1.25
Level of significance (𝜢) = 10%
Critical value: Ztab= 1.645
Decision : Since Zcal < Ztab so H0 is accepted
Conclusion: Hence we conclude that 20% of the passengers fly first class.
40
Example 2: A manufacturer claimed that at least 90% of the machine parts that is
supplied by that factory conformed to specifications. An examination of 200 such
parts revealed that 160 parts were not faulty. Determine if the manufactuer’s
claim is legitimate at 1% level of significance.
Solution: With usual notations P = 0.90 , Q = 1- P = 1- 0.90 = 0.10 n = 200 ,
x = 160, p =
π‘₯
𝑛
=
160
200
= 0.80
Setting of Hypothesis:
H0: P β‰₯0.90 (At least 90% of the machine parts is conformed to specifications)
H1:P<0.90 [Left tailed Test](Less than 90% of the machine parts is conformed to
specifications)
Test Statistics: Zcal =
𝑝 βˆ’ 𝑃
𝑃𝑄
𝑛
=
0.80βˆ’0.90
0.90 Γ—0.10
200
= -4.71 Η€ZcalΗ€ = 4.71
Level of Significance(𝜢): 1%
Critical Value: Ztab = -2.33 Η€ZtabΗ€ = 2.33
Decision: Since | zcal |> | ztab | So H0 is rejected and H1 is accepted.
Conclusion:
41
Example 3: A cellphone provider has the business objective of wanting to
determine the proportion of subscribers who would upgrade to a new
cellphone with improved features if it were made available at a substantially
reduced cost. Data are collected from a random sample of 500 subscribers.
The results indicate that 135 of the subscribers would upgrade to a new
cellphone at a reduced cost. At the 0.05 level of significance, is there evidence
that more than 20% of the customers would upgrade to a new cellphone at a
reduced cost?
Solution:
With usual notations, n = 500 , x = 135 , p =
π‘₯
𝑛
=
135
500
= 0.27, P = 0.20 , Q = 0.80
Setting of Hypothesis:
H0: P =0.20 (20% of the customers would upgrade to a new cellphone at a
reduced cost
H1: P>0.20 [Right tailed Test](more than 20% of the customers would upgrade
to a new cellphone at a reduced cost)
Test Statistics: Zcal =
𝑝 βˆ’ 𝑃
𝑃𝑄
𝑛
=
0.27βˆ’0.20
0.20 Γ—0.80
500
= 3.91
Level of Significance(𝜢): 0.05
Critical value: Ztab = 1.645 Decision: Since Zcal > Ztab so H0 is rejected
42
43
Procedure
Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis.
H0:π‘·πŸ = π‘·πŸi. e. the sample has been drawn from a population with proportion P.
H1: π‘·πŸ β‰  π‘·πŸ(two tailed test), H1 :π‘·πŸ > π‘·πŸ(right tailed test), H1 :π‘·πŸ < π‘·πŸ(left tailed test)
If First sample proportion (p1) > Second sample proportion (p2) : Right Tailed
If First sample proportion (p1) < Second sample proportion (p2): Left Tailed
Step 2: Select test statistic: 𝑍 =
𝑝1 βˆ’ 𝑝2
𝑃1𝑄1
𝑛1
+
𝑃2𝑄2
𝑛2
π‘œπ‘Ÿ 𝑍 =
(𝑝1βˆ’ 𝑝2)
ΰ· 
𝑃 ΰ· 
𝑄
1
𝑛1
+
1
𝑛2
Note: If P1 and P2 are not known, each of them is estimated by ΰ·‘
P =
π‘₯1+π‘₯2
𝑛1+𝑛2
=
n1p1+n2p2
n1+ n2
ΰ·‘
Q = 1 - ΰ·‘
P
Step 3: Fix the level of significance (𝜢)
Step 4: Write the critical value (Z ) for 𝛼 % level of significance from the Z table.
Step 5: Make decision:
If Η€ZΗ€(calculated) ≀ ZΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted.
If Η€ZΗ€(calculated) > ZΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted.
Step 6: Write Conclusion:
Case IV: Test of Significance for Difference of Two Proportions
Example: An online survey asked 1,000 adults β€œWhat do you buy from your
mobile device?”. Both sample sizes were 500 and that 195 out of 500 males and
305 out of 500 females reported they buy clothing from their mobile device. Is
there evidence of a difference between males and females in the proportion who
said they buy clothing from their mobile device at the 0.01 level of significance?
Solution: With usual notations, For Male: 𝑛1= 500 , 𝑝1=
195
500
=0.39
For Female: 𝑛2= 500 , 𝑝2=
305
500
=0.61
Setting of Hypothesis:
H0: 𝑃1 = 𝑃2 i. e. there is no significant difference between males and females in the
proportion of clothes buying from mobile device.
H1: 𝑃1 β‰  𝑃2 (Two tailed test).
Test Statistics: Zcal =
p1βˆ’ p2
ΰ·‘
Pΰ·‘
Q
1
n1
+
1
n2
ΰ·‘
P =
n1p1+n2p2
n1+n2
=
500Γ—0.39+500Γ—0.61
500+500
= 0.5
ΰ·‘
Q = 1 βˆ’ 0.5 = 0.5
Hence Zcal = -6.957
Level of significance(𝜢) = 0.01 (1%)
Critical value: Ztab= 2.58 Decision:
44
45
Example: At a certain date in a large city 400 out of a random sample of 500
men were found to be smokers. After the tax on tobacco had been heavily
increased another random sample of 600 men in the same city included 400
smokers. Was the observed decrease in the proportion of the smokers
significant?
Solution:
With usual notations,
Setting of Hypothesis:
β€’ Null Hypothesis (H0): 𝑃1 = 𝑃2 i. e. there is no significant difference
between the proportion of the smokers before and after the increase in
the tax.
β€’ Alternative Hypothesis (H1): 𝑃1 > 𝑃2 (right tailed test). There is a
significant decrease in the proportion of the smokers after the increase in
the tax.
Before Tax increased After Tax increased
𝑛1= 500 𝑛2= 600
𝑝1=
π‘₯1
𝑛1
=
400
500
= 0.8 𝑝2=
π‘₯2
𝑛2
=
400
600
= 0.667
Test statistic: Zcal =
p1βˆ’ p2
ΰ·‘
Pΰ·‘
Q
1
n1
+
1
n2
ΰ·‘
P =
n1p1 + n2p2
n1 + n2
=
500 x 0.8 + 600 x 0.667
500 + 600
=
400 + 400
1100
=
800
1100
=
8
11
& ΰ·‘
Q = 1 βˆ’
8
11
=
3
11
Zcal =
p1βˆ’ p2
ΰ·‘
Pΰ·‘
Q
1
n1
+
1
n2
=
0.8 βˆ’ 0.667
8
11
π‘₯
3
11
1
500
+
1
600
=
0.133
0.027
= 4.926
Level of significance (Ξ±) = 0.05,
Critical value: then tabulated value of Z(Ξ± = 0.05) for right tailed Ztab = 1.645
Decision: Since Zcal = 4.926 > Ztab = 1.645 then, H0 is rejected i. e. Calculated
value of Z is greater than tabulated value of Z at 0.05 level of significance.
Then H0 is rejected.
Conclusion: Then there is a significant decrease in the proportion of the
smokers after the increase in the tax.
46
Example: A firm found with the help of a sample survey of a city (size of
sample 900) that
πŸ‘
πŸ’
𝒕𝒉
of them consumes things produced by them. The firm
then advertised the goods in paper and on radio. After one year a sample of
size 1000 reveals that proportion of consumers of the goods produced by
the firm is
πŸ’
πŸ“
𝒕𝒉
. Is this rise significant to indicate that the advertisement was
effective?
Solution:
With usual notations
Setting of Hypothesis:
H0: 𝑃1 = 𝑃2 The advertisement was not effective
H1: 𝑃1 < 𝑃2 (Left tailed test). The advertisement was effective.
Test Statistics: Zcal =
p1βˆ’ p2
ΰ·‘
Pΰ·‘
Q
1
n1
+
1
n2
ΰ·‘
P =
𝑛1𝑝1+𝑛2𝑝2
𝑛1+𝑛2
=
900Γ—
3
4
+(1000Γ—
4
5
)
900+1000
= 0.77
ΰ·‘
Q = 1 - ΰ·‘
P = 1 – 0.77 = 0.23
Hence Zcal = -2.59 |π‘π‘‘π‘Žπ‘| = 2.59
Level of Significance (𝜢) = 5%
Critical Value: Ztab = -1.645, |π‘π‘‘π‘Žπ‘| = 1.645
Decision: H0 is rejected and H1 is accepted. 47
Before Ad After Ad
𝑛1= 900 𝑛2= 1000
𝑝1=
3
4
𝑝2=
4
5
t-test
Student’s β€˜t’: Definition
β€’ If x1, x2,….,xn is a random sample of size β€˜n’ from a
normal population with mean ΞΌ and population
variance Οƒ2, then Student’s β€˜t’ statistic is defined as:
β€’ 𝐭 =
ΰ΄₯
𝐗 βˆ’ 𝛍
ΰ΅—
𝐒
𝐧
=
(ΰ΄₯
𝐗 βˆ’ 𝛍)
ΰ΅—
π’πŸ
𝐧
=
𝑛(ΰ΄₯
X βˆ’ ΞΌ)
S
~𝑑(π‘›βˆ’1) … … (𝟏)
β€’ Where: ΰ΄€
x =
Οƒ X
n
, is the sample mean and S2 =
1
π‘›βˆ’1
Οƒ(π‘₯ βˆ’ π‘₯)2 = unbiased estimate of the population
variance Οƒ2
.
β€’ Note: β€˜t’ defined in the (1) follows Student’s β€˜t’-
distribution with (n – 1) degrees of freedom
β€’ Degrees of freedom is defined as number of
independent observations.
Assumptions
1. The parent population from which the sample is
drawn is normal.
2. Sample size is generally less than or equals to 30.
(which is considered as general traditional
convention)
3. All the observations in the samples are independent.
4. Samples are drawn randomly from normal
populations.
5. Population standard deviation (Οƒ) is unknown.
6. The sample values are correctly measured and
recorded.
7. The hypothetical value of population mean (πœ‡0) of πœ‡
is correct values of the population mean.
β€˜t’ distribution
t = 0
-1
-3 -2
f(t)
Normal curve
t- distribution for n = 25
+2 +3
+1
Fig. 1
t- distribution for n = 5
Applications of t-test
(i) Test of significance of a single mean
(ii) Test of significance of the difference
between two independent means
(iii)Paired t test for difference of two means
(iv)Test of significance of an observed sample
correlation coefficient and regression
coefficients
Case I: Test of significance of a single mean
Procedure
Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis.
H0: πœ‡ = πœ‡0
H1: πœ‡ β‰  πœ‡0 (Two tailed test), : πœ‡ > πœ‡0 (Right tailed test), : πœ‡ < πœ‡0 (Left tailed test)
If sample mean ( ΰ΄€
𝑋) > Population mean (πœ‡) : Right Tailed Test
If sample mean ( ΰ΄€
𝑋) < Population mean (πœ‡) : Left Tailed Test
Step 2: Select test statistic:
t =
ΰ΄₯
X βˆ’ ΞΌ
ΰ΅—
S
n
=
n (ΰ΄₯
X βˆ’ ΞΌ)
S
[π‘“π‘œπ‘Ÿ π‘’π‘›π‘π‘–π‘Žπ‘ π‘’π‘‘( π΄π‘π‘‘π‘’π‘Žπ‘™ π‘‘π‘Žπ‘‘π‘Ž 𝑖𝑠 𝑔𝑖𝑣𝑒𝑛)]
t=
ΰ΄₯
X βˆ’ ΞΌ
ΰ΅—
s
nβˆ’1
=
(nβˆ’1) (ΰ΄₯
X βˆ’ ΞΌ)
s
[π‘“π‘œπ‘Ÿ π‘π‘–π‘Žπ‘ π‘’π‘‘ (π‘ π‘Žπ‘šπ‘π‘™π‘’ 𝑠. 𝑑. 𝑖𝑠 𝑔𝑖𝑣𝑒𝑛)]
Step 3: Fix the level of significance (𝜢)
Step 4: degrees of freedom(d.f.) = (n – 1)
Step 5: Write the critical value (t ) for 𝛼 % level of significance from the t- table.
Step 6: Make decision: Critical value approach:
If Η€tΗ€(calculated) ≀ tΞ±(table or critical ) : it is not significant (insignificant). So Hβ‚€ is accepted.
If Η€tΗ€(calculated) > tΞ±(table or critical) : it is significant. So Hβ‚€ is rejected, then H₁ is accepted.
Step 7: Write Conclusion:
Computation of S2 for Numerical Problems:
β€’ 𝑆2 =
1
(π‘›βˆ’1)
Οƒ(π‘₯ βˆ’ π‘₯ )2 where Οƒ(π‘₯ βˆ’ π‘₯ )2= sum of the squares of
deviation taken from mean
β€’ S2 =
1
(nβˆ’ 1)
Οƒ π‘₯2 βˆ’
Οƒ π‘₯ 2
𝑛
β€’ 𝑆2
=
1
(π‘›βˆ’1)
Οƒ 𝑑2
βˆ’
Οƒ 𝑑 2
𝑛
where π‘₯ = 𝐴 +
Οƒ 𝑑
𝑛
How can we see critical value:
d.f.
Level of significance for one tailed test
0.10 0.05 0.025 0.01 0.005 0.0005
Level of significance for two tailed test
0.20 0.10 0.05 0.02 0.01 0.001
1 3.078 6.314 12.706 31.821 63.657 636.619
24 2.064
Confidence intervals using β€˜t’ or Confidence interval for
Small Samples
β€’ Confidence intervals = estimator Β± (reliability
constant) x (standard error of the estimate).
β€’ Reliability coefficient is obtained from the table of t-
distribution.
β€’ C. I. (ΞΌ) = ΰ΄€
𝐱 Β± 𝐭𝜢,π’βˆ’πŸ.
𝐒
𝐧
. [ For unbiased estimator,
When actual data is given]
β€’ C. I. (ΞΌ) = ΰ΄€
𝐱 Β± 𝐭𝜢,π’βˆ’πŸ.
𝐬
π§βˆ’πŸ
. [ For biased estimator
when sample s.d. or variance is given]
Example: The average cost of a hotel room in New York is said to be $168 per
night. To determine if this is true, a random sample of 25 hotels is taken and
resulted in average of $172.50 with standard deviation of $15.40. Test the
appropriate hypothesis at 0.05 level of significance.
Solution: With usual notations: πœ‡ =$ 168 , n = 25 , ΰ΄€
𝑋= $ 172.50 , s = $ 15.40
Formulation of Hypothesis:
H0: ΞΌ = $ 168 . The average cost of a hotel room in New York is $168 per night.
H1: ΞΌ ο‚Ή 168 (Two tailed test). The average cost of a hotel room in New York is not
equals to $168 per night.
Test statistic: Under H0 ,the test statistics is 𝑑 =
ΰ΄€
𝑋 βˆ’ πœ‡
ΰ΅—
𝑆
π‘›βˆ’1
=
172.50 βˆ’ 168
ΰ΅—
15.40
25βˆ’1
= 1.43 tcal =1.43
Level of significance(𝜢) = 0.05
Degree of freedom(d.f.) = n-1 = 25-1 = 24
Critical value: The tabulated value of t for two tailed test at 0.05 level of significance
with 24 d.f. is 2.064 i.e. ttab = 2.064
Decision: Since tcal < ttab so H0 is accepted.
Conclusion: Hence we conclude that the average cost of a hotel room in New York is
$168 per night
Contd..
95% confidence interval for population mean:
n= 25, ΰ΄€
𝑋 = $ 172.50 , s = $ 15.40
1- 𝛼 = 0.95 , 𝛼 = 0.05 d.f. = n-1 = 25-1 = 24
𝑑𝛼,π‘›βˆ’1= 𝑑0.05,24 = 2.064
95% CI for Population mean( πœ‡) is
= ΰ΄€
𝑋 Β± 𝑑𝛼,π‘›βˆ’1 .
𝑠
π‘›βˆ’1
= 172.50 Β± 2.064 .
15.40
25βˆ’1
= 172.50 Β± 6.48
= (172.50 – 6.48 , 172.50 + 6.48)
= (166.01 , 178.98)
Example : The following are weight (gm.) values for
sample of 21 chocolates. Can we conclude from these
data that the mean of the population from which the
sample was drawn is greater than 14.0? Use Ξ± = 0.05.
14.5 12.9 14.0 16.1 12.0 17.5 14.1
12.9 17.9 12.0 16.4 24.2 12.2 14.4
17.0 10.0 18.5 20.8 16.2 14.9 19.6
Solution:
Calculation of S2:
X X2
14.5 210.25
10.9 166.41
14.0 196.00
16.1 259.21
12.0 144.00
17.5 306.25
14.1 198.81
12.9 166.41
17.9 320.41
12.0 144.00
16.4 268.96
24.2 585.64
12.2 148.84
14.4 207.36
17.0 289.00
10.0 100.00
18.5 342.25
20.8 432.64
16.2 262.44
14.9 222.01
19.6 384.16
Ξ£X = 328.1 Ξ£X2 = 5355.05
β€’ Now 𝑆2 =
1
(π‘›βˆ’1)
Οƒ 𝑋2 βˆ’
(Οƒ 𝑋)
2
𝑛
=
1
21βˆ’1
5355.05 βˆ’
328.1 2
21
= 11.44
β€’ ∴ 𝑠 = 11.44 = 3.38
β€’ Sample size (n) = 21, sample mean (π‘₯) =
Οƒ 𝑋
𝑛
=328.1/21 = 15.62, sample
standard deviation (s) = 3.38,, population mean (ΞΌ) = 14,
β€’ Hypothesis setting:
β€’ Null Hypothesis: H0:ΞΌ = 14 gm i. e.
β€’ Alternative hypothesis: H1: ΞΌ > 14 gm(Right tailed test) i. e.
β€’ Test statistic: 𝑑 =
ΰ΄₯
X βˆ’ ΞΌ
ΰ΅—
S
n
=
15.62βˆ’14
ΰ΅—
3.38
21
=
1.62
Ξ€
3.38
4.58
=
1.62
0.73
= 2.21
β€’ Level of significance(𝜢 )= 0.05
β€’ Degree of freedom(d.f.) = n -1 = 21 – 1 = 20
β€’ Critical value: 𝑑𝛼=0.05 π‘“π‘œπ‘Ÿ π‘œπ‘›π‘’ π‘‘π‘Žπ‘–π‘™π‘’π‘‘ ,20 𝑑𝑓 = 1.725
β€’ Decision: since π‘‘π‘π‘Žπ‘™ = 2.21 > 𝑑𝛼 = 1.725 then the test is significant so
we reject hull hypothesis and then accept alternative hypothesis (H1).
β€’ Conclusion:
Case II: Test of significance of difference between two independent
means (Pooled t- test) independent t -test
Procedure
Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis.
H0: 𝝁𝟏 = 𝝁𝟐 i. e. there is no significant difference between the two population means
H1: 𝝁𝟏 β‰  𝝁𝟐 (two tailed test), H1: 𝝁𝟏 > 𝝁𝟐 (right tailed test), H1:𝝁𝟏 < 𝝁𝟐(left tailed test)
If First sample mean(ΰ΄₯
X1) > Second sample mean(ΰ΄₯
X2) : Right Tailed Test
If First sample mean(ΰ΄₯
X1) < Second sample mean(ΰ΄₯
X2) : Left Tailed Test
Step 2: Test Statistics , t =
(𝑋1 βˆ’ 𝑋2 )
𝑆2 1
𝑛1
+
1
𝑛2
=
(𝑋1 βˆ’ 𝑋2 )
𝑆
1
𝑛1
+
1
𝑛2
~𝑑(𝑛1+𝑛2βˆ’2)
Step 3: Fix the level of significance (𝜢)
Step 4: Degrees of freedom(d.f.) = (𝑛1 + 𝑛2 βˆ’ 2)
Step 5: Write the critical value (t ) for 𝛼 % level of significance at (n1 + n2 – 2) d. f. from the t- table.
Step 6: Make decision:
Critical value approach:
If Η€tΗ€(calculated) ≀ tΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted.
If Η€tΗ€(calculated) > tΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is
accepted.
Step 7: Conclusion:
Calculation of S2
β€’ Actual mean method: 𝑆2
=
1
(𝑛1+ 𝑛2βˆ’2)
[Οƒ(𝑋1 βˆ’ 𝑋1)2
+ Οƒ(𝑋2 βˆ’ 𝑋2)2
]
β€’ Direct method: 𝑆2 =
1
(𝑛1+ 𝑛2βˆ’2)
ቂ
ቃ
Οƒ 𝑋1
2
βˆ’
Οƒ 𝑋1
2
𝑛1
+ Οƒ 𝑋2
2
βˆ’
Οƒ 𝑋2
2
𝑛2
β€’ Short cut method: 𝑆2
=
1
(𝑛1+ 𝑛2βˆ’2)
ቂ
ቃ
Οƒ 𝑑1
2
βˆ’
Οƒ 𝑑1
2
𝑛1
+
Οƒ 𝑑2
2
βˆ’
Οƒ 𝑑2
2
𝑛2
β€’ Where d1 = X – A, d2 = X – B, A = assumed mean of series X,
B = assumed mean of series Y.
β€’ π‘ΊπŸ
=
π’πŸ.π’”πŸ
𝟐+ π’πŸ.π’”πŸ
𝟐
(π’πŸ+ π’πŸβˆ’πŸ)
[ when the sample variance (or sample
standard deviation) i. e. biased estimates are given]
Example: Two different types of drugs D1 and D2 were
administrated on certain patients for increasing weight at
interval of one week time period. From the following
observation, can you conclude that Test at 5% level of
significance.
i. Whether two drugs differ significantly?
ii. The second drug is more effective in increasing
weight.
D1 8 12 13 9 3 8 10 9
D2 10 8 12 15 6 11 12 12
Solution
Calculation of S2
X1 X2 X1
2 X2
2
8 10 64 100
12 8 144 64
13 12 169 144
9 15 81 225
3 6 9 36
8 11 64 121
10 12 100 144
9 12 81 144
Ξ£X1 = 72 Ξ£X2 = 86 Ξ£X1
2 = 712 Ξ£X2
2 = 978
β€’ Here n1 = n2 = 8 then π‘₯1= 72/8 = 9, π‘₯2= 86/8= 10.75
β€’ 𝑆2 =
1
𝑛1+ 𝑛2βˆ’2
Οƒ 𝑋1
2
βˆ’
(Οƒ 𝑋1)2
𝑛1
+ Οƒ 𝑋2
2
βˆ’
(Οƒ 𝑋2)2
𝑛2
β€’ =
1
8+ 8βˆ’2
712 βˆ’
(712)2
8
+ 978 βˆ’
(86)2
8
= 8.4
Contd…
(i) Setting of Hypothesis:
H0: ΞΌ1 = ΞΌ2 (i. e. both drugs are equally effective)
H1: ΞΌ1 β‰  ΞΌ2 (i. e. both drugs are not equally effective)
Test statistic: 𝑑 =
ΰ΄€
𝑋1βˆ’ ΰ΄€
𝑋2
𝑆2 1
𝑛1
+
1
𝑛2
=
9 βˆ’ 10.75
8.4
1
8
+
1
8
= βˆ’1.208
then π‘‘π‘π‘Žπ‘™ = βˆ’1.208 = 1.208
Level of significance(𝜢) = 0.05
Degree of freedom(d.f.) = (𝑛1 + 𝑛2 βˆ’ 2) = 8+8- 2 = 14
Critical value:
Tabulated value of t for two tailed test at 0.05 level of
significance with 14 d.f. is 2.145 i.e. π‘‘π‘‘π‘Žπ‘= 2.145
Decision: π‘‘π‘π‘Žπ‘™ = 1.208 < π‘‘π‘‘π‘Žπ‘ = 2.145 so we accept H0 .
Conclusion: we can conclude that both drugs are equally
effective.
Contd…
(ii) Setting of Hypothesis:
H0: ΞΌ1 = ΞΌ2 (i. e. both drugs are equally effective)
H1: ΞΌ1 < ΞΌ2 (i. e. second drug is effective than first drug)
Test statistic: 𝑑 =
ΰ΄€
𝑋1βˆ’ ΰ΄€
𝑋2
𝑆𝑝
2 1
𝑛1
+
1
𝑛2
=
9 βˆ’ 10.75
8.4
1
8
+
1
8
= βˆ’1.208
then π‘‘π‘π‘Žπ‘™ = βˆ’1.208 = 1.208
Level of significance(𝜢) = 0.05
Critical value: Tabulated value of t for two tailed test at 0.05
level of significance with 14 d.f. is 2.145 i.e. π‘‘π‘‘π‘Žπ‘= 1.761
Decision: π‘‘π‘π‘Žπ‘™ = 1.208 < π‘‘π‘‘π‘Žπ‘ =1.761 so we accept H0 .
Conclusion: we can conclude that both drugs are equally
effective
Example: Two salesmen A and B are working in a certain district. From a sample
survey conducted by the head office, the following results were obtained. State
whether there is any significant difference in the average sales between the two
salesmen.
Solution: With the usual notations, we have,
For A For B
𝑛1 = 20 𝑛2 = 18
π‘₯1 = 170 π‘₯2= 205
𝑠1 = 20 𝑠2 = 25
Setting of Hypothesis:
H0: ΞΌ1 = ΞΌ2 , i.e. there is no significant difference between the average sales of
two salesmen
H1: ΞΌ1 β‰  ΞΌ2 i.e. there is a significant difference between the average sales of two
salesmen.
Salesmen A Salesmen B
No of sales: 20 18
Average sales: 170 205
Standard deviation 20 25
Test statistic: under H0the test statistic is 𝑑 =
ΰ΄€
𝑋1βˆ’ ΰ΄€
𝑋2
𝑆2 1
𝑛1
+
1
𝑛2
𝑆2 =
𝑛1.𝑠1
2+ 𝑛2.𝑠2
2
(𝑛1+ 𝑛2βˆ’2)
=
𝑛1.𝑠1
2+ 𝑛2.𝑠2
2
(𝑛1+ 𝑛2βˆ’2)
= 534.75
t=
170βˆ’205
534.72
1
20
+
1
18
= -4.56 Hence tcal = 4.56
Level of significance( ) = 0.05
Degree of freedom(d.f.) = n1 + n2 – 2 = 20 + 18 – 2 = 36
Critical Value: Since d.f. =36 > 30, it follows normal distribution. Hence the
tabulated value of t at 5% level of significance for two tailed test is 1.96.
Decision: Since the calculated value of t is greater than the tabulated value of
t, it is significant, hence the null hypothesis is rejected i.e. alternative
hypothesis is accepted, which means that there is a significant difference
between the average sales of two salesmen.
When Z-test and When t-test is
suitable?
β€’ In case of test of significance of single mean
and double(two independent means)
β€’ You can use t test if population standard
deviation is not given(unknown) in case of
sample size greater than 30
β€’ Use tabulated value from table (also in t-table)
β€’ See next example
β€’ Two independent samples z-test:
β€’ Large sample size (typically > 30)
β€’ Known population standard deviation
β€’ Normally distributed population
β€’ Two independent samples (unpaired)
t-test:
β€’ Small sample size (typically < 30)
β€’ Unknown population standard deviation
β€’ Population may not be normally distributed
Question: Read the following information perform the following:
(i) Test whether two means are significantly difference at 0.05
level of significance using independent t-test.
(ii) Compute 95% confidence interval estimation for the
difference of means
(iii) Show the linkage between testing of hypothesis and interval
estimation in this problem
Group I Group II
Sample means 10 15
Sample standard deviation 3 5
Sample size 49 64
Solution:
β€’ (i) as same process solving above
β€’ (ii) C. I.
β€’ (π‘ΏπŸ βˆ’ π‘ΏπŸ) Β± t𝛂, π’πŸ + π’πŸ βˆ’ 𝟐 βˆ— π’”πŸ(
𝟏
π’πŸ
+
𝟏
π’πŸ
)
β€’ 𝑆2
=
𝑛1.𝑠1
2+ 𝑛2.𝑠2
2
(𝑛1+ 𝑛2βˆ’2)
see ttab two tailed test
β€’ (iii) if hypothetical value of mean difference
zero (0) lies between confidence interval we
accept null hypothesis otherwise reject
Case III : Paired t-test for difference of two means
Procedure
Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis.
H0: 𝝁𝒙 = ππ’š or 𝝁𝒅 = 𝟎 i. e. there is no significant difference between the two population means
H1: 𝝁𝒙 β‰  ππ’š or 𝝁𝒅 β‰  𝟎 (two tailed test), : 𝝁𝒙 > ππ’š (right tailed test), :𝝁𝒙 < ππ’š(left tailed test)
Step 2: Select test statistic:𝒕 =
ΰ΄₯
𝒅
ΰ΅—
𝒔𝒅
𝟐
𝒏
~𝒕(π’βˆ’πŸ) Where d = X – Y
X = value before treatment Y = value after treatment
Treatment means performing advertisement for promoting sales of goods or performing, training for
increasing memory capacity of person etc…..
𝑑 =
Οƒ 𝑑
𝑛
π‘Žπ‘›π‘‘ 𝑠𝑑
2
=
Οƒ(π‘‘βˆ’ ΰ΄€
𝑑)2
(π‘›βˆ’1)
=
1
(π‘›βˆ’1)
Οƒ 𝑑2
βˆ’
Οƒ 𝑑 2
𝑛
Step 3: Fix the level of significance (𝜢)
Step 4: degrees of freedom = (n – 1)
Step 5: Write the critical value (t ) for 𝛼 % level of significance and (n – 1)d. f. from the t- table.
Step 6: Make decision:
Critical value approach:
If Η€tΗ€(calculated) ≀ tΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is
accepted.
If Η€tΗ€(calculated) > tΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is
accepted.
Example: A company has reorganized its sale department. The following data
show the weekly sales figures (in Rs. lakhs) before and after the reorganization.
Can we say that sales have significantly changed due to the reorganization?
In this problem, the sales before reorganization (x) and after reorganization (y) are
not independent, but are paired together. Hence we shall apply paired t-test.
Setting of Hypothesis:
H0: πœ‡π‘₯ = πœ‡π‘¦ i.e. there is no significant change in sale after the reorganization.
H1: πœ‡π‘₯ β‰  πœ‡π‘¦ (Two tailed test), , i.e. there is a significant change in sale after the
reorganization.
Test Statistics: Under H0 , the test statistics is
𝑑 =
ΰ΄€
𝑑
ΰ΅—
𝑆2
𝑛
Week 1 2 3 4 5 6 7 8 9 10
Sales before: 12 15 13 11 17 15 10 11 18 19
Sales after: 16 17 14 13 15 14 12 11 17 22
𝑑 =
Οƒ 𝑑
𝑛
=
βˆ’10
10
= -1
s2 =
1
(π‘›βˆ’1)
Οƒ 𝑑2 βˆ’
Οƒ 𝑑 2
𝑛
=
1
(10βˆ’1)
44 βˆ’
βˆ’10 2
10
= 3.778
Wee
k
Sales (before
reorganization)
(x)
Sales (after
reorganization)
(y)
d= X - Y d2
1
2
3
4
5
6
7
8
9
10
12
15
13
11
17
15
10
11
18
19
16
17
14
13
15
14
12
11
17
22
-4
-2
-1
-2
2
1
-2
0
1
- 3
16
4
1
4
4
1
4
0
1
9
Total Οƒ 𝑑 = -10 Οƒ 𝑑2 = 44
Hence , 𝑑 =
ΰ΄€
𝑑
ΰ΅—
𝑆2
𝑛
=
βˆ’1
Ξ€
3.778
10
= -1.629 , Η€tΗ€(calculated) = 1.629
Level of significance = 0.05
Degree of freedom (d.f.) = n-1 = 10-1 = 9
The critical (tabulated) value of t for 9 d.f. at 5% level of
significance for two tailed test 2.262.
Decision: Since calculated value of t is less than critical value of t,
it is not significant at 5% level of significance. Hence, the data do
not provide any evidence against the null hypothesis which may
be accepted. It may therefore be concluded that reorganization
has no effect on sales.
Calculations:
S. N. Change in score (d) (d)2
1 8 64
2 10 100
3 -2 4
4 0 0
5 -5 25
6 -1 1
7 9 81
8 12 144
9 6 36
10 5 25
Total Ξ£d = 42 Ξ£d2 = 480
Example 2: A drug is given to 10 patients, on the increment in their blood pressure
were recorded to be 8, 10, -2, 0, - 5, - 1, 9, 12, 6, 5. Test at 5% level of significance
whether the drug is effective for either increasing or decreasing the blood pressure.
Contd…
β€’ Number of observation (n) = 10,
β€’ Now 𝑑 =
Οƒ 𝑑
𝑛
=
42
10
= 4.2 and
β€’ 𝑆2
=
1
(π‘›βˆ’1)
Οƒ 𝑑2
βˆ’
Οƒ 𝑑 2
𝑛
=
1
9
480 βˆ’
(42)2
10
= 33.73
β€’ Null Hypothesis: H0: πœ‡π‘₯ = πœ‡π‘¦ i. e. there was no increase or decrease in blood
pressure before and after the treatment. The treatment was not effective
β€’ Alternative Hypothesis: H1: πœ‡π‘₯ β‰  πœ‡π‘¦ i. e. there was increase or decrease in blood
pressure before and after the treatment. The treatment was effective. (two tailed
test)
β€’ Test statistic: 𝑑 =
ΰ΄€
𝑑
ΰ΅—
𝑆2
𝑛
=
4.2
33.735
10
= 2.29
β€’ level of significance(𝜢)= 0.05
β€’ Degree of freedom(d.f.) = n - 1 = 10 – 1 = 9
β€’ Critical value of t for 9 d. f. and two tailed test 0.05 level of significance, 𝑑𝛼=0.05 =
2.262
β€’ Decision: π‘‘π‘π‘Žπ‘™ = 2.29 > 𝑑𝛼=0.05 = 2.262 then it is significant so we reject H0 and
accept H1.
β€’ Conclusion: The treatment was effective.
Case IV: Test of significance of an observed sample correlation coefficient
Let r be the observed sample correlation coefficient in a sample of n pairs of observations
from bivariate normal population. In order to test whether any correlation coefficient
between variables in population, t-test is applied.
Procedure:
Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis.
H0: 𝝆 = 0 i.e. Population correlation coefficient is zero or Variables are uncorrelated in the
populations
H1: 𝝆 β‰  𝟎 (two tailed test), variables are correlated in the population
H1 : 𝝆 > 𝟎 (right tailed test) , H1 :𝝆 < 𝟎(left tailed test)
Step 2: Select test statistic:𝑑 =
𝒓
πŸβˆ’π’“πŸ
. 𝒏 βˆ’ 𝟐
Step 3: Fix the level of significance (𝜢)
Step 4: degrees of freedom = (n – 2)
Step 5: critical value (t ) for 𝛼 % level of significance and (n – 2)d. f. from the t- table.
Step 6: Make decision:
Critical value approach:
If Η€tΗ€(calculated) ≀ tΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is
accepted.
If Η€tΗ€(calculated) > tΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is
accepted.
step 7: Conclusion:
Example: A random sample of 27 pairs of observations from a normal population
gives a correlation coefficient of 0.42. Is it likely that the variables in the
population are uncorrelated? Also compute 95% confidence limit for the
population correlation coefficient.
Solution: with usual notations, n = 27, r = 0.42
Setting of Hypothesis:
H0: 𝜌 = 0 i.e. Variables in the populations are not correlated .
H1: 𝜌 β‰  0 (Two tailed test) i.e. Variables in the populations are correlated .
Test Statistics: 𝑑 =
𝒓
πŸβˆ’π’“πŸ
. 𝒏 βˆ’ 𝟐 =
𝟎.πŸ’πŸ
πŸβˆ’πŸŽ.πŸ’πŸπŸ
. πŸπŸ• βˆ’ 𝟐 = 2.31
Level of significance(𝜢) = 0.05
Degree of freedom(d.f.) = n-2 = 27-2 = 25
Critical value: ttab = 2.060 Decision:
For 95% confidence limits:
1- 𝛼= 95%, 𝛼= 5% d.f. = 27-2 =25 𝑑𝛼,π‘›βˆ’2 = 2.060
Hence 95% Confidence limits for population correlation coefficient are
= r Β± 𝑑𝛼,π‘›βˆ’2 .
1βˆ’π‘Ÿ2
𝑛
= 0.42 Β± 2.060 .
1βˆ’0.422
27
= 0.42 Β±
THANK YOU

More Related Content

Similar to Testing of Hypothesis combined with tests.pdf

Statistical analysis
Statistical analysisStatistical analysis
Statistical analysisSuresh Sundar
Β 
Hyphotheses testing 6
Hyphotheses testing 6Hyphotheses testing 6
Hyphotheses testing 6Sundar B N
Β 
Test hypothesis & p-value
Test hypothesis & p-valueTest hypothesis & p-value
Test hypothesis & p-valueRaeesa Mukhtar
Β 
Telesidang 4 bab_8_9_10stst
Telesidang 4 bab_8_9_10ststTelesidang 4 bab_8_9_10stst
Telesidang 4 bab_8_9_10ststNor Ihsan
Β 
Hypothesis
HypothesisHypothesis
HypothesisMmedsc Hahm
Β 
Introduction-to-Hypothesis-Testing Explained in detail
Introduction-to-Hypothesis-Testing Explained in detailIntroduction-to-Hypothesis-Testing Explained in detail
Introduction-to-Hypothesis-Testing Explained in detailShriramKargaonkar
Β 
Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)Harve Abella
Β 
Lecture6 Applied Econometrics and Economic Modeling
Lecture6 Applied Econometrics and Economic ModelingLecture6 Applied Econometrics and Economic Modeling
Lecture6 Applied Econometrics and Economic Modelingstone55
Β 
20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhdHimanshuSharma723273
Β 
Formulating Hypotheses
Formulating Hypotheses Formulating Hypotheses
Formulating Hypotheses Shilpi Panchal
Β 
L hypo testing
L hypo testingL hypo testing
L hypo testingMmedsc Hahm
Β 

Similar to Testing of Hypothesis combined with tests.pdf (20)

Statistical analysis
Statistical analysisStatistical analysis
Statistical analysis
Β 
Hypo
HypoHypo
Hypo
Β 
Unit 3
Unit 3Unit 3
Unit 3
Β 
Hyphotheses testing 6
Hyphotheses testing 6Hyphotheses testing 6
Hyphotheses testing 6
Β 
Test hypothesis & p-value
Test hypothesis & p-valueTest hypothesis & p-value
Test hypothesis & p-value
Β 
Testing of hypothesis and Goodness of fit
Testing of hypothesis and Goodness of fitTesting of hypothesis and Goodness of fit
Testing of hypothesis and Goodness of fit
Β 
Formulatinghypotheses
Formulatinghypotheses Formulatinghypotheses
Formulatinghypotheses
Β 
Testing of hypothesis
Testing of hypothesisTesting of hypothesis
Testing of hypothesis
Β 
Telesidang 4 bab_8_9_10stst
Telesidang 4 bab_8_9_10ststTelesidang 4 bab_8_9_10stst
Telesidang 4 bab_8_9_10stst
Β 
Hypothesis
HypothesisHypothesis
Hypothesis
Β 
Introduction-to-Hypothesis-Testing Explained in detail
Introduction-to-Hypothesis-Testing Explained in detailIntroduction-to-Hypothesis-Testing Explained in detail
Introduction-to-Hypothesis-Testing Explained in detail
Β 
hypothesis.pptx
hypothesis.pptxhypothesis.pptx
hypothesis.pptx
Β 
Basics of Hypothesis Testing
Basics of Hypothesis TestingBasics of Hypothesis Testing
Basics of Hypothesis Testing
Β 
Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)Estimation and hypothesis testing 1 (graduate statistics2)
Estimation and hypothesis testing 1 (graduate statistics2)
Β 
Lecture6 Applied Econometrics and Economic Modeling
Lecture6 Applied Econometrics and Economic ModelingLecture6 Applied Econometrics and Economic Modeling
Lecture6 Applied Econometrics and Economic Modeling
Β 
20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd20200519073328de6dca404c.pdfkshhjejhehdhd
20200519073328de6dca404c.pdfkshhjejhehdhd
Β 
Formulating Hypotheses
Formulating Hypotheses Formulating Hypotheses
Formulating Hypotheses
Β 
Hypothesistesting2
Hypothesistesting2Hypothesistesting2
Hypothesistesting2
Β 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
Β 
L hypo testing
L hypo testingL hypo testing
L hypo testing
Β 

Recently uploaded

VIP Kolkata Call Girl Howrah πŸ‘‰ 8250192130 Available With Room
VIP Kolkata Call Girl Howrah πŸ‘‰ 8250192130  Available With RoomVIP Kolkata Call Girl Howrah πŸ‘‰ 8250192130  Available With Room
VIP Kolkata Call Girl Howrah πŸ‘‰ 8250192130 Available With Roomdivyansh0kumar0
Β 
Banana Powder Manufacturing Plant Project Report 2024 Edition.pptx
Banana Powder Manufacturing Plant Project Report 2024 Edition.pptxBanana Powder Manufacturing Plant Project Report 2024 Edition.pptx
Banana Powder Manufacturing Plant Project Report 2024 Edition.pptxgeorgebrinton95
Β 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
Β 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth MarketingShawn Pang
Β 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
Β 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
Β 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.Aaiza Hassan
Β 
Lean: From Theory to Practice β€” One City’s (and Library’s) Lean Story… Abridged
Lean: From Theory to Practice β€” One City’s (and Library’s) Lean Story… AbridgedLean: From Theory to Practice β€” One City’s (and Library’s) Lean Story… Abridged
Lean: From Theory to Practice β€” One City’s (and Library’s) Lean Story… AbridgedKaiNexus
Β 
2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis UsageNeil Kimberley
Β 
Marketing Management Business Plan_My Sweet Creations
Marketing Management Business Plan_My Sweet CreationsMarketing Management Business Plan_My Sweet Creations
Marketing Management Business Plan_My Sweet Creationsnakalysalcedo61
Β 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...Paul Menig
Β 
Call Girls in Mehrauli Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Call Girls in Mehrauli Delhi πŸ’―Call Us πŸ”8264348440πŸ”Call Girls in Mehrauli Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Call Girls in Mehrauli Delhi πŸ’―Call Us πŸ”8264348440πŸ”soniya singh
Β 
(8264348440) πŸ” Call Girls In Keshav Puram πŸ” Delhi NCR
(8264348440) πŸ” Call Girls In Keshav Puram πŸ” Delhi NCR(8264348440) πŸ” Call Girls In Keshav Puram πŸ” Delhi NCR
(8264348440) πŸ” Call Girls In Keshav Puram πŸ” Delhi NCRsoniya singh
Β 
Call Girls In Connaught Place Delhi ❀️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❀️88604**77959_Russian 100% Genuine Escor...Call Girls In Connaught Place Delhi ❀️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❀️88604**77959_Russian 100% Genuine Escor...lizamodels9
Β 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❀️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❀️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❀️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❀️8860477959 Escorts...lizamodels9
Β 
/:Call Girls In Indirapuram Ghaziabad βž₯9990211544 Independent Best Escorts In...
/:Call Girls In Indirapuram Ghaziabad βž₯9990211544 Independent Best Escorts In.../:Call Girls In Indirapuram Ghaziabad βž₯9990211544 Independent Best Escorts In...
/:Call Girls In Indirapuram Ghaziabad βž₯9990211544 Independent Best Escorts In...lizamodels9
Β 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear RegressionRavindra Nath Shukla
Β 
rishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfrishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfmuskan1121w
Β 
Lowrate Call Girls In Laxmi Nagar Delhi ❀️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❀️8860477959 Escorts 100% Genuine Ser...Lowrate Call Girls In Laxmi Nagar Delhi ❀️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❀️8860477959 Escorts 100% Genuine Ser...lizamodels9
Β 
Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdfOrient Homes
Β 

Recently uploaded (20)

VIP Kolkata Call Girl Howrah πŸ‘‰ 8250192130 Available With Room
VIP Kolkata Call Girl Howrah πŸ‘‰ 8250192130  Available With RoomVIP Kolkata Call Girl Howrah πŸ‘‰ 8250192130  Available With Room
VIP Kolkata Call Girl Howrah πŸ‘‰ 8250192130 Available With Room
Β 
Banana Powder Manufacturing Plant Project Report 2024 Edition.pptx
Banana Powder Manufacturing Plant Project Report 2024 Edition.pptxBanana Powder Manufacturing Plant Project Report 2024 Edition.pptx
Banana Powder Manufacturing Plant Project Report 2024 Edition.pptx
Β 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Β 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Β 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
Β 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
Β 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.
Β 
Lean: From Theory to Practice β€” One City’s (and Library’s) Lean Story… Abridged
Lean: From Theory to Practice β€” One City’s (and Library’s) Lean Story… AbridgedLean: From Theory to Practice β€” One City’s (and Library’s) Lean Story… Abridged
Lean: From Theory to Practice β€” One City’s (and Library’s) Lean Story… Abridged
Β 
2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage
Β 
Marketing Management Business Plan_My Sweet Creations
Marketing Management Business Plan_My Sweet CreationsMarketing Management Business Plan_My Sweet Creations
Marketing Management Business Plan_My Sweet Creations
Β 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...
Β 
Call Girls in Mehrauli Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Call Girls in Mehrauli Delhi πŸ’―Call Us πŸ”8264348440πŸ”Call Girls in Mehrauli Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Call Girls in Mehrauli Delhi πŸ’―Call Us πŸ”8264348440πŸ”
Β 
(8264348440) πŸ” Call Girls In Keshav Puram πŸ” Delhi NCR
(8264348440) πŸ” Call Girls In Keshav Puram πŸ” Delhi NCR(8264348440) πŸ” Call Girls In Keshav Puram πŸ” Delhi NCR
(8264348440) πŸ” Call Girls In Keshav Puram πŸ” Delhi NCR
Β 
Call Girls In Connaught Place Delhi ❀️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❀️88604**77959_Russian 100% Genuine Escor...Call Girls In Connaught Place Delhi ❀️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❀️88604**77959_Russian 100% Genuine Escor...
Β 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❀️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❀️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❀️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❀️8860477959 Escorts...
Β 
/:Call Girls In Indirapuram Ghaziabad βž₯9990211544 Independent Best Escorts In...
/:Call Girls In Indirapuram Ghaziabad βž₯9990211544 Independent Best Escorts In.../:Call Girls In Indirapuram Ghaziabad βž₯9990211544 Independent Best Escorts In...
/:Call Girls In Indirapuram Ghaziabad βž₯9990211544 Independent Best Escorts In...
Β 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear Regression
Β 
rishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfrishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdf
Β 
Lowrate Call Girls In Laxmi Nagar Delhi ❀️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❀️8860477959 Escorts 100% Genuine Ser...Lowrate Call Girls In Laxmi Nagar Delhi ❀️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❀️8860477959 Escorts 100% Genuine Ser...
Β 
Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdf
Β 

Testing of Hypothesis combined with tests.pdf

  • 1. B . S. Parajuli
  • 2. HYPOTHESIS AND ITS TESTING PROCEDURES B. S. Parajuli
  • 3.
  • 4. Hypothesis ➒An assumption that we make about the population parameter ➒A statistical hypothesis is a statement about population usually concerned with the value of one or more parameters of the population. i. logically conjectured relationship ii. Relationship between two or more variable iii. expressed in the form of testable statement(s) iv. Formal statement(s) / conjectured statement v. Variables (independent and dependent)
  • 5. ο‚’ The average monthly income of the people of this city is Rs. 17,000. i.e. 𝝁 = Rs. 17,000 ο‚’ The proportion of people with cell phones in this rural municipality is 0.62 i.e. P = 0.62 ο‚’ More than 20% of the adults watch children’s program in the television. ο‚’ Is the average waiting time for the customers of Smart Supermarket at the checkouts greater than 15 minutes? ο‚’ Is there any difference between the percentage of internet users between Kathmandu and Pokhara ?
  • 6. Purpose will identify independent variables influence upon dependent variable investigate a research problem enhances the objectivity of the study understand what data is to be collected increase the objectivity of research. concludes what is true or what is false
  • 7. Null Hypothesis: A statistical hypothesis or assumption made about the population parameter to testing its validity for the possible acceptance is called null hypothesis. It is also called hypothesis of no difference. It is denoted by H0. β€’ Always contains β€œ=” , β€œβ‰€β€ or β€œο‚³β€ sign Example : The mean age of MBM students of SMC is equal to 23 years (H0 ∢ 𝝁 = 23 years) Alternative Hypothesis: Any hypothesis which is complementary to the null hypothesis is called an alternative hypothesis and is denoted by H1. ο‚— Always contains β€œβ‰ β€ , β€œ>” or β€œ<” sign H1 ∢ 𝝁 β‰  23 years (TwoTailed) H1 ∢ 𝝁 > 23 years (RightTailed) H1 ∢ 𝝁 < 23 years(LeftTailed)
  • 8. To determine whether the test is one-tailed or two tailed depends upon the alternative hypothesis. If the alternative hypothesis is two tailed we apply two-tailed test and if the alternative hypothesis is one-tailed, we apply one-tailed test. Two Tailed Test : A test of statistical hypothesis in which the alternati ve hypothesis looks for the values are significantly different or not but the direction of the difference(more or less) is not specified. Key words like: ο‚— differs from, is not equal to, differs significantly, same, whether any difference, unbiased, whether equal to, not equals to
  • 9. One Tailed Test: A test of statistical hypothesis in which the alternative hypothesis H1 looks for a definite direction of the difference (more or less) with parameter is specified. Thus for the comparative study of the population parameter identifying the words such as: less than, more than, greater than, no more than, taller than, smaller than, better than , at least, at most , only, improved, superior, inferior, below, above, high, low, increase, decrease, better than, up to, majority, minority etc. After identifying one-tailed test, the left tailed test and right tailed tests can be identified in following way: i. If sample statistic < Population parameter- left tailed test ii. If sample statistic > Population parameter- Right tailed test iii. If first sample statistic < Second sample statistic- Left tailed test iv. If first sample statistic > Second sample statistic- Right tailed test
  • 10. Level of Significance: The maximum size of type I error that we are prepared to take risk is called the level of significance and is denoted by 𝛼. Symbolically it is defined as 𝛼 = P(type I error) = P (Rejecting a true null hypothesis) If we allow Ξ± = 5%, we are likely to reject 5 cases in making decision 100 times under the same conditions. Critical Region and Rejection Region: The region where the null hypothesis is rejected is called critical region or rejection region and the region for accepting true null hypothesis is called acceptance region. If the statistic belongs to the critical region, H0 is rejected and if the statistic belongs to the acceptance region H0 is accepted. The critical region is established on the basis of the choice of level of significance. If 𝛼 = 0.05(5%) then the total rejection area under the normality curve will be 0.05(5%) and the acceptance area under the normal probability curve will be 0.95(95%).
  • 11. Critical / Significant Value: The value which separates the acceptance region and the rejection region is called critical value or significant value. It depends upon (i) The level of significance used and (ii) The alternative hypothesis whether it is two tailed or one tailed (left or right ) test. Critical values Rejection Region /2 0 a /2 a Acceptance Region Two Tailed Test 0.025 0.025 0.95
  • 12. For large sample, the critical value of β€˜Z’ obtained from the statistical table similarly for small sample the critical value of β€˜t’ obtained from the statistical table. When the computed test statistic lies in the acceptance region, it is reasonable to accept the null hypothesis as it is believed to be probable true. Acceptance Region 0 a Critical values Rejection Region 0 a Acceptance Region RightTailedTest LeftTailedTest 0.95 0.95 0.05
  • 14. Contd… ➒ Example ➒ H0: Food habit does not affect blood pressure. ➒ H1: Food habits affects blood pressure. ➒ Food habit: independent variable ➒ Blood pressure: dependent variable. ➒ H0: Age does not affect blood pressure. ➒ H1: As age increases blood pressure increases. ➒ age: independent variable ➒ Blood pressure: dependent variable. ➒ H0: Regular exercise does not affect blood pressure. ➒ H1: Regular exercise may decrease blood pressure. ➒ regular exercise: independent variable ➒ blood pressure: dependent variable.
  • 15. Parametric and Non-parametric test ο‚— ParametricTest: i. Model specifies certain conditions about the parameters of the populations from which the samples are drawn ii. Implies that there is normal distribution iii. The data used in this test are continuous ο‚— Non- ParametricTest: i. does not depend on the particular form of the basic frequency function from which the samples are drawn ii. do not implies that there is a normal distribution iii. The data used in this test are frequency or ranks iv. measuremental data: data obtained by actual measurement.
  • 16. Parametric test Non-parametric test 1. Parametric tests specify certain conditions about the parameters of the population from which the sample has been drawn 1. Non-parametric test do not specify any condition about the population parameter of the population from which the sample has been drawn. 2. It is used in testing of hypothesis and estimation of parameters. 2. It is used in testing of hypothesis and but not estimation of parameters. 3. Parametric test are mostly applied only to the data which are measured in interval and ratio scale. 3. Non-parametric test are applied only to the data which are measured in nominal and ordinal data. 4. Parametric test are most powerful test. 4. Non-parametric test are less powerful test than parametric test. 5. It requires complicated sampling technique. 5. It does not require complicated sampling technique.
  • 17. Criteria of Good Hypothesis Statement β€’ Should be stated in declarative form β€’ Should articulate (describe) a relationship between two or more variables β€’ Should be testable empirically β€’ Useless if it cannot be tested empirically β€’ Should be limited in scope β€’ Should be clearly and precisely stated β€’ Should no ambiguity (vegness) in the variables β€’ Should state the conditions to apply. β€’ Context and study units must be clear β€’ Should reflect a guess at a solution β€’ Should outcome to a problem based upon some knowledge, previous research, or identified needs β€’ Should be consistent with most known facts
  • 18. Actual situation Decision from sample Accept Hβ‚€ Reject Hβ‚€ Hβ‚€ is true Correct decision No error Probability = 1 – Ξ± Wrong decision Type I error Probability = Ξ± (Not so harmful and can be corrected) Hβ‚€ is false (H₁ true) Wrong decision Type II error Probability = Ξ² (Degrees of impact of Type II error is very harmful and can’t be corrected) Correct decision No error Probability = 1 – Ξ² 1 – Ξ² = power of test Types of Errors inTesting of Hypothesis
  • 19. ➒ If a medicine is administered to a few patients of a particular disease to cure them and the medicine is curing the disease, but it is claimed that it has no effect and hence it is discontinued. This Type I error. ➒ If a medicine is administered to a few patients of a particular disease to cure them and the medicine is not curing the disease, but it is claimed that it have good effect and the treatment is continued. This type II error. Type I error : Rejecting the good lot of products (Producer’s risk) Type II error: Accepting the bad lot of products (Consumer’s risk)
  • 20.
  • 21.
  • 22. Steps in Hypothesis Testing a. Critical Value approach: Depends upon the size of sample and nature of data Use 𝛼 = 0.05 unless we are given Formulate the Null and Alternative Hypothesis Select the Right test statistic and compute test statistic Decide the level of significance Determine Critical value Decision: compare computed test statistic with critical value If Calculated Value ≀ Tabulated Value Yes No It is not Significant Accept HO It is significant Reject HO
  • 23. b. p-value approach: The p-value is the probability of getting the observed value of the computed test statistic or a value with even greater evidence against null hypothesis H0 if the null hypothesis is true. In other words, the observed value of level of significance for the testing of hypothesis assuming null hypothesis is true is called p- value. The p- value is the maximum rejection area under the normal probability curve under the null hypothesis H0, when the some test (or procedure) is applied. ➒ If p-value β‰₯ 𝛼 (level of significance) then accept null hypothesis (H0). ➒ If p-value < 𝛼 (level of significance) then accept alternative hypothesis (H1). Mostly p-value can be obtained from different statistical software based on the general output for the given problem.
  • 24. Finding p-value: (i) For Left Tailed Test: p = P( Z < Zcal) = 0.5 – P(0 < Z < Z cal) (ii) For Right Tailed Test p = P( Z >Zcal) = 0.5 – P(0 < Z < Z cal) (iii) For Two Tailed Test: p = P( Z < Zcal) + P( Z >Zcal) = 2 P(Z >Zcal) = 2[0.5 – P(0 < Z < Z cal)] Z = 0 Zcal Zcal Zcal Zcal Z = 0 Z = 0
  • 25. Two tailed test and Confidence interval β€’ If the reference value specified in H0 lies outside the interval (that is, is less than the lower bound or greater than the upper bound), you can reject H0. β€’ If the reference value specified in H0 lies within the interval (that is, is not less than the lower bound or greater than the upper bound), you fail to reject H0. If hypothesis value 𝐻0: πœ‡ = πœ‡0lies within calculated confidence interval estimate [ ΰ΄€ 𝑋 βˆ’ 𝑍 Ξ€ 𝛼 2 𝑆. 𝐸. ( ΰ΄€ 𝑋) < πœ‡0 < ΰ΄€ 𝑋 βˆ’ 𝑍 Ξ€ 𝛼 2 𝑆. 𝐸. ( ΰ΄€ 𝑋)] then null hypothesis is accepted otherwise rejected.
  • 27. 27 β€’ Z-test is important parametric test and based on normal probability distribution. When the samples are selected from population of known parameter with sample size more than 30 , Z test is used. We consider that if sample size is more than 30 then sample selected non-normal population is also approximately normally distributed. β€’ Z- test is defined as: 𝑍 = 𝑑 βˆ’ 𝐸(𝑑) 𝑆.𝐸.(𝑑) β€’ 𝑇𝑒𝑠𝑑 π‘ π‘‘π‘Žπ‘‘π‘–π‘ π‘‘π‘–π‘ = Statistic – Parameter Standard error Application of Z test: (i) Test of significance of single mean (ii) Test of significance of difference between two means (iii) Test of significance of single proportion (iv) Test of significance of difference between two proportions Introduction
  • 28. Critical value Critical values Level of significance (Ξ±) ZΞ± 1% 2% 5% 10% Two tailed test ǀ𝑍𝛼ǀ = 2.58 ǀ𝑍𝛼ǀ = 2.33 ǀ𝑍𝛼ǀ = 1.96 ǀ𝑍𝛼ǀ = 1.645 Right tailed test ǀ𝑍𝛼ǀ = 2.33 ǀ𝑍𝛼ǀ = 2.05 ǀ𝑍𝛼ǀ = 1.645 ǀ𝑍𝛼ǀ = 1.28 Left tailed test ǀ𝑍𝛼ǀ = βˆ’2.33 ǀ𝑍𝛼ǀ = βˆ’2.05 ǀ𝑍𝛼ǀ = βˆ’1.645 ǀ𝑍𝛼ǀ = βˆ’1.28 28
  • 29. Procedure Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis. H0: 𝝁 = 𝝁𝟎 i. e. there is no significant difference between the sample mean (𝑋) and the population mean H1: 𝝁 β‰  𝝁𝟎 (two tailed test), or H1 : 𝝁 > 𝝁𝟎 (right tailed test),or H1 : 𝝁 < 𝝁𝟎 (left tailed test) If sample mean( ΰ΄€ 𝑋) > Population mean (πœ‡ ) : Right Tailed Test If sample mean( ΰ΄€ 𝑋) < Population mean (πœ‡ ) : Left Tailed Test Step 2: Select test statistic: 𝒁 = ΰ΄₯ 𝑿 βˆ’ 𝝁 ΰ΅— 𝝈 𝒏 If 𝜎 is unknown for large samples ො 𝜎 = s Step 3: Fix the level of significance (𝛼) Step 4: Write the critical value (Z ) for 𝛼 % level of significance from the Z table. Step 5: Make decision: Critical value approach: If Η€ZΗ€(calculated) ≀ ZΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted. If Η€ZΗ€(calculated) > ZΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted. P-value approach: If p-value β‰₯ 𝛼 (level of significance) then accept null hypothesis (H0). If p-value < 𝛼 (level of significance) then accept alternative hypothesis (H1). 29 Case I: Test of Significance of Single Mean
  • 30. Example : A samples of 50 pieces of certain type of string was tested. The mean breaking strength turned out to be 14.5 kgs. Test whether the sample is from a batch of strings having a mean breaking strength of 15.6 kgs and standard deviation of 2.2 kgs. Here, Number of sample(n) = 50 Sample mean ( ΰ΄€ 𝑋) = 14.5 kgs. Population mean () = 15.6 kg Population standard deviation (𝜎) = 2.2 kgs Formulation of Hypothesis β€’ Null Hypothesis: Ho :  = 15.6 kgs i.e. the Population mean breaking strength of the strings is 15.6 kg. β€’ Alternative Hypothesis: H1 :  β‰ 15.6 kgs i.e. the mean breaking strength is not equal to 15.6 kg. (two tailed test) Test Statistics: Z = ΰ΄₯ 𝑿 βˆ’ 𝝁 ΰ΅— 𝝈 𝒏 = πŸπŸ’.πŸ“βˆ’ πŸπŸ“.πŸ” ΰ΅— 𝟐.𝟐 πŸ“πŸŽ = = - 3.55 | zcal | = | - 3.55 | = 3.55 Level of significance (a) = 5% Critical value: Tabulated value of z at 5% level of significance for two tailed test Ztab = 1.96 Decision: Since Zcal ο€Ύ Ztab , Ho is rejected i.e. H1 is accepted. Conclusion: Hence we conclude that the sample has not been drawn from the batch with mean breaking strength of 15.6 kgs and standard deviation 2.2 kgs. 30
  • 31. Example : In the past a blending process has produced an average of 5 Kg. of waste material for every batch with standard deviation 5 Kg. From a sample of 100 batches an average of 7 Kg. of waste per batch is obtained. At 5% level of significance is it reasonable to believe that the average has increased? Solution: with usual notations, n = 100, ΰ΄€ 𝑋= 7 kg,  = 5 kg,  = 5 kg Formulation of Hypothesis β€’ H0:  = 5 kg, i.e. the average of waste material has not been increased β€’ H1:  ο€Ύ 5 kg, i.e. the average has increased (Right tailed test) Test Statistics: Z = ΰ΄₯ 𝑿 βˆ’ 𝝁 ΰ΅— 𝝈 𝒏 = πŸ•βˆ’ πŸ“ ΰ΅— πŸ“ 𝟏𝟎𝟎 = 4 zcal = 4 Level of significance (a) = 5% Critical value: Tabulated value of z at 5% level of significance for right tailed test Ztab = 1.645 Decision: Since Zcal ο€Ύ Ztab , Ho is rejected i.e. H1 is accepted. Hence we conclude that the average waste material has been increased. 31
  • 32. Example : An insurance agent has claimed that the average age of the policy holders who have insured through him is less than the average age for all the agents, which is 30.55 years. A random sample of 100 policy holders who had insured through him gave the following age distribution. Calculate the arithmetic mean and standard deviation of this distribution and use these values to test his claim at 5% level of significance. Solution: 32 Class limits 16-20 21-25 26-30 31-35 36-40 Frequency 12 22 20 30 16 Age f x 𝒅′ = π’™βˆ’πŸπŸ– πŸ“ f𝒅′ fπ’…β€²πŸ 16-20 12 18 -2 -24 48 21-25 22 23 -1 -22 22 26-30 20 28 0 0 0 31-35 30 33 1 30 30 36-40 16 38 2 32 64 Ξ£fd’= 16 Ξ£fd’2=164
  • 33. xΜ„ = A + Ξ£fd’ n Γ— h = 28+ 16 100 Γ—5 = 28 + 0.08 = 28.8 years 𝑠 = Οƒ 𝑓d’2 𝑛 βˆ’ ( Ξ£fd’ n )2 Γ— h = 164 100 βˆ’ ( 16 100 )2 Γ— 5 = 6.352 years Setting of Hypothesis H0: 𝝁 = 30.5 years i.e. the average age of the policy holders for all agents is 30.5 years. H1: 𝝁 < 30.5 years (left tailed test) i.e. the average age of the policy holders who have insured through him is less than the average age for all the agents. Test Statistics: Under H0, the test statistic is Z = ΰ΄₯ 𝑿 βˆ’ 𝝁 ΰ΅— 𝝈 𝒏 = ΰ΄₯ 𝑿 βˆ’ 𝝁 ΰ΅— 𝒔 𝒏 = πŸπŸ–.πŸ– βˆ’ πŸ‘πŸŽ.πŸ“ ΰ΅— πŸ”.πŸ‘πŸ“πŸ‘ 𝟏𝟎𝟎 = -2.68 | zcal | = | - 2.68 | = 2.68 Level of significance (a) = 5% Critical value: Tabulated value of z at 5% level of significance for left tailed test Ztab = -1.645 i.e. | ztab | = 1.645 Decision: Since | zcal |> | ztab | , So Ho is rejected i.e. H1 is accepted. Hence we conclude that the average age of the policy holders who have insured through him is less than the average age of the policy holders of all the agents. 33
  • 34. Case II: Test of Significance of Difference between Two Means 34 Procedure Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis. H0:𝛍₁ = 𝛍₂ i. e. Two samples have been drawn from the sample parent population H1:𝛍₁ β‰  𝛍₂(two tailed test), H1:μ₁ > ΞΌβ‚‚(right tailed test), H1:μ₁ < ΞΌβ‚‚(left tailed test) If First sample mean(ΰ΄₯ X1) > Second sample mean(ΰ΄₯ X2) : Right Tailed Test If First sample mean(ΰ΄₯ X1) < Second sample mean(ΰ΄₯ X2) : Left Tailed Test Step 2: Select test statistic: Z = ΰ΄₯ X1βˆ’ ΰ΄₯ X2 Οƒ1 2 n1 + Οƒ2 2 n2 π‘œπ‘Ÿ Z = (ΰ΄₯ X1βˆ’ΰ΄₯ X2) s1 2 n1 + s2 2 n2 π‘œπ‘ŸZ = (ΰ΄₯ X1βˆ’ ΰ΄₯ X2) 𝜎2 1 n1 + 1 n2 Step 3: Fix the level of significance (𝜢) Step 4: Write the critical value (Z ) for 𝛼 % level of significance from the Z table. Step 5: Make decision: Critical value approach: If Η€ZΗ€(calculated) ≀ ZΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted. If Η€ZΗ€(calculated) > ZΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted. Step 6: Write Conclusion:
  • 35. Example: An examination was given to 50 students at college A and 60 students at college B. At college A, the average marks of the students was found to be 75 with a standard deviation of 9. At college B, the average marks of the students was 79 with a standard deviation of 7. Is there a significant difference between the average marks of the students at these two colleges? Solution: with usual notations, College A College B n1 = 50 n2 = 60 ΰ΄₯ X1= 75 ΰ΄₯ X2 =79 s1 = 9 s2 = 7 Setting of Hypothesis: H0 : 𝛍₁ = 𝛍₂ i. e. there is no significant difference between the average marks of the students at these two colleges H1: 𝛍₁ β‰  𝛍₂ (Two tailed test) There is significant difference between the average marks of the students at these two colleges
  • 36. Test Statistic: Under H0 the test statistic is Z = (ΰ΄₯ X1βˆ’ΰ΄₯ X2) s1 2 n1 + s2 2 n2 = (75βˆ’79) 92 50 + 72 60 = -2.56 Η€ZΗ€cal= 2.56 Level of significance(𝜢) = 5% Critical value: Ztab= 1.96 Decision: Since Zcal > Ztab so H0 is rejected and H1 is accepted. Conclusion: Hence we conclude that There is significant difference between the average marks of the students at these two colleges 36
  • 37. Example: In a random sample of 500 the mean is found to be 20. In another independent sample of 400 the mean is 15. Could the samples have been drawn from the same population with standard deviation 4 ? Solution: With usual notations, Common Population S. D. (𝜎) = 4 H0:𝛍₁ = 𝛍₂ (Both samples have been Drawn from the same population) H1:𝛍₁ β‰  𝛍₂ (Two tailed test) Test statistic: Z = ΰ΄₯ X1βˆ’ ΰ΄₯ X2 𝜎2 1 n1 + 1 n2 contd… Example : If 60 MA students are found to have a mean height of 63.60 inches and 50 MBS students have mean height of 69.51 inches, would you conclude that the MBS students are taller than MA students? Assume the standard deviation of height of post-graduate students to be 2.48 inches. For MA: n1 = 60 , ΰ΄₯ X1= 63.60 For MBS n2 = 50 , ΰ΄₯ X2 = 69.51 common population s.d. (𝜎) = 2.48 H0:𝛍₁ = 𝛍₂ (There is no difference between height of MA and MBS students) H1:𝛍₁ < 𝛍₂ (MBS students are taller than MA students) Left Tailed Test Test statistic: Z = ΰ΄₯ X1βˆ’ ΰ΄₯ X2 𝜎2 1 n1 + 1 n2 contd… 37 First Sample Second Sample n1 = 500 n2 = 400 ΰ΄₯ X1= 20 ΰ΄₯ X2 = 15
  • 38. Example: Two random samples of Nepalese people taken from rural and urban region gave the following data of income. Test whether the average monthly income of urban region people is significantly more than rural region people. Solution: With usual notations, Test Statistic: Under H0 the test statistic is 𝐙 = (ΰ΄₯ π‘ΏπŸβˆ’ΰ΄₯ π‘ΏπŸ) π’”πŸ 𝟐 π’πŸ + π’”πŸ 𝟐 π’πŸ contd….. 38 Sample size Average monthly income Standard deviation From Urban Region 100 1250 30 From Rural Region 150 800 50 Urban Region Rural Region n1 = 100 n2 = 150 ΰ΄₯ X1= 1250 ΰ΄₯ X2 = 800 s1 = 30 s2 = 50 H0:𝛍₁ = 𝛍₂ (There is no significant difference between the average income of the people of urban and rural region) H1:𝛍₁ > 𝛍₂ (The average monthly income of urban region people is more than rural region people) Right Tailed Test
  • 39. Case III: Test of significance of Single Proportion 39 Procedure Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis. H0:P = 𝑃0i. e. the sample has been drawn from a population with proportion H1:P β‰  𝑃0(two tailed test), H1:P > 𝑃0(right tailed test), H1:𝑃 < 𝑃0(left tailed test) If sample proportion(p) > Population Proportion(P) : Right Tailed Test If Sample Proportion(p) < Population proportion(P) : Left tailed Test Step 2: Select test statistic:𝑍 = π·π‘–π‘“π‘“π‘’π‘Ÿπ‘’π‘›π‘π‘’ 𝑆.𝐸. = 𝑝 βˆ’ 𝑃 𝑃𝑄 𝑛 [Note: if we have sampling from a finite population of size N, then 𝑆. 𝐸. 𝑝 = π‘βˆ’π‘› π‘βˆ’1 𝑃(1βˆ’π‘ƒ) 𝑛 = π‘βˆ’π‘› π‘βˆ’1 𝑃𝑄 𝑛 ] Step 3: Fix the level of significance (𝛼) Step 4: Write the critical value (Z ) for 𝛼 % level of significance from the Z table. Step 5: Make decision: Critical value approach: If Η€ZΗ€(calculated) ≀ ZΞ±(table or critical ) we say it is not significant (insignificant).So Hβ‚€ is accepted. If Η€ZΗ€(calculated) > ZΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted. Step 6: Write Conclusion:
  • 40. Example 1 : An airline must allocate available seating space between first class passengers and economy class passengers. The null hypothesis is that 20% of the passengers fly first class but management recognizes the possibility that the percentage could be more or less. A random sample of 400 passengers includes 70 passengers holding first class tickets. Can the null hypothesis be rejected at the 10% level of significance? Solution: With usual notations, P = 0.20 , n = 400 , Q = 1- P = 1 – 0.20 = 0.80 sample proportion(p) = π‘₯ 𝑛 = 70 400 = 0.175 Setting of Hypothesis: H0: P = 0.20 (20% of the passengers fly first class) H1: P β‰  0.20 ( more or less than 20% passengers fly first class) Test Statistics: Under H0 ,the test statistic is Z = 𝑝 βˆ’ 𝑃 𝑃𝑄 𝑛 = 0.175βˆ’0.20 0.20 Γ—0.80 400 = - 1.25 Zcal =1.25 Level of significance (𝜢) = 10% Critical value: Ztab= 1.645 Decision : Since Zcal < Ztab so H0 is accepted Conclusion: Hence we conclude that 20% of the passengers fly first class. 40
  • 41. Example 2: A manufacturer claimed that at least 90% of the machine parts that is supplied by that factory conformed to specifications. An examination of 200 such parts revealed that 160 parts were not faulty. Determine if the manufactuer’s claim is legitimate at 1% level of significance. Solution: With usual notations P = 0.90 , Q = 1- P = 1- 0.90 = 0.10 n = 200 , x = 160, p = π‘₯ 𝑛 = 160 200 = 0.80 Setting of Hypothesis: H0: P β‰₯0.90 (At least 90% of the machine parts is conformed to specifications) H1:P<0.90 [Left tailed Test](Less than 90% of the machine parts is conformed to specifications) Test Statistics: Zcal = 𝑝 βˆ’ 𝑃 𝑃𝑄 𝑛 = 0.80βˆ’0.90 0.90 Γ—0.10 200 = -4.71 Η€ZcalΗ€ = 4.71 Level of Significance(𝜢): 1% Critical Value: Ztab = -2.33 Η€ZtabΗ€ = 2.33 Decision: Since | zcal |> | ztab | So H0 is rejected and H1 is accepted. Conclusion: 41
  • 42. Example 3: A cellphone provider has the business objective of wanting to determine the proportion of subscribers who would upgrade to a new cellphone with improved features if it were made available at a substantially reduced cost. Data are collected from a random sample of 500 subscribers. The results indicate that 135 of the subscribers would upgrade to a new cellphone at a reduced cost. At the 0.05 level of significance, is there evidence that more than 20% of the customers would upgrade to a new cellphone at a reduced cost? Solution: With usual notations, n = 500 , x = 135 , p = π‘₯ 𝑛 = 135 500 = 0.27, P = 0.20 , Q = 0.80 Setting of Hypothesis: H0: P =0.20 (20% of the customers would upgrade to a new cellphone at a reduced cost H1: P>0.20 [Right tailed Test](more than 20% of the customers would upgrade to a new cellphone at a reduced cost) Test Statistics: Zcal = 𝑝 βˆ’ 𝑃 𝑃𝑄 𝑛 = 0.27βˆ’0.20 0.20 Γ—0.80 500 = 3.91 Level of Significance(𝜢): 0.05 Critical value: Ztab = 1.645 Decision: Since Zcal > Ztab so H0 is rejected 42
  • 43. 43 Procedure Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis. H0:π‘·πŸ = π‘·πŸi. e. the sample has been drawn from a population with proportion P. H1: π‘·πŸ β‰  π‘·πŸ(two tailed test), H1 :π‘·πŸ > π‘·πŸ(right tailed test), H1 :π‘·πŸ < π‘·πŸ(left tailed test) If First sample proportion (p1) > Second sample proportion (p2) : Right Tailed If First sample proportion (p1) < Second sample proportion (p2): Left Tailed Step 2: Select test statistic: 𝑍 = 𝑝1 βˆ’ 𝑝2 𝑃1𝑄1 𝑛1 + 𝑃2𝑄2 𝑛2 π‘œπ‘Ÿ 𝑍 = (𝑝1βˆ’ 𝑝2) ΰ·  𝑃 ΰ·  𝑄 1 𝑛1 + 1 𝑛2 Note: If P1 and P2 are not known, each of them is estimated by ΰ·‘ P = π‘₯1+π‘₯2 𝑛1+𝑛2 = n1p1+n2p2 n1+ n2 ΰ·‘ Q = 1 - ΰ·‘ P Step 3: Fix the level of significance (𝜢) Step 4: Write the critical value (Z ) for 𝛼 % level of significance from the Z table. Step 5: Make decision: If Η€ZΗ€(calculated) ≀ ZΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted. If Η€ZΗ€(calculated) > ZΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted. Step 6: Write Conclusion: Case IV: Test of Significance for Difference of Two Proportions
  • 44. Example: An online survey asked 1,000 adults β€œWhat do you buy from your mobile device?”. Both sample sizes were 500 and that 195 out of 500 males and 305 out of 500 females reported they buy clothing from their mobile device. Is there evidence of a difference between males and females in the proportion who said they buy clothing from their mobile device at the 0.01 level of significance? Solution: With usual notations, For Male: 𝑛1= 500 , 𝑝1= 195 500 =0.39 For Female: 𝑛2= 500 , 𝑝2= 305 500 =0.61 Setting of Hypothesis: H0: 𝑃1 = 𝑃2 i. e. there is no significant difference between males and females in the proportion of clothes buying from mobile device. H1: 𝑃1 β‰  𝑃2 (Two tailed test). Test Statistics: Zcal = p1βˆ’ p2 ΰ·‘ Pΰ·‘ Q 1 n1 + 1 n2 ΰ·‘ P = n1p1+n2p2 n1+n2 = 500Γ—0.39+500Γ—0.61 500+500 = 0.5 ΰ·‘ Q = 1 βˆ’ 0.5 = 0.5 Hence Zcal = -6.957 Level of significance(𝜢) = 0.01 (1%) Critical value: Ztab= 2.58 Decision: 44
  • 45. 45 Example: At a certain date in a large city 400 out of a random sample of 500 men were found to be smokers. After the tax on tobacco had been heavily increased another random sample of 600 men in the same city included 400 smokers. Was the observed decrease in the proportion of the smokers significant? Solution: With usual notations, Setting of Hypothesis: β€’ Null Hypothesis (H0): 𝑃1 = 𝑃2 i. e. there is no significant difference between the proportion of the smokers before and after the increase in the tax. β€’ Alternative Hypothesis (H1): 𝑃1 > 𝑃2 (right tailed test). There is a significant decrease in the proportion of the smokers after the increase in the tax. Before Tax increased After Tax increased 𝑛1= 500 𝑛2= 600 𝑝1= π‘₯1 𝑛1 = 400 500 = 0.8 𝑝2= π‘₯2 𝑛2 = 400 600 = 0.667
  • 46. Test statistic: Zcal = p1βˆ’ p2 ΰ·‘ Pΰ·‘ Q 1 n1 + 1 n2 ΰ·‘ P = n1p1 + n2p2 n1 + n2 = 500 x 0.8 + 600 x 0.667 500 + 600 = 400 + 400 1100 = 800 1100 = 8 11 & ΰ·‘ Q = 1 βˆ’ 8 11 = 3 11 Zcal = p1βˆ’ p2 ΰ·‘ Pΰ·‘ Q 1 n1 + 1 n2 = 0.8 βˆ’ 0.667 8 11 π‘₯ 3 11 1 500 + 1 600 = 0.133 0.027 = 4.926 Level of significance (Ξ±) = 0.05, Critical value: then tabulated value of Z(Ξ± = 0.05) for right tailed Ztab = 1.645 Decision: Since Zcal = 4.926 > Ztab = 1.645 then, H0 is rejected i. e. Calculated value of Z is greater than tabulated value of Z at 0.05 level of significance. Then H0 is rejected. Conclusion: Then there is a significant decrease in the proportion of the smokers after the increase in the tax. 46
  • 47. Example: A firm found with the help of a sample survey of a city (size of sample 900) that πŸ‘ πŸ’ 𝒕𝒉 of them consumes things produced by them. The firm then advertised the goods in paper and on radio. After one year a sample of size 1000 reveals that proportion of consumers of the goods produced by the firm is πŸ’ πŸ“ 𝒕𝒉 . Is this rise significant to indicate that the advertisement was effective? Solution: With usual notations Setting of Hypothesis: H0: 𝑃1 = 𝑃2 The advertisement was not effective H1: 𝑃1 < 𝑃2 (Left tailed test). The advertisement was effective. Test Statistics: Zcal = p1βˆ’ p2 ΰ·‘ Pΰ·‘ Q 1 n1 + 1 n2 ΰ·‘ P = 𝑛1𝑝1+𝑛2𝑝2 𝑛1+𝑛2 = 900Γ— 3 4 +(1000Γ— 4 5 ) 900+1000 = 0.77 ΰ·‘ Q = 1 - ΰ·‘ P = 1 – 0.77 = 0.23 Hence Zcal = -2.59 |π‘π‘‘π‘Žπ‘| = 2.59 Level of Significance (𝜢) = 5% Critical Value: Ztab = -1.645, |π‘π‘‘π‘Žπ‘| = 1.645 Decision: H0 is rejected and H1 is accepted. 47 Before Ad After Ad 𝑛1= 900 𝑛2= 1000 𝑝1= 3 4 𝑝2= 4 5
  • 49. Student’s β€˜t’: Definition β€’ If x1, x2,….,xn is a random sample of size β€˜n’ from a normal population with mean ΞΌ and population variance Οƒ2, then Student’s β€˜t’ statistic is defined as: β€’ 𝐭 = ΰ΄₯ 𝐗 βˆ’ 𝛍 ΰ΅— 𝐒 𝐧 = (ΰ΄₯ 𝐗 βˆ’ 𝛍) ΰ΅— π’πŸ 𝐧 = 𝑛(ΰ΄₯ X βˆ’ ΞΌ) S ~𝑑(π‘›βˆ’1) … … (𝟏) β€’ Where: ΰ΄€ x = Οƒ X n , is the sample mean and S2 = 1 π‘›βˆ’1 Οƒ(π‘₯ βˆ’ π‘₯)2 = unbiased estimate of the population variance Οƒ2 . β€’ Note: β€˜t’ defined in the (1) follows Student’s β€˜t’- distribution with (n – 1) degrees of freedom β€’ Degrees of freedom is defined as number of independent observations.
  • 50. Assumptions 1. The parent population from which the sample is drawn is normal. 2. Sample size is generally less than or equals to 30. (which is considered as general traditional convention) 3. All the observations in the samples are independent. 4. Samples are drawn randomly from normal populations. 5. Population standard deviation (Οƒ) is unknown. 6. The sample values are correctly measured and recorded. 7. The hypothetical value of population mean (πœ‡0) of πœ‡ is correct values of the population mean.
  • 51. β€˜t’ distribution t = 0 -1 -3 -2 f(t) Normal curve t- distribution for n = 25 +2 +3 +1 Fig. 1 t- distribution for n = 5
  • 52. Applications of t-test (i) Test of significance of a single mean (ii) Test of significance of the difference between two independent means (iii)Paired t test for difference of two means (iv)Test of significance of an observed sample correlation coefficient and regression coefficients
  • 53. Case I: Test of significance of a single mean Procedure Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis. H0: πœ‡ = πœ‡0 H1: πœ‡ β‰  πœ‡0 (Two tailed test), : πœ‡ > πœ‡0 (Right tailed test), : πœ‡ < πœ‡0 (Left tailed test) If sample mean ( ΰ΄€ 𝑋) > Population mean (πœ‡) : Right Tailed Test If sample mean ( ΰ΄€ 𝑋) < Population mean (πœ‡) : Left Tailed Test Step 2: Select test statistic: t = ΰ΄₯ X βˆ’ ΞΌ ΰ΅— S n = n (ΰ΄₯ X βˆ’ ΞΌ) S [π‘“π‘œπ‘Ÿ π‘’π‘›π‘π‘–π‘Žπ‘ π‘’π‘‘( π΄π‘π‘‘π‘’π‘Žπ‘™ π‘‘π‘Žπ‘‘π‘Ž 𝑖𝑠 𝑔𝑖𝑣𝑒𝑛)] t= ΰ΄₯ X βˆ’ ΞΌ ΰ΅— s nβˆ’1 = (nβˆ’1) (ΰ΄₯ X βˆ’ ΞΌ) s [π‘“π‘œπ‘Ÿ π‘π‘–π‘Žπ‘ π‘’π‘‘ (π‘ π‘Žπ‘šπ‘π‘™π‘’ 𝑠. 𝑑. 𝑖𝑠 𝑔𝑖𝑣𝑒𝑛)] Step 3: Fix the level of significance (𝜢) Step 4: degrees of freedom(d.f.) = (n – 1) Step 5: Write the critical value (t ) for 𝛼 % level of significance from the t- table. Step 6: Make decision: Critical value approach: If Η€tΗ€(calculated) ≀ tΞ±(table or critical ) : it is not significant (insignificant). So Hβ‚€ is accepted. If Η€tΗ€(calculated) > tΞ±(table or critical) : it is significant. So Hβ‚€ is rejected, then H₁ is accepted. Step 7: Write Conclusion:
  • 54. Computation of S2 for Numerical Problems: β€’ 𝑆2 = 1 (π‘›βˆ’1) Οƒ(π‘₯ βˆ’ π‘₯ )2 where Οƒ(π‘₯ βˆ’ π‘₯ )2= sum of the squares of deviation taken from mean β€’ S2 = 1 (nβˆ’ 1) Οƒ π‘₯2 βˆ’ Οƒ π‘₯ 2 𝑛 β€’ 𝑆2 = 1 (π‘›βˆ’1) Οƒ 𝑑2 βˆ’ Οƒ 𝑑 2 𝑛 where π‘₯ = 𝐴 + Οƒ 𝑑 𝑛 How can we see critical value: d.f. Level of significance for one tailed test 0.10 0.05 0.025 0.01 0.005 0.0005 Level of significance for two tailed test 0.20 0.10 0.05 0.02 0.01 0.001 1 3.078 6.314 12.706 31.821 63.657 636.619 24 2.064
  • 55. Confidence intervals using β€˜t’ or Confidence interval for Small Samples β€’ Confidence intervals = estimator Β± (reliability constant) x (standard error of the estimate). β€’ Reliability coefficient is obtained from the table of t- distribution. β€’ C. I. (ΞΌ) = ΰ΄€ 𝐱 Β± 𝐭𝜢,π’βˆ’πŸ. 𝐒 𝐧 . [ For unbiased estimator, When actual data is given] β€’ C. I. (ΞΌ) = ΰ΄€ 𝐱 Β± 𝐭𝜢,π’βˆ’πŸ. 𝐬 π§βˆ’πŸ . [ For biased estimator when sample s.d. or variance is given]
  • 56. Example: The average cost of a hotel room in New York is said to be $168 per night. To determine if this is true, a random sample of 25 hotels is taken and resulted in average of $172.50 with standard deviation of $15.40. Test the appropriate hypothesis at 0.05 level of significance. Solution: With usual notations: πœ‡ =$ 168 , n = 25 , ΰ΄€ 𝑋= $ 172.50 , s = $ 15.40 Formulation of Hypothesis: H0: ΞΌ = $ 168 . The average cost of a hotel room in New York is $168 per night. H1: ΞΌ ο‚Ή 168 (Two tailed test). The average cost of a hotel room in New York is not equals to $168 per night. Test statistic: Under H0 ,the test statistics is 𝑑 = ΰ΄€ 𝑋 βˆ’ πœ‡ ΰ΅— 𝑆 π‘›βˆ’1 = 172.50 βˆ’ 168 ΰ΅— 15.40 25βˆ’1 = 1.43 tcal =1.43 Level of significance(𝜢) = 0.05 Degree of freedom(d.f.) = n-1 = 25-1 = 24 Critical value: The tabulated value of t for two tailed test at 0.05 level of significance with 24 d.f. is 2.064 i.e. ttab = 2.064 Decision: Since tcal < ttab so H0 is accepted. Conclusion: Hence we conclude that the average cost of a hotel room in New York is $168 per night
  • 57. Contd.. 95% confidence interval for population mean: n= 25, ΰ΄€ 𝑋 = $ 172.50 , s = $ 15.40 1- 𝛼 = 0.95 , 𝛼 = 0.05 d.f. = n-1 = 25-1 = 24 𝑑𝛼,π‘›βˆ’1= 𝑑0.05,24 = 2.064 95% CI for Population mean( πœ‡) is = ΰ΄€ 𝑋 Β± 𝑑𝛼,π‘›βˆ’1 . 𝑠 π‘›βˆ’1 = 172.50 Β± 2.064 . 15.40 25βˆ’1 = 172.50 Β± 6.48 = (172.50 – 6.48 , 172.50 + 6.48) = (166.01 , 178.98)
  • 58. Example : The following are weight (gm.) values for sample of 21 chocolates. Can we conclude from these data that the mean of the population from which the sample was drawn is greater than 14.0? Use Ξ± = 0.05. 14.5 12.9 14.0 16.1 12.0 17.5 14.1 12.9 17.9 12.0 16.4 24.2 12.2 14.4 17.0 10.0 18.5 20.8 16.2 14.9 19.6
  • 59. Solution: Calculation of S2: X X2 14.5 210.25 10.9 166.41 14.0 196.00 16.1 259.21 12.0 144.00 17.5 306.25 14.1 198.81 12.9 166.41 17.9 320.41 12.0 144.00 16.4 268.96 24.2 585.64 12.2 148.84 14.4 207.36 17.0 289.00 10.0 100.00 18.5 342.25 20.8 432.64 16.2 262.44 14.9 222.01 19.6 384.16 Ξ£X = 328.1 Ξ£X2 = 5355.05
  • 60. β€’ Now 𝑆2 = 1 (π‘›βˆ’1) Οƒ 𝑋2 βˆ’ (Οƒ 𝑋) 2 𝑛 = 1 21βˆ’1 5355.05 βˆ’ 328.1 2 21 = 11.44 β€’ ∴ 𝑠 = 11.44 = 3.38 β€’ Sample size (n) = 21, sample mean (π‘₯) = Οƒ 𝑋 𝑛 =328.1/21 = 15.62, sample standard deviation (s) = 3.38,, population mean (ΞΌ) = 14, β€’ Hypothesis setting: β€’ Null Hypothesis: H0:ΞΌ = 14 gm i. e. β€’ Alternative hypothesis: H1: ΞΌ > 14 gm(Right tailed test) i. e. β€’ Test statistic: 𝑑 = ΰ΄₯ X βˆ’ ΞΌ ΰ΅— S n = 15.62βˆ’14 ΰ΅— 3.38 21 = 1.62 Ξ€ 3.38 4.58 = 1.62 0.73 = 2.21 β€’ Level of significance(𝜢 )= 0.05 β€’ Degree of freedom(d.f.) = n -1 = 21 – 1 = 20 β€’ Critical value: 𝑑𝛼=0.05 π‘“π‘œπ‘Ÿ π‘œπ‘›π‘’ π‘‘π‘Žπ‘–π‘™π‘’π‘‘ ,20 𝑑𝑓 = 1.725 β€’ Decision: since π‘‘π‘π‘Žπ‘™ = 2.21 > 𝑑𝛼 = 1.725 then the test is significant so we reject hull hypothesis and then accept alternative hypothesis (H1). β€’ Conclusion:
  • 61. Case II: Test of significance of difference between two independent means (Pooled t- test) independent t -test Procedure Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis. H0: 𝝁𝟏 = 𝝁𝟐 i. e. there is no significant difference between the two population means H1: 𝝁𝟏 β‰  𝝁𝟐 (two tailed test), H1: 𝝁𝟏 > 𝝁𝟐 (right tailed test), H1:𝝁𝟏 < 𝝁𝟐(left tailed test) If First sample mean(ΰ΄₯ X1) > Second sample mean(ΰ΄₯ X2) : Right Tailed Test If First sample mean(ΰ΄₯ X1) < Second sample mean(ΰ΄₯ X2) : Left Tailed Test Step 2: Test Statistics , t = (𝑋1 βˆ’ 𝑋2 ) 𝑆2 1 𝑛1 + 1 𝑛2 = (𝑋1 βˆ’ 𝑋2 ) 𝑆 1 𝑛1 + 1 𝑛2 ~𝑑(𝑛1+𝑛2βˆ’2) Step 3: Fix the level of significance (𝜢) Step 4: Degrees of freedom(d.f.) = (𝑛1 + 𝑛2 βˆ’ 2) Step 5: Write the critical value (t ) for 𝛼 % level of significance at (n1 + n2 – 2) d. f. from the t- table. Step 6: Make decision: Critical value approach: If Η€tΗ€(calculated) ≀ tΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted. If Η€tΗ€(calculated) > tΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted. Step 7: Conclusion:
  • 62. Calculation of S2 β€’ Actual mean method: 𝑆2 = 1 (𝑛1+ 𝑛2βˆ’2) [Οƒ(𝑋1 βˆ’ 𝑋1)2 + Οƒ(𝑋2 βˆ’ 𝑋2)2 ] β€’ Direct method: 𝑆2 = 1 (𝑛1+ 𝑛2βˆ’2) ቂ ቃ Οƒ 𝑋1 2 βˆ’ Οƒ 𝑋1 2 𝑛1 + Οƒ 𝑋2 2 βˆ’ Οƒ 𝑋2 2 𝑛2 β€’ Short cut method: 𝑆2 = 1 (𝑛1+ 𝑛2βˆ’2) ቂ ቃ Οƒ 𝑑1 2 βˆ’ Οƒ 𝑑1 2 𝑛1 + Οƒ 𝑑2 2 βˆ’ Οƒ 𝑑2 2 𝑛2 β€’ Where d1 = X – A, d2 = X – B, A = assumed mean of series X, B = assumed mean of series Y. β€’ π‘ΊπŸ = π’πŸ.π’”πŸ 𝟐+ π’πŸ.π’”πŸ 𝟐 (π’πŸ+ π’πŸβˆ’πŸ) [ when the sample variance (or sample standard deviation) i. e. biased estimates are given]
  • 63. Example: Two different types of drugs D1 and D2 were administrated on certain patients for increasing weight at interval of one week time period. From the following observation, can you conclude that Test at 5% level of significance. i. Whether two drugs differ significantly? ii. The second drug is more effective in increasing weight. D1 8 12 13 9 3 8 10 9 D2 10 8 12 15 6 11 12 12
  • 64. Solution Calculation of S2 X1 X2 X1 2 X2 2 8 10 64 100 12 8 144 64 13 12 169 144 9 15 81 225 3 6 9 36 8 11 64 121 10 12 100 144 9 12 81 144 Ξ£X1 = 72 Ξ£X2 = 86 Ξ£X1 2 = 712 Ξ£X2 2 = 978 β€’ Here n1 = n2 = 8 then π‘₯1= 72/8 = 9, π‘₯2= 86/8= 10.75 β€’ 𝑆2 = 1 𝑛1+ 𝑛2βˆ’2 Οƒ 𝑋1 2 βˆ’ (Οƒ 𝑋1)2 𝑛1 + Οƒ 𝑋2 2 βˆ’ (Οƒ 𝑋2)2 𝑛2 β€’ = 1 8+ 8βˆ’2 712 βˆ’ (712)2 8 + 978 βˆ’ (86)2 8 = 8.4
  • 65. Contd… (i) Setting of Hypothesis: H0: ΞΌ1 = ΞΌ2 (i. e. both drugs are equally effective) H1: ΞΌ1 β‰  ΞΌ2 (i. e. both drugs are not equally effective) Test statistic: 𝑑 = ΰ΄€ 𝑋1βˆ’ ΰ΄€ 𝑋2 𝑆2 1 𝑛1 + 1 𝑛2 = 9 βˆ’ 10.75 8.4 1 8 + 1 8 = βˆ’1.208 then π‘‘π‘π‘Žπ‘™ = βˆ’1.208 = 1.208 Level of significance(𝜢) = 0.05 Degree of freedom(d.f.) = (𝑛1 + 𝑛2 βˆ’ 2) = 8+8- 2 = 14 Critical value: Tabulated value of t for two tailed test at 0.05 level of significance with 14 d.f. is 2.145 i.e. π‘‘π‘‘π‘Žπ‘= 2.145 Decision: π‘‘π‘π‘Žπ‘™ = 1.208 < π‘‘π‘‘π‘Žπ‘ = 2.145 so we accept H0 . Conclusion: we can conclude that both drugs are equally effective.
  • 66. Contd… (ii) Setting of Hypothesis: H0: ΞΌ1 = ΞΌ2 (i. e. both drugs are equally effective) H1: ΞΌ1 < ΞΌ2 (i. e. second drug is effective than first drug) Test statistic: 𝑑 = ΰ΄€ 𝑋1βˆ’ ΰ΄€ 𝑋2 𝑆𝑝 2 1 𝑛1 + 1 𝑛2 = 9 βˆ’ 10.75 8.4 1 8 + 1 8 = βˆ’1.208 then π‘‘π‘π‘Žπ‘™ = βˆ’1.208 = 1.208 Level of significance(𝜢) = 0.05 Critical value: Tabulated value of t for two tailed test at 0.05 level of significance with 14 d.f. is 2.145 i.e. π‘‘π‘‘π‘Žπ‘= 1.761 Decision: π‘‘π‘π‘Žπ‘™ = 1.208 < π‘‘π‘‘π‘Žπ‘ =1.761 so we accept H0 . Conclusion: we can conclude that both drugs are equally effective
  • 67. Example: Two salesmen A and B are working in a certain district. From a sample survey conducted by the head office, the following results were obtained. State whether there is any significant difference in the average sales between the two salesmen. Solution: With the usual notations, we have, For A For B 𝑛1 = 20 𝑛2 = 18 π‘₯1 = 170 π‘₯2= 205 𝑠1 = 20 𝑠2 = 25 Setting of Hypothesis: H0: ΞΌ1 = ΞΌ2 , i.e. there is no significant difference between the average sales of two salesmen H1: ΞΌ1 β‰  ΞΌ2 i.e. there is a significant difference between the average sales of two salesmen. Salesmen A Salesmen B No of sales: 20 18 Average sales: 170 205 Standard deviation 20 25
  • 68. Test statistic: under H0the test statistic is 𝑑 = ΰ΄€ 𝑋1βˆ’ ΰ΄€ 𝑋2 𝑆2 1 𝑛1 + 1 𝑛2 𝑆2 = 𝑛1.𝑠1 2+ 𝑛2.𝑠2 2 (𝑛1+ 𝑛2βˆ’2) = 𝑛1.𝑠1 2+ 𝑛2.𝑠2 2 (𝑛1+ 𝑛2βˆ’2) = 534.75 t= 170βˆ’205 534.72 1 20 + 1 18 = -4.56 Hence tcal = 4.56 Level of significance( ) = 0.05 Degree of freedom(d.f.) = n1 + n2 – 2 = 20 + 18 – 2 = 36 Critical Value: Since d.f. =36 > 30, it follows normal distribution. Hence the tabulated value of t at 5% level of significance for two tailed test is 1.96. Decision: Since the calculated value of t is greater than the tabulated value of t, it is significant, hence the null hypothesis is rejected i.e. alternative hypothesis is accepted, which means that there is a significant difference between the average sales of two salesmen.
  • 69. When Z-test and When t-test is suitable? β€’ In case of test of significance of single mean and double(two independent means) β€’ You can use t test if population standard deviation is not given(unknown) in case of sample size greater than 30 β€’ Use tabulated value from table (also in t-table) β€’ See next example
  • 70. β€’ Two independent samples z-test: β€’ Large sample size (typically > 30) β€’ Known population standard deviation β€’ Normally distributed population β€’ Two independent samples (unpaired) t-test: β€’ Small sample size (typically < 30) β€’ Unknown population standard deviation β€’ Population may not be normally distributed
  • 71. Question: Read the following information perform the following: (i) Test whether two means are significantly difference at 0.05 level of significance using independent t-test. (ii) Compute 95% confidence interval estimation for the difference of means (iii) Show the linkage between testing of hypothesis and interval estimation in this problem Group I Group II Sample means 10 15 Sample standard deviation 3 5 Sample size 49 64
  • 72. Solution: β€’ (i) as same process solving above β€’ (ii) C. I. β€’ (π‘ΏπŸ βˆ’ π‘ΏπŸ) Β± t𝛂, π’πŸ + π’πŸ βˆ’ 𝟐 βˆ— π’”πŸ( 𝟏 π’πŸ + 𝟏 π’πŸ ) β€’ 𝑆2 = 𝑛1.𝑠1 2+ 𝑛2.𝑠2 2 (𝑛1+ 𝑛2βˆ’2) see ttab two tailed test β€’ (iii) if hypothetical value of mean difference zero (0) lies between confidence interval we accept null hypothesis otherwise reject
  • 73. Case III : Paired t-test for difference of two means Procedure Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis. H0: 𝝁𝒙 = ππ’š or 𝝁𝒅 = 𝟎 i. e. there is no significant difference between the two population means H1: 𝝁𝒙 β‰  ππ’š or 𝝁𝒅 β‰  𝟎 (two tailed test), : 𝝁𝒙 > ππ’š (right tailed test), :𝝁𝒙 < ππ’š(left tailed test) Step 2: Select test statistic:𝒕 = ΰ΄₯ 𝒅 ΰ΅— 𝒔𝒅 𝟐 𝒏 ~𝒕(π’βˆ’πŸ) Where d = X – Y X = value before treatment Y = value after treatment Treatment means performing advertisement for promoting sales of goods or performing, training for increasing memory capacity of person etc….. 𝑑 = Οƒ 𝑑 𝑛 π‘Žπ‘›π‘‘ 𝑠𝑑 2 = Οƒ(π‘‘βˆ’ ΰ΄€ 𝑑)2 (π‘›βˆ’1) = 1 (π‘›βˆ’1) Οƒ 𝑑2 βˆ’ Οƒ 𝑑 2 𝑛 Step 3: Fix the level of significance (𝜢) Step 4: degrees of freedom = (n – 1) Step 5: Write the critical value (t ) for 𝛼 % level of significance and (n – 1)d. f. from the t- table. Step 6: Make decision: Critical value approach: If Η€tΗ€(calculated) ≀ tΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted. If Η€tΗ€(calculated) > tΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted.
  • 74. Example: A company has reorganized its sale department. The following data show the weekly sales figures (in Rs. lakhs) before and after the reorganization. Can we say that sales have significantly changed due to the reorganization? In this problem, the sales before reorganization (x) and after reorganization (y) are not independent, but are paired together. Hence we shall apply paired t-test. Setting of Hypothesis: H0: πœ‡π‘₯ = πœ‡π‘¦ i.e. there is no significant change in sale after the reorganization. H1: πœ‡π‘₯ β‰  πœ‡π‘¦ (Two tailed test), , i.e. there is a significant change in sale after the reorganization. Test Statistics: Under H0 , the test statistics is 𝑑 = ΰ΄€ 𝑑 ΰ΅— 𝑆2 𝑛 Week 1 2 3 4 5 6 7 8 9 10 Sales before: 12 15 13 11 17 15 10 11 18 19 Sales after: 16 17 14 13 15 14 12 11 17 22
  • 75. 𝑑 = Οƒ 𝑑 𝑛 = βˆ’10 10 = -1 s2 = 1 (π‘›βˆ’1) Οƒ 𝑑2 βˆ’ Οƒ 𝑑 2 𝑛 = 1 (10βˆ’1) 44 βˆ’ βˆ’10 2 10 = 3.778 Wee k Sales (before reorganization) (x) Sales (after reorganization) (y) d= X - Y d2 1 2 3 4 5 6 7 8 9 10 12 15 13 11 17 15 10 11 18 19 16 17 14 13 15 14 12 11 17 22 -4 -2 -1 -2 2 1 -2 0 1 - 3 16 4 1 4 4 1 4 0 1 9 Total Οƒ 𝑑 = -10 Οƒ 𝑑2 = 44
  • 76. Hence , 𝑑 = ΰ΄€ 𝑑 ΰ΅— 𝑆2 𝑛 = βˆ’1 Ξ€ 3.778 10 = -1.629 , Η€tΗ€(calculated) = 1.629 Level of significance = 0.05 Degree of freedom (d.f.) = n-1 = 10-1 = 9 The critical (tabulated) value of t for 9 d.f. at 5% level of significance for two tailed test 2.262. Decision: Since calculated value of t is less than critical value of t, it is not significant at 5% level of significance. Hence, the data do not provide any evidence against the null hypothesis which may be accepted. It may therefore be concluded that reorganization has no effect on sales.
  • 77. Calculations: S. N. Change in score (d) (d)2 1 8 64 2 10 100 3 -2 4 4 0 0 5 -5 25 6 -1 1 7 9 81 8 12 144 9 6 36 10 5 25 Total Ξ£d = 42 Ξ£d2 = 480 Example 2: A drug is given to 10 patients, on the increment in their blood pressure were recorded to be 8, 10, -2, 0, - 5, - 1, 9, 12, 6, 5. Test at 5% level of significance whether the drug is effective for either increasing or decreasing the blood pressure.
  • 78. Contd… β€’ Number of observation (n) = 10, β€’ Now 𝑑 = Οƒ 𝑑 𝑛 = 42 10 = 4.2 and β€’ 𝑆2 = 1 (π‘›βˆ’1) Οƒ 𝑑2 βˆ’ Οƒ 𝑑 2 𝑛 = 1 9 480 βˆ’ (42)2 10 = 33.73 β€’ Null Hypothesis: H0: πœ‡π‘₯ = πœ‡π‘¦ i. e. there was no increase or decrease in blood pressure before and after the treatment. The treatment was not effective β€’ Alternative Hypothesis: H1: πœ‡π‘₯ β‰  πœ‡π‘¦ i. e. there was increase or decrease in blood pressure before and after the treatment. The treatment was effective. (two tailed test) β€’ Test statistic: 𝑑 = ΰ΄€ 𝑑 ΰ΅— 𝑆2 𝑛 = 4.2 33.735 10 = 2.29 β€’ level of significance(𝜢)= 0.05 β€’ Degree of freedom(d.f.) = n - 1 = 10 – 1 = 9 β€’ Critical value of t for 9 d. f. and two tailed test 0.05 level of significance, 𝑑𝛼=0.05 = 2.262 β€’ Decision: π‘‘π‘π‘Žπ‘™ = 2.29 > 𝑑𝛼=0.05 = 2.262 then it is significant so we reject H0 and accept H1. β€’ Conclusion: The treatment was effective.
  • 79. Case IV: Test of significance of an observed sample correlation coefficient Let r be the observed sample correlation coefficient in a sample of n pairs of observations from bivariate normal population. In order to test whether any correlation coefficient between variables in population, t-test is applied. Procedure: Step 1: Formulate the Null (H0) and Alternative (H1) Hypothesis. H0: 𝝆 = 0 i.e. Population correlation coefficient is zero or Variables are uncorrelated in the populations H1: 𝝆 β‰  𝟎 (two tailed test), variables are correlated in the population H1 : 𝝆 > 𝟎 (right tailed test) , H1 :𝝆 < 𝟎(left tailed test) Step 2: Select test statistic:𝑑 = 𝒓 πŸβˆ’π’“πŸ . 𝒏 βˆ’ 𝟐 Step 3: Fix the level of significance (𝜢) Step 4: degrees of freedom = (n – 2) Step 5: critical value (t ) for 𝛼 % level of significance and (n – 2)d. f. from the t- table. Step 6: Make decision: Critical value approach: If Η€tΗ€(calculated) ≀ tΞ±(table or critical ) we say it is not significant (insignificant). So Hβ‚€ is accepted. If Η€tΗ€(calculated) > tΞ±(table or critical) we say it is significant. So Hβ‚€ is rejected, then H₁ is accepted. step 7: Conclusion:
  • 80. Example: A random sample of 27 pairs of observations from a normal population gives a correlation coefficient of 0.42. Is it likely that the variables in the population are uncorrelated? Also compute 95% confidence limit for the population correlation coefficient. Solution: with usual notations, n = 27, r = 0.42 Setting of Hypothesis: H0: 𝜌 = 0 i.e. Variables in the populations are not correlated . H1: 𝜌 β‰  0 (Two tailed test) i.e. Variables in the populations are correlated . Test Statistics: 𝑑 = 𝒓 πŸβˆ’π’“πŸ . 𝒏 βˆ’ 𝟐 = 𝟎.πŸ’πŸ πŸβˆ’πŸŽ.πŸ’πŸπŸ . πŸπŸ• βˆ’ 𝟐 = 2.31 Level of significance(𝜢) = 0.05 Degree of freedom(d.f.) = n-2 = 27-2 = 25 Critical value: ttab = 2.060 Decision: For 95% confidence limits: 1- 𝛼= 95%, 𝛼= 5% d.f. = 27-2 =25 𝑑𝛼,π‘›βˆ’2 = 2.060 Hence 95% Confidence limits for population correlation coefficient are = r Β± 𝑑𝛼,π‘›βˆ’2 . 1βˆ’π‘Ÿ2 𝑛 = 0.42 Β± 2.060 . 1βˆ’0.422 27 = 0.42 Β±