Goodness-of-Fit Tests
Let Y1, Y 2, . . ., Y n be a set of independent and identically distributed random variables. Assume that the probability distribution of the Y i's has the density function f o (y). We can divide the set of all possible values of Yi,
i ÃŽ {1, 2, ..., n}, into m non-overlapping intervals D1, D2, ...., Dm. Define the probability values p1, p2 , ..., pm as;
p1 = P(Yi ÃŽ D1)
p2 = P(Yi ÃŽ D2):
:pm = P(Yi ÃŽ Dm)
Since the union of the mutually exclusive intervals D1, D2, ...., Dm is the set of all possible values for the Yi's, (p1 + p2 + .... + pm) = 1. Define the set of discrete random variables X1, X2, ...., Xm, where
X1= number of Yi's whose value ÃŽ D1
X2= number of Yi's whose value ÃŽ D2:
:Xm= number of Yi's whose value ÃŽ Dm
and (X1+ X2+ .... + Xm) = n. Then the set of discrete random variables X1, X2, ...., Xmwill have a multinomial probability distribution with parameters n and the set of probabilities {p1, p2, ..., pm}. If the intervals D1, D2, ...., Dm are chosen such that npi³ 5 for i = 1, 2, ..., m, then;
For the goodness-of-fit sample test, we formulate the null and alternative hypothesis as
Ho : fY(y) = fo(y)
H1 : fY(y) ¹ fo(y)
At the a level of significance, Ho will be rejected in favor of H1 if
However, it is possible that in a goodness-of-fit test, one or more of the parameters of fo(y) are unknown. Then the probability values p1, p2, ..., pm will have to be estimated by assuming that Ho is true and calculating their estimated values from the sample data. That is, another set of probability values p'1, p'2, ..., p'm will need to be computed so that the values (np'1, np'2, ..., np'm) are the estimated expected values of the multinomial random variable (X1, X2, ...., Xm). In this case, the random variable C will still have a chi-square distribution, but its degrees of freedom will be reduced. In particular, if the density function fo(y) has r unknown parameters,
For this goodness-of-fit test, we formulate the null and alternative hypothesis as
Ho: fY(y) = fo(y)
H1: fY(y) ¹ fo(y)
At the a level of significance, Ho will be rejected in favor of H1 if