Difference between revisions of "Hypothesis Testing"


(A ProblemM. K. Chung's lecture notes, 2003.)
(A ProblemM. K. Chung's lecture notes, 2003.)
Line 13: Line 13:
 
==A Problem<ref>M. K. Chung's lecture notes, 2003.</ref>==
 
==A Problem<ref>M. K. Chung's lecture notes, 2003.</ref>==
  
I believe that dogs are as smart as people. Assume IQ of a dog follows {{#tag:math|Xi \sim N(\mu,10^2)}}. IQ of 10 dogs are measured: 30, 25, 70, 110, 40, 80, 50, 60, 100, 60. We want to test if dogs are as smart as people by testing
+
I believe that dogs are as smart as people. Assume IQ of a dog follows {{#tag:math|X_i \sim N(\mu,10^2)}}. IQ of 10 dogs are measured: 30, 25, 70, 110, 40, 80, 50, 60, 100, 60. We want to test if dogs are as smart as people by testing
  
 
<center>{{#tag:math|H_0 : \mu = 100 \text{ vs. } H_1 : \mu < 100}}.</center>
 
<center>{{#tag:math|H_0 : \mu = 100 \text{ vs. } H_1 : \mu < 100}}.</center>
 +
  
 
One reasonable thing one may try is to see how high the sample mean is.
 
One reasonable thing one may try is to see how high the sample mean is.
Line 24: Line 25:
 
[1] 62.5
 
[1] 62.5
 
</syntaxhighlight>
 
</syntaxhighlight>
 +
  
 
Since the average IQ of 10 dogs are lower than 100, one would be inclined to reject {{#tag:math|H_0}}.
 
Since the average IQ of 10 dogs are lower than 100, one would be inclined to reject {{#tag:math|H_0}}.
Line 29: Line 31:
 
Let {{#tag:math|\bar{X} }} be a test statistic and {{#tag:math|R = (−∞,90]}} to be a rejection region. Let’s compute the probability of making Type I error based on this testing procedure. Under the assumption {{#tag:math|H_0}} is true,
 
Let {{#tag:math|\bar{X} }} be a test statistic and {{#tag:math|R = (−∞,90]}} to be a rejection region. Let’s compute the probability of making Type I error based on this testing procedure. Under the assumption {{#tag:math|H_0}} is true,
  
 +
<center>{{#tag:math|X_i \sim N(100,10^2)}}</center>
 +
 +
Under this condition, {{#tag:math|\bar{X} \sim N(100, 10)}} and
 +
 +
<center>{{#tag:math|\alpha = P(\bar{X} \leq 90) }}</center>
 +
 +
<syntaxhighlight lang="r" line start="4">
 +
> pnorm(90,100,sqrt(10))
 +
[1] 0.0007827011
 +
</syntaxhighlight>
  
  

Revision as of 08:55, 16 November 2020

H. Kemal Ilter
2020


Concepts

  1. The null hypothesis [math]H_0[/math] is a claim about the value of a population parameter. The alternate hypothesis [math]H_1[/math] is a claim opposite to [math]H_0[/math].
  2. A test of hypothesis is a method for using sample data to decide whether to reject [math]H_0[/math]. [math]H_0[/math] will be assumed to be true until the sample evidence suggest otherwise.
  3. A test statistic is a function of the sample data on which the decision is to be based.
  4. A rejection region is the set of all values of a test statistic for which [math]H_0[/math] is rejected.
  5. Type I error: you reject [math]H_0[/math] when [math]H_0[/math] is true. [math]P(\text{Type I error}) = P(\text{reject }H_0 \mid H_0\text{ true}) = \alpha[/math]. The resulting [math]\alpha[/math] is called the significance level of the test and the corresponding test is called a level [math]\alpha[/math] test. We will use test procedures that give [math]\alpha[/math] less than a specified level (0.05 or 0.01).

A Problem[1]

I believe that dogs are as smart as people. Assume IQ of a dog follows [math]X_i \sim N(\mu,10^2)[/math]. IQ of 10 dogs are measured: 30, 25, 70, 110, 40, 80, 50, 60, 100, 60. We want to test if dogs are as smart as people by testing

[math]H_0 : \mu = 100 \text{ vs. } H_1 : \mu \lt 100[/math].


One reasonable thing one may try is to see how high the sample mean is.

1 > x<-c(30, 25, 70, 110, 40, 80, 50, 60, 100, 60)
2 > mean(x)
3 [1] 62.5


Since the average IQ of 10 dogs are lower than 100, one would be inclined to reject [math]H_0[/math].

Let [math]\bar{X} [/math] be a test statistic and [math]R = (−∞,90][/math] to be a rejection region. Let’s compute the probability of making Type I error based on this testing procedure. Under the assumption [math]H_0[/math] is true,

[math]X_i \sim N(100,10^2)[/math]

Under this condition, [math]\bar{X} \sim N(100, 10)[/math] and

[math]\alpha = P(\bar{X} \leq 90) [/math]
4 > pnorm(90,100,sqrt(10))
5 [1] 0.0007827011




References

  1. M. K. Chung's lecture notes, 2003.