STATISTICS III:
ESTIMATION
Characteristics of Estimators
When a parameter is being estimated, the estimate can be either a single number or it can be a range of scores. When the estimate is a single number, the estimate is called a "point estimate"; when the estimate is a range of scores, the estimate is called an interval estimate. Confidence intervals are used for interval estimates.
As an example of a point estimate, assume you wanted to estimate the mean time it takes 12- year-olds to run 100 yards. The mean running time of a random sample of 12-year-olds would be an estimate of the mean running time for all 12-year-olds. Thus, the sample mean, M, would be a point estimate of the population mean, m.
Often point estimates are used as parts of other statistical calculations. For example, a point estimate of the standard deviation is used in the calculation of a confidence interval for m. Point estimates of parameters are often used in the formulas for significance testing.
Point estimates are not usually as informative as confidence intervals. Their importance lies in the fact that many statistical formulas are based on them.
Characteristics of Estimators
Statistics are used to estimate parameters. There are three important attributes of statistics as estimators: unbiasedness, consistency, and relative efficiency.
When we say that a measurement is unbiased we mean that the average of a large set of unbiased measurements will be close to the true value. For example, the sample mean, M, is an unbiased estimate of the population mean, m.
When more than one
statistic can be used to estimate a
parameter, one will naturally be more
efficient than the other(s). In general
the relative efficiency of two
statistics differs depending on the shape
of the distribution of the
numbers in the population. Statistics that minimize
the sum of squared
deviations such as the mean are generally the most efficient
estimators
for normal distributions but not for skewed distributions.
Estimating variance
The formula for the variance computed in the population, s2, is different from the formula for an unbiased estimate of variance, S2, computed in a sample. The two formulas are shown below:
The unexpected difference between the two formulas is that the denominator is N for s2 and is N-1 for S2. That there should be a difference in formulas is very counterintuitive. To understand the reason that N-1 rather than N is needed in the denominator of the formula for S2, consider the problem of estimating s2 when the population mean, m, is already known.
Assume that you knew that the mean amount
of practice
it takes student pilots to master a particular maneuver is 12
hours. If
you sampled one pilot and found he or she took 14 hours to
master the maneuver,
what would be your estimate of s2 ? The
answer lies
in considering the
definition of variance: It is the average
squared deviation
of individual scores from m.
With only one score, you have one squared deviation of
a score
from m. In this example, the one squared deviation is:
(X-m)2 = (14-12)2= 4.
This single
squared deviation from the mean is the best
estimate of the average
squared deviation and is an unbiased estimate of
s2. Since it
is based on
only one score, the estimate is not a very good
estimate although it is
still unbiased. It follows that if m is known and
N scores are sampled
from the population, then an unbiased estimate of s2 could be
computed
with the following formula:
Now it is time to consider what happens
when m is not
known and M is used as an estimate
of m. Which value is going to be
larger for a sample of N values of X:
or
?
Since M is the mean of the N values of X and since the
sum of
squared deviations of a set of numbers from their own mean is smaller
than
the sum of squared deviations from any other number, the
quantity
will always be smaller than
The argument goes that since
is an unbiased estimate of s^2 and
since
will always be smaller than
then
must be biased and will have a tendency to underestimate
s2. It
turns
out that dividing by N-1 rather than by N increases the estimate
just enough
to eliminate the bias exactly.
Confidence intervals
Before a simple research question such as " What is the mean number of digits that can be remembered?" can be answered, it is necessary to specify the population of people to which it is addressed. The researcher could be interested in, for example, adults over the age of 18, all people regardless of age, or students attending high school. For the present example, assume the researcher is interested in students attending high school.
Once the population is specified, the next step is to take a random sample from it. In this example, let's say that a sample of 10 students were drawn and each student's memory tested. The way to estimate the mean of all high school students would be to compute the mean of the 10 students in the sample. Indeed, the sample mean is an unbiased estimate of m, the population mean. But it will certainly not be a perfect estimate. By chance it is bound to be at least either a little bit too high or a little bit too low.
For the estimate of m to be of value, one must have some idea of how precise it is. That is, how close to m is the estimate likely to be?
An excellent way to specify the precision is to construct a confidence interval. If the number of digits remembered for the 10 students were: 4, 4, 5, 5, 5, 6, 6, 7, 8, 9 then the estimated value of m would be 5.9 and the 95% confidence interval would range from 4.71 to 7.09.
The wider the interval, the more confident you are that it contains the parameter. The 99% confidence interval is therefore wider than the 95% confidence interval and extends from 4.19 to 7.61.
Below are shown some examples of possible confidence intervals. Although the parameter m2 - m2 represents the difference between two means, it is still valid to think of it as one parameter; π2 - π2 can also be thought of as one parameter.
A confidence interval is a range of values that has a specified probability of containing the parameter being estimated. The 95% and 99% confidence intervals which have .95 and .99 probabilities of containing the parameter respectively are most commonly used. If the parameter being estimated were m, the 95% confidence interval might look like the following:
12.5 <= m <= 30.2
What this means is that the interval between 12.5 and 30.2 has a .95 probability of containing m.
A confidence interval
only has the specified
probability of containing the parameter if the sample
data on which it is
based is the only information available about the value
of the
parameter.
As an extreme example, consider the
case in which 1000
studies estimating the value of m in a certain
population all resulted
in estimates between 25 and 30. If one more study
were conducted and if
the 95% confidence interval on m were
computed (based on that one
study) to be:
35 <= m <= 45
then it would be absurd to say that the probability that m is between 35 and 45 is .95. It almost certainly is not. However, if the only data you had to go on were that one study, then, from your point of view, the probability is .95.
It is important to be very precise about
the sense in
which a confidence interval has a specified probability of
containing a
parameter: If the procedure for computing a 95% confidence
interval is
used over and over, 95% of the time the interval will contain
the parameter. Next you'll see how to compute a
confidence interval for the mean of a normally-distributed
variable for which the population standard deviation is known.
In practice, the population standard deviation is rarely
known. However, learning how to compute a confidence interval
when the standard deviation is known is an excellent
introduction to how to compute a confidence interval when the
standard deviation has to be estimated.
Three values are used to construct a confidence interval for
m: the sample mean (M), the value of z (which depends on the
level of confidence), and the standard error of the mean
(σM).
The confidence interval has M for its center and extends a
distance equal to the product of z and σM in both
directions.
Therefore, the formula for a confidence interval is
M - z*σM <= m <= M + z*σM .
Assume that the standard deviation of SAT verbal scores in a
school system is known to be 100. A researcher wishes to
estimate the mean SAT score and compute a 95% confidence
interval from a random sample of 10 scores.
The 10 scores are: 320, 380, 400, 420, 500, 520, 600, 660, 720, and 780.
Therefore, M = 530, N = 10, and
=
= 31.62. The value of z for the 95%
confidence interval is the number of standard deviations one must go from
the mean (in both directions) to contain .95 of the scores.
It turns out that one must go 1.96 standard deviations from the mean in
both directions to contain .95 of the scores. The value of 1.96 was found
using a z table. Since each tail is to contain .025 of the scores, you
find the value of z for which 1-.025 = .975 of the scores are below. This
value is 1.96. 
All the components of the confidence interval are now known: M = 530,
σM =
31.62, z = 1.96.
Lower limit = 530 - (1.96)(31.62) = 468.02
Upper limit = 530 + (1.96)(31.62) = 591.98
Confidence intervals can be constructed for any estimated parameter, not just m. For example, one might estimate the proportion of people who could pass a training program or the difference between the mean for subjects taking a drug and those taking a placebo.
More
about
confidence intervals
HOMEWORK
1. Which is wider, a 93% or a 98% confidence interval? Explain.
2. When you construct a 95% confidence interval, what are you 95%
confident about?
3. What is the effect of sample size on the width of a confidence
interval? Explain.
4. Why is the construction of confidence intervals considered part of
inferential statistics?
5. A population is known to be normally-distributed with a standard
deviation of 2.8. Compute the 95% confidence interval on the mean based on
the following sample of 6 numbers: 8, 9, 12, 13, 14, 16.