How to Convince a Student that

an Estimator is a Random

Variable

KERSTIN VANNMAN

At the University of Lulea, all students of engineering subjects take a compulsory course in statistics during their second semester (out of eight). The course contains the standard concepts, and we try to emphasize the inference part. The students are taught, in classes of about 30, in a student-activity kind of way. This means that the teacher tries to avoid talking constantly and instead presents the theory and examples in a form of a dialogue. I have found it most important to start every lesson by not talking. Instead I present a small problem to the students on a prepared over-head transparency, and they immediately begin to solve it on their own. The problem usually has a connection with the theory to be discussed during the lesson but can also be a review of the previous lesson. The most important thing is that the lesson begins with student activity rather than student passivity. We all know how easily they fall asleep during lectures.

 

Since most students have poor, or almost no, pre-knowledge in statistics from secondary school most of the concepts in the elementary course are new and difficult. One such concept to get across is that of an estimator being a random variable. To help the students get the appropriate feel for this situation I have found the following way fruitful. It is inspired by Professor Gottfried Noether and his book "Introduction to Statistics. A Nonparametric Approach".

First the students are asked to prepare this lesson by reading a few pages in their text-book (Vännman (1979)) in advance. There they are introduced, by an example, to the idea of a random sample and an estimator. Then I begin my lesson by showing the following transparency (see Figure 1). I, of course, use a Swedish version.
 
The author of our text-book visited the university of Sheffield in 1977. Professor Gottfried Noether was also there then. We had a walk through the streets of Sheffield. Suddenly Gottfried stopped and said, "Look, the taxis carry numbers" He noted the number of the taxi he had seen. We remained, watched taxis, and got the following numbers: 97, 234, 166, 7, 65, 17, 4 . Then Gottfried asked, "How many taxi cabs are there in this city?" How can we estimate this number with the aid of the numbers of the taxis?

Figure 1.
 

The two questions on the transparency always start a lot of activity. Some students press their calculators finding a figure, others start to doubt everything. I try to lead the discussion into the direction of making assumptions and finding a model. After a while, and some convincing discussion, we all agree on the most simple model, i.e. a uniform distribution on 1, 2 to some unknown number N. This is an important discussion since the students realize that you usually start by simplifying reality with the aid of a lot of assumptions, some of which might be unrealistic.

But now we want to estimate N, the unknown number of cars. There are always a lot of suggestions from the students. Very often the first one is N 168. When I ask where this number comes from the answer is, "I took two times the average of the observed numbers, minus 1", i.e.

= 2 - 1.                 (1)

Sometimes they skip the "minus 1", and I do not bother. The reason the students give for (1) is that the average of the observed numbers ought to be approximately equal to the average of 1, 2 N, i.e.

   is approx = (N +1)/2

But when they see that 168 is smaller than the largest observed number they reject the suggestion and go on with others. The next suggestion often is two times the median minus 1, but that is also smaller than the largest observation. Then usually the suggestion N = 237 occurs. This comes from

N = Xmax + Xmin ? 1,             (2)

which in turn comes from

(Xmax +Xmin)/2 is approx = (N+l)/2

This will at least always be greater than or equal to the largest observation.

At this moment I draw a number line on the blackboard indicating the taxi numbers. We find that suggestion (2) is Xmax + the first gap. That is a good idea, but we also have other gaps. Let us use all of them. After some discussion the students usually get N = 266, which comes from

= Xmax + the average of all gaps. (3)

Suddenly we have many different suggestions on the blackboard. What is the true number of cars? Here the students get an aha-experience. Their teacher does not know the answer. Some students react with anger. "Why didn’t you ask a taxi-driver?" Others may propose a trip to Sheffield. It is good that the students get emotionally involved.

Then some of them realize why I have brought a micro computer into the classroom. We can now simulate the situation and maybe see which method is to be preferred. I have a prepared program that first simulates the number of cars, N, using a uniform distribution on 200, 201 600. Then it simulates a chosen number of observations from a uniform distribution on 1, 2 N, and calculates the three estimates given in formulas (l)?(3). One may repeat the simulation of the observations for the same "unknown" N as long as one wants.

Since the display of the micro computer is small I ask one student to act as a secretary at the blackboard and to write down the three estimates after each simulation. At this moment the true number N is unknown and all students can follow how the estimates vary and make guesses. The part above the thick line in Figure 2 is what they might see emerge, row by row, on the blackboard. They decide themselves how many observations to be simulated, 20 is a common suggestion.

TAXI SIMULATION

20 observations per

simulation
 
 
(1)
(2)
(3)
 
407
447
460
 
535
459
478
 
440
463
482
 
534
490
502
 
524
522
508
 
452
465
482
 
534
502
504
 
539
478
493
 
468
517
511
 
569
479
497
AVER
500
482
492
STD
54
25
16
True number = 489

Figure 2.
 

Here they can see that the value of the estimator varies, and hence the estimator may be considered as a random variable. They also realize that if we go on for a while with the simulation we will find the distribution of the estimator. We usually stop after 10 simulations. The program also calculates the average and the standard deviation of the estimates introducing the idea of unbiasedness and small variance (see Figure 2). When I ask which method they prefer the answer is immediately (3) because of the small standard deviation. Then we get the true value from the program, in this case 489.

I usually do one series of simulations on the blackboard and then sow some others prepared in advance, including an example with a larger number of simulations. (See Figure 3 and Figure 4, showing 100 simulations.) We have introduced stem-and-leaf displays and boxplots (see e.g. Velleman and Hoaglin (1981)) earlier in the course.

There are many interesting loose ends to be pulled here, depending on how deep you want to go. I talk a little bit more formally about the matter and then the students begin to solve exercises. But we are not spending time checking whether estimators are unbiased or not or calculating variances for different estimators. Instead the students use simulated values from different distributions to compare estimators, as we did in the taxi example.
 
 


 

I have found many advantages with a lesson like this. The students understand, for example, that an estimator is a random variable, since they have seen how the value of the estimator varies when we repeat an experiment. They also realize that in reality we have only one observation of the estimator. Furthermore, since we want to come close to the true number they understand that unbiasedness and small variance are desirable properties. They find that model assumptions are important. Since students are active during such a lesson, they remember it. Therefore I can remind them of the taxi simulation, which is convenient, for instance, when treating. confidence intervals. Still another advantage is that the lesson provides an efficient way of using the short time available.

Sometimes I hear teachers say, "But I have no time for such spectacular shows". I do not believe that. On the contrary, you need them. I and some colleagues have used the taxi show successfully for several years, and we will go on using it for many years to come.


 
 

Acknowledgement

The cartoons, The Footies, appearing in this article are by Andrejs Dunkels, to whom the author wishes to express her sincere thanks.

University of Lulea, Sweden

References

Noether, G. (1976). Introduction to Statistics. A Nonparametric Approach. Second Edition.  Houghton Muffin Company, Boston.

Vännman, K. (1979). Matematisk statistik. Hbgskolan i Lulea. (In Swedish.)

Velleman, P. and Hoaglin D. (1981). Applications, Basics, and Computing of ExploratoryData Analysis. Duxbury Press, Boston.
 

Back to top

Back to contents of The Best of Teaching Statistics
Back to main Teaching Statistics page