Chapter 2 Some Basic Terms and Calculations

2.1 The Mean

One of the most familiar and commonly estimated population parameters is the mean. Given a simple random sample, the population mean is estimated by: $\bar{X} = \frac{\sum_{i=1}^{n} X_i}{n}$

where:

  • $X_i$ = The observed value of the $i$th unit in the sample.

  • $n$ = The number of units in the sample.

  • $\sum_{i=1}^{n} X_i$ means to sum up all $n$ of the X-values in the sample.

If there are $N$ units in the population, the total of the X-values over all units in the population would be estimated by: $\hat{T} = N\bar{X}$

The circumflex (^) over the T is frequently used to indicate an estimated value as opposed to the true but unknown population value.
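As a minimal sketch of the two estimators above, the following Python snippet computes the sample mean and the estimated population total. The sample values and the population size of 200 are hypothetical, chosen only for illustration.

```python
# Hypothetical simple random sample (illustrative values, not from the text).
sample = [12.0, 15.0, 9.0, 14.0]
N = 200  # assumed number of units in the population

n = len(sample)
mean = sum(sample) / n   # X-bar = (sum of the X_i) / n
total_hat = N * mean     # T-hat = N * X-bar

print("estimated mean:", mean)
print("estimated total:", total_hat)
```

As the text notes, both estimates presume that the units were selected entirely at random.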

It should be noted that this estimate of the mean is used for a simple random sample. It may not be appropriate if the units included in the sample are not selected entirely at random.

Methods of computing confidence limits for the mean are discussed in the section on sampling.

2.2 Standard Deviation

Another commonly estimated population parameter is the standard deviation. The standard deviation characterizes dispersion of individuals about the mean. It gives us some idea whether most of the individuals in a population are close to the mean or spread out. The standard deviation of individuals in a population is frequently symbolized by σ (sigma). On the average, about two-thirds of the unit values of a normal population will be within 1 standard deviation of the mean. About 95 percent will be within 2 standard deviations and about 99 percent within 2.6 standard deviations.

We will seldom know or be able to determine σ exactly. However, given a sample of individual values from the population we can often make an estimate of σ, which is commonly symbolized by $s$. For a simple random sample of $n$ units, the estimate is: $s = \sqrt{\frac{\sum X^2 - \frac{(\sum X)^2}{n}}{n - 1}}$

where:

$\sum X^2$ = the sum of squared values of all individual measurements
$(\sum X)^2$ = the square of the sum of all measurements

This is equivalent to the formula $s = \sqrt{\frac{\sum (X_i - \bar{X})^2}{n - 1}}$

where:

$\bar{X}$ = the arithmetic mean = $\frac{\sum X}{n}$
$(X_i - \bar{X})$ = deviation of an individual measurement from the mean of all measurements.
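The equivalence of the two formulas for $s$ follows from expanding the squared deviations and using $\sum X_i = n\bar{X}$. A short sketch of the algebra:

```latex
\begin{aligned}
\sum (X_i - \bar{X})^2
  &= \sum X_i^2 - 2\bar{X}\sum X_i + n\bar{X}^2 \\
  &= \sum X_i^2 - 2n\bar{X}^2 + n\bar{X}^2 \\
  &= \sum X_i^2 - n\bar{X}^2 \\
  &= \sum X_i^2 - \frac{\left(\sum X_i\right)^2}{n}
\end{aligned}
```

Dividing either side by $n - 1$ and taking the square root gives the two expressions for $s$.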

Here is an example: Ten individual trees in a loblolly pine plantation were selected at random and measured. Their diameters were 9, 9, 11, 9, 7, 7, 10, 8, 9, and 11 inches. Based on this sample, what are the arithmetic mean diameter and the standard deviation? Tabulating the measurements and squaring each of them:

X     $X^2$
9     81
9     81
11    121
9     81
7     49
7     49
10    100
8     64
9     81
11    121

$\sum X = 90$ and $\sum X^2 = 828$ for the table above.

Mean $= \bar{X} = \frac{\sum X}{n} = \frac{90}{10} = 9.0$

Standard deviation $= s = \sqrt{\frac{\sum X^2 - \frac{(\sum X)^2}{n}}{n - 1}} = \sqrt{\frac{828 - \frac{90^2}{10}}{10 - 1}} = \sqrt{\frac{18}{9}} = 1.414$
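The tree-diameter calculation can be reproduced with a few lines of Python. This is a sketch rather than a prescribed procedure: it evaluates both forms of the formula and cross-checks them against the standard library's `statistics.stdev`, which also divides by n − 1. The variable names are illustrative.

```python
import math
import statistics

diameters = [9, 9, 11, 9, 7, 7, 10, 8, 9, 11]  # inches, from the example
n = len(diameters)

sum_x = sum(diameters)                   # sum of X
sum_x2 = sum(x * x for x in diameters)   # sum of X squared
mean = sum_x / n

# Computational form: needs only the two running sums.
s_computational = math.sqrt((sum_x2 - sum_x ** 2 / n) / (n - 1))

# Deviation form: squared deviations from the mean.
s_deviations = math.sqrt(sum((x - mean) ** 2 for x in diameters) / (n - 1))

# Library cross-check (sample standard deviation, divisor n - 1).
s_library = statistics.stdev(diameters)

print(mean, round(s_computational, 3), round(s_deviations, 3), round(s_library, 3))
```

The computational form is convenient for hand tabulation; the deviation form is generally less prone to rounding error when values are large relative to their spread.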

Statisticians often speak in terms of variance rather than standard deviation. The variance is simply the square of the standard deviation. The population variance is symbolized by $\sigma^2$ and the sample estimate of the variance by $s^2$.

Using the sample range to estimate the standard deviation. – The standard deviation of the sample is an estimate of the standard deviation (σ) of the population. The sample range (R) may also be used to estimate the population standard deviation. Table 1 (Appendix) shows the ratio of the population standard deviation to the range for simple random samples of various sizes. In the example we’ve been using, the range is $11 - 7 = 4$. For a sample of size 10, the table gives the value of the ratio $\sigma / R$ as 0.325. Therefore, $\sigma / 4 = 0.325$ and $\sigma = 1.3$ is an estimate of the true population standard deviation. Though easy to compute, this is an efficient estimator of σ only for very small samples (say fewer than 7 observations).
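The range-based estimate amounts to a single multiplication, sketched below. The ratio 0.325 is the value the text quotes from Table 1 for a sample of 10; ratios for other sample sizes would have to be read from that table and are not reproduced here.

```python
diameters = [9, 9, 11, 9, 7, 7, 10, 8, 9, 11]  # inches, from the example

ratio_sigma_to_range = 0.325  # sigma / R for n = 10, as quoted in the text

sample_range = max(diameters) - min(diameters)    # 11 - 7
sigma_hat = ratio_sigma_to_range * sample_range   # range-based estimate of sigma

print(sample_range, round(sigma_hat, 3))
```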

2.3 Coefficient of Variation

In nature, populations with large means often show more variation than populations with small means. The coefficient of variation (C) facilitates comparison of variability about different sized means. It is the ratio of the standard deviation to the mean. A standard deviation of 2 for a mean of 10 indicates the same relative variability as a standard deviation of 16 for a mean of 80. The coefficient of variation would be 0.20 or 20 percent in each case.

In the problem discussed in the previous section the coefficient of variation would be estimated by: $C = \frac{s}{\bar{X}} = \frac{1.414}{9.0} = 0.157$ or 15.7 percent.
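A small Python sketch of the same calculation follows. The helper name `coefficient_of_variation` is illustrative, not a standard-library function.

```python
import statistics

def coefficient_of_variation(values):
    """Ratio of the sample standard deviation (divisor n - 1) to the sample mean."""
    return statistics.stdev(values) / statistics.mean(values)

diameters = [9, 9, 11, 9, 7, 7, 10, 8, 9, 11]  # inches, from the example
cv = coefficient_of_variation(diameters)

# The two cases mentioned in the text have the same relative variability.
cv_small_mean = 2 / 10
cv_large_mean = 16 / 80

print(round(cv, 3), cv_small_mean, cv_large_mean)
```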

2.4 Standard Error of the Mean

There is usually variation among the individual units of a population. The standard deviation is a measure of this variation.

Since the individual units vary, variation may also exist among the means (or any other estimates) computed from samples of these units. Take, for example, a population with a true mean of 10. If we were to select four units at random, they might have a sample mean of 8. Another sample of four units from the same population might have a mean of 11, another 10.5, and so forth. Clearly it would be desirable to know the variation likely to be encountered among the means of samples from this population. A measure of the variation among sample means is the standard error of the mean. It can be thought of as a standard deviation among sample means; it is a measure of the variation among sample means, just as the standard deviation is a measure of the variation among individuals. As will be described in the section on simple random sampling, the standard error of the mean may be used to compute confidence limits for a population mean.

The computation of the standard error of the mean (often symbolized by $s_{\bar{x}}$) depends on the manner in which the sample was selected. For simple random sampling without replacement (i.e., a given unit cannot appear in the sample more than once) from a population having a total of $N$ units, the formula for the estimated standard error of the mean is: $s_{\bar{x}} = \sqrt{\frac{s^2}{n}\left(1 - \frac{n}{N}\right)}$

In the problem discussed in section 2.2 we had $n = 10$ and found that $s = 1.414$ or $s^2 = 2$. If the population contained 1,000 trees, the estimated mean diameter ($\bar{X} = 9.0$ inches) would have a standard error of: $s_{\bar{x}} = \sqrt{\frac{2}{10}\left(1 - \frac{10}{1000}\right)} = \sqrt{0.198} = 0.445$

The term $\left(1 - \frac{n}{N}\right)$ is called the finite population correction or fpc. If sampling is with replacement (not too common) or if the sampling fraction $\left(\frac{n}{N}\right)$ is very small (say less than 1/20), the fpc may be omitted and the standard error of the mean for a simple random sample is simply: $s_{\bar{x}} = \sqrt{\frac{s^2}{n}}$

The variance of the sample mean is simply the square of the standard error of the mean: $s_{\bar{x}}^2 = \frac{s^2}{n}\left(1 - \frac{n}{N}\right)$
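The standard error formula, with and without the finite population correction, might be coded as below. The function name and the convention of passing `N=None` to omit the fpc are choices made for this sketch, not something the text prescribes.

```python
import math

def standard_error_of_mean(s2, n, N=None):
    """Estimated standard error of the mean for a simple random sample.

    s2 -- sample variance
    n  -- number of units in the sample
    N  -- number of units in the population; None omits the fpc
    """
    fpc = 1.0 if N is None else 1.0 - n / N
    return math.sqrt(s2 / n * fpc)

# Tree-diameter example: s^2 = 2, n = 10, N = 1,000.
se_with_fpc = standard_error_of_mean(2.0, 10, 1000)
se_without_fpc = standard_error_of_mean(2.0, 10)

print(round(se_with_fpc, 3), round(se_without_fpc, 3))
```

With a sampling fraction of 1/100 the two results differ only in the third decimal place, which illustrates why the fpc is often dropped when the fraction is small.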

2.5 Covariance

Very often, each unit of a population will have more than a single characteristic. Trees, for example, may be characterized by their height, diameter, and form class. The covariance is a measure of the association between the magnitudes of two characteristics. If there is little or no association, the covariance will be close to zero. If the large values of one characteristic tend to be associated with the small values of another characteristic, the covariance will be negative. If the large values of one characteristic tend to be associated with the large values of another characteristic, the covariance will be positive. The population covariance of X and Y is often symbolized $\sigma_{xy}$; the sample estimate by $s_{xy}$.

Suppose that the diameter (inches) and age (years) have been obtained for a number of randomly selected trees. If we symbolize diameter by Y and age by X, the sample covariance of diameter and age is given by: $s_{xy} = \frac{\sum XY - \frac{(\sum X)(\sum Y)}{n}}{n - 1}$

This is equivalent to the formula: $s_{xy} = \frac{\sum (X - \bar{X})(Y - \bar{Y})}{n - 1}$

If $n = 12$ and the Y and X values were as follows:

X (age, years)    Y (diameter, inches)
20                4
40                9
30                7
45                7
25                5
45                10
30                9
40                6
20                8
35                6
25                4
40                11

  • $\sum X = 395$
  • $\sum Y = 86$

then $s_{xy} = \frac{(4)(20) + (9)(40) + \cdots + (11)(40) - \frac{(86)(395)}{12}}{12 - 1} = \frac{2960 - 2830.83}{11} = 11.74$

The positive covariance is consistent with the well known and economically unfortunate fact that the larger diameters tend to be associated with the older ages.
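The covariance calculation for the twelve trees can be checked with the following Python sketch, which evaluates both the computational form and the deviation form. Variable names are illustrative.

```python
age = [20, 40, 30, 45, 25, 45, 30, 40, 20, 35, 25, 40]   # X, years
diameter = [4, 9, 7, 7, 5, 10, 9, 6, 8, 6, 4, 11]        # Y, inches
n = len(age)

sum_x = sum(age)
sum_y = sum(diameter)
sum_xy = sum(x * y for x, y in zip(age, diameter))

# Computational form: raw sum of products minus the correction term.
s_xy_computational = (sum_xy - sum_x * sum_y / n) / (n - 1)

# Deviation form: products of deviations from the two means.
mean_x = sum_x / n
mean_y = sum_y / n
s_xy_deviations = sum(
    (x - mean_x) * (y - mean_y) for x, y in zip(age, diameter)
) / (n - 1)

print(sum_xy, round(s_xy_computational, 2), round(s_xy_deviations, 2))
```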

2.6 Simple Correlation Coefficient

The magnitude of the covariance, like that of the standard deviation, is often related to the size of the variables themselves. Units with large X and Y values tend to have larger covariances than units with small X and Y values. Also, the magnitude of the covariance depends on the units of measurement. In the previous example, had diameter been measured in millimeters instead of inches, the covariance would have been 298.196 instead of 11.74.

The simple correlation coefficient, a measure of the degree of linear association between two variables, is free of the effects of scale of measurement. It can vary from -1 to +1. A correlation of 0 indicates that there is no linear association (there may be a very strong nonlinear association, however). A correlation of +1 or -1 would suggest a perfect linear association. As for the covariance, a positive correlation implies that the large values of X are associated with the large values of Y. If the large values of X are associated with the small values of Y, the correlation is negative.

The population correlation coefficient is commonly symbolized by ρ (rho), and the sample-based estimate by $r$. The population correlation coefficient is defined to be: $\rho = \frac{\text{Covariance of } X \text{ and } Y}{\sqrt{(\text{Variance of } X)(\text{Variance of } Y)}}$

For a simple random sample, the sample correlation coefficient is computed as follows: $r = \frac{s_{xy}}{s_x s_y} = \frac{\sum xy}{\sqrt{(\sum x^2)(\sum y^2)}}$

where:

  • $s_{xy}$ = Sample covariance of X and Y
  • $s_x$ = Sample standard deviation of X
  • $s_y$ = Sample standard deviation of Y
  • $\sum xy$ = Corrected sum of XY products = $\sum XY - \frac{(\sum X)(\sum Y)}{n}$
  • $\sum x^2$ = Corrected sum of squares for X = $\sum X^2 - \frac{(\sum X)^2}{n}$
  • $\sum y^2$ = Corrected sum of squares for Y = $\sum Y^2 - \frac{(\sum Y)^2}{n}$

For the values used to illustrate the covariance we have:

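$\sum xy = (4)(20) + (9)(40) + \cdots + (11)(40) - \frac{(86)(395)}{12} = 129.1667$

$\sum y^2 = 4^2 + 9^2 + \cdots + 11^2 - \frac{86^2}{12} = 57.667$

$\sum x^2 = 20^2 + 40^2 + \cdots + 40^2 - \frac{395^2}{12} = 922.9167$

So, $r = \frac{129.1667}{\sqrt{(57.667)(922.9167)}} = \frac{129.1667}{230.6980} = 0.56$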
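The same arithmetic can be sketched in Python using the corrected sums of squares and products. The names `sp_xy`, `ss_x`, and `ss_y` are shorthand chosen for this example.

```python
import math

age = [20, 40, 30, 45, 25, 45, 30, 40, 20, 35, 25, 40]   # X, years
diameter = [4, 9, 7, 7, 5, 10, 9, 6, 8, 6, 4, 11]        # Y, inches
n = len(age)

sum_x = sum(age)
sum_y = sum(diameter)

# Corrected sum of products and corrected sums of squares.
sp_xy = sum(x * y for x, y in zip(age, diameter)) - sum_x * sum_y / n
ss_x = sum(x * x for x in age) - sum_x ** 2 / n
ss_y = sum(y * y for y in diameter) - sum_y ** 2 / n

r = sp_xy / math.sqrt(ss_x * ss_y)

print(round(sp_xy, 4), round(ss_x, 4), round(ss_y, 3), round(r, 2))
```

Because the divisor n − 1 cancels between numerator and denominator, the corrected sums give the same result as dividing the sample covariance by the product of the two sample standard deviations.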

Correlation or chance.–The computed value of a statistic such as the correlation coefficient depends on which particular units were selected for the sample. Such estimates will vary from sample to sample. More important, they will usually vary from the population value which we try to estimate.

In the above example, the sample correlation coefficient was 0.56. Does this mean that there is a real linear association between Y and X? Or could we get a value as large as this just by chance when sampling a population in which there is no linear association between Y and X (i.e., a population for which ρ=0)?

This can be tested by referring to table 7 (Appendix). The column headed “Degrees of freedom” refers to the sample size. A correlation coefficient estimated from a simple random sample of $n$ units will have $(n - 2)$ degrees of freedom. Looking in the row for 10 degrees of freedom we find in the column headed “5%” a value of 0.576. This says that in sampling from a population for which ρ=0 we would get a sample value as large as 0.576 just by chance about 5 percent of the time. Sample values smaller than 0.576 could occur more often than this. Thus we might conclude that our sample r=0.56 could have been obtained by chance in sampling from a population with a true correlation of zero.

This test result is usually summarized by saying that the sample correlation coefficient is not significant at the 0.05 level. In statistical terms, we tested the hypothesis that ρ=0 and failed to reject the hypothesis at the 0.05 level. This is not exactly the same as saying that we accept the hypothesis or that we have proved that ρ=0. The distinction is subtle but real.

For a sample correlation larger than 0.576 we might decide that the departure from a value of zero is larger than we would expect by chance. Statistically we would reject the hypothesis that ρ=0.
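The claim that a value as large as 0.576 turns up by chance about 5 percent of the time when ρ=0 can be checked with a small simulation. The sketch below is only an illustration, not the method used to build table 7. It assumes that X and Y are independent normal variables, reads "as large as" as referring to the absolute value of r (a two-sided reading, which is an assumption here), and uses an arbitrary seed and number of trials.

```python
import math
import random

def sample_r(xs, ys):
    """Sample correlation coefficient from corrected sums of squares and products."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return sp / math.sqrt(ss_x * ss_y)

rng = random.Random(12345)   # fixed seed so the run is repeatable
n = 12                       # sample size, giving n - 2 = 10 degrees of freedom
trials = 20000               # arbitrary number of simulated samples
critical = 0.576             # tabulated 5% value quoted in the text

exceed = 0
for _ in range(trials):
    # Independent draws, so the true correlation is zero.
    xs = [rng.gauss(0.0, 1.0) for _ in range(n)]
    ys = [rng.gauss(0.0, 1.0) for _ in range(n)]
    if abs(sample_r(xs, ys)) >= critical:
        exceed += 1

fraction = exceed / trials
print("fraction of samples with |r| >= 0.576:", fraction)
```

Under these assumptions the printed fraction should come out near 0.05, with some run-to-run variation if the seed is changed.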

2.7 Variance of a Linear Function

Quite often we will want to combine variables or population estimates in a linear function. For example, if the mean timber volume per acre has been estimated as $\bar{X}$, then the total volume on $M$ acres will be $M\bar{X}$; the estimate of total volume is a linear function of the estimated mean volume. If the estimate of cubic volume per acre in sawtimber is $\bar{X}_1$ and of pulpwood above the sawtimber top is $\bar{X}_2$, then the estimate of total cubic-foot volume per acre is $\bar{X}_1 + \bar{X}_2$. If on a given tract the mean volume per half-acre is $\bar{X}_1$ for spruce and the mean volume per quarter-acre is $\bar{X}_2$ for yellow birch, then the estimated total volume per acre of spruce and birch would be $2\bar{X}_1 + 4\bar{X}_2$.

In general terms, a linear function of three variables (say $X_1$, $X_2$, and $X_3$) can be written as $L = a_1 X_1 + a_2 X_2 + a_3 X_3$

where:

  • $a_1$, $a_2$, and $a_3$ are constants.

If the variances are $s_1^2$, $s_2^2$, and $s_3^2$ (for $X_1$, $X_2$, and $X_3$ respectively) and the covariances are $s_{1,2}$, $s_{1,3}$, and $s_{2,3}$, then the variance of $L$ is given by:

$s_L^2 = a_1^2 s_1^2 + a_2^2 s_2^2 + a_3^2 s_3^2 + 2(a_1 a_2 s_{1,2} + a_1 a_3 s_{1,3} + a_2 a_3 s_{2,3})$

The standard deviation (or standard error) of L is simply the square root of this.

The extension of the rule to cover any number of variables should be fairly obvious.
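One way to see the extension to any number of variables is to hold the variances and covariances in a square, symmetric table and sum over every pair of indices. The sketch below takes that approach; the function name, the nested-list layout, and the numbers in the example are all hypothetical.

```python
def linear_function_variance(coeffs, cov):
    """Variance of L = a_1*X_1 + ... + a_k*X_k.

    coeffs -- the constants a_1 ... a_k
    cov    -- k-by-k symmetric table; cov[i][i] is the variance of X_i and
              cov[i][j] is the covariance of X_i and X_j
    """
    k = len(coeffs)
    # Each off-diagonal pair is visited twice (i, j) and (j, i), which
    # supplies the factor of 2 in front of the covariance terms.
    return sum(
        coeffs[i] * coeffs[j] * cov[i][j]
        for i in range(k)
        for j in range(k)
    )

# Hypothetical three-variable case.
a = [1, 2, 3]
cov = [
    [4, 1, 2],    # s1^2 = 4,  s12 = 1, s13 = 2
    [1, 9, 3],    # s2^2 = 9,  s23 = 3
    [2, 3, 16],   # s3^2 = 16
]
var_L = linear_function_variance(a, cov)

# Same result written out term by term, following the three-variable rule.
var_L_by_hand = (1 * 4 + 4 * 9 + 9 * 16) + 2 * (1 * 2 * 1 + 1 * 3 * 2 + 2 * 3 * 3)

print(var_L, var_L_by_hand)
```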

Some examples

I. The sample mean volume per acre for a 10,000-acre tract is $\bar{X} = 5{,}680$ board feet with a standard error of $s_{\bar{x}} = 632$ (so $s_{\bar{x}}^2 = 399{,}424$). The estimated total volume is: $L = 10{,}000(\bar{X}) = 56{,}800{,}000$ board feet

The variance of this estimate would be: $s_L^2 = (10{,}000)^2 (s_{\bar{x}}^2) = 39{,}942{,}400{,}000{,}000$

Since the standard error of an estimate is the square root of its variance, the standard error of the estimated total is: $s_L = \sqrt{s_L^2} = 6{,}320{,}000$
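Example I can be verified with a few lines of Python. This sketch uses integer arithmetic for the variance so that the large value is exact; the variable names are illustrative.

```python
import math

acres = 10_000
mean_per_acre = 5_680   # board feet
se_mean = 632           # standard error of the mean, board feet

total = acres * mean_per_acre            # L = 10,000 * X-bar
var_total = acres ** 2 * se_mean ** 2    # a^2 times the variance of the mean
se_total = math.sqrt(var_total)          # standard error of the total

print(total, var_total, se_total)
```

For a function with a single term, the standard error of the total is simply the constant times the standard error of the mean, here 10,000 × 632.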

II. In 1955 a random sample of 40 one-quarter-acre circular plots was used to estimate the cubic foot volume of a stand of pine. Plot centers were monumented for possible relocation at a later time. The mean volume per plot was $\bar{X}_1 = 225$ cubic feet. The plot variance was $s_{x_1}^2 = 8{,}281$ so that the variance of the mean was $s_{\bar{x}_1}^2 = 8{,}281/40 = 207.025$.

In 1960 a second inventory was made using the same plot centers. This time, however, the circular plots were only one-tenth acre. The mean volume per plot was $\bar{X}_2 = 122$ cubic feet. The plot variance was $s_{x_2}^2 = 6{,}084$, so the variance of the mean was $s_{\bar{x}_2}^2 = 6{,}084/40 = 152.100$. The covariance of initial and final plot volumes was $s_{x_1, x_2} = 4{,}259$, making the covariance of the means $s_{\bar{x}_1, \bar{x}_2} = 4{,}259/40 = 106.475$.

The net periodic growth per acre would be estimated as: $G = 10\bar{X}_2 - 4\bar{X}_1 = 10(122) - 4(225) = 320$ cubic feet per acre.

By the rule for linear functions the variance of $G$ would be: $s_G^2 = (10)^2 s_{\bar{x}_2}^2 + (-4)^2 s_{\bar{x}_1}^2 + 2(10)(-4) s_{\bar{x}_1, \bar{x}_2}$

$= 100(152.100) + 16(207.025) - 80(106.475) = 10{,}004.4$

In this example there was a statistical relationship between the 1960 and 1955 means because the same plot locations were used in both samples. The covariance of the means ($s_{\bar{x}_1, \bar{x}_2}$) is a measure of this relationship. If the plots in 1960 had been located at random rather than at the 1955 locations, the two means would have been considered statistically independent and their covariance would have been set at zero. In this case the equation for the variance of the net periodic growth per acre ($G$) would reduce to: $s_G^2 = (10)^2 s_{\bar{x}_2}^2 + (-4)^2 s_{\bar{x}_1}^2 = 100(152.100) + 16(207.025) = 18{,}522.4$
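The growth example, with and without the covariance term, can be reproduced as follows. This is a sketch using the figures quoted in the text; the variable names are illustrative.

```python
n_plots = 40

mean_1955, mean_1960 = 225.0, 122.0             # cubic feet per plot
plot_var_1955, plot_var_1960 = 8281.0, 6084.0   # plot variances
plot_cov = 4259.0                               # covariance of 1955 and 1960 plot volumes

# Variances and covariance of the means.
var_mean_1955 = plot_var_1955 / n_plots
var_mean_1960 = plot_var_1960 / n_plots
cov_means = plot_cov / n_plots

# G = 10 * (1960 mean) - 4 * (1955 mean): per-acre expansion factors for
# one-tenth-acre and one-quarter-acre plots, with a minus sign for the
# earlier inventory.
a_1960, a_1955 = 10.0, -4.0
growth = a_1960 * mean_1960 + a_1955 * mean_1955

# Same plot centers in both years: the covariance term is included.
var_growth_same_plots = (
    a_1960 ** 2 * var_mean_1960
    + a_1955 ** 2 * var_mean_1955
    + 2 * a_1960 * a_1955 * cov_means
)

# Independently located plots: the covariance is taken as zero.
var_growth_independent = a_1960 ** 2 * var_mean_1960 + a_1955 ** 2 * var_mean_1955

print(growth, round(var_growth_same_plots, 1), round(var_growth_independent, 1))
```

Because the two coefficients have opposite signs, the positive covariance from remeasuring the same plots reduces the variance of the growth estimate.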