FACTOID # 57: Two-thirds of the world's kidnappings occur in Colombia.
 
 Home   Encyclopedia   Statistics   Countries A-Z   Flags   Maps   Education   Forum   FAQ   About 
 
WHAT'S NEW
RECENT ARTICLES
More Recent Articles »
 

SEARCH ALL

FACTS & STATISTICS   

Search encyclopedia, statistics and forums:

 

 

(* = Graphable)

 

 


Encyclopedia > Residuals

In statistics, the concepts of error and residual are easily confused with each other.


Error is a misnomer; an error is the amount by which an observation differs from its expected value; the latter being based on the whole population from which the statistical unit was chosen randomly. The expected value, being the average of the entire population, is typically unobservable. If the average height of 21-year-old men is 5 feet 9 inches, and one randomly chosen man is 5 feet 11 inches tall, then the "error" is 2 inches; if the randomly chosen man is 5 feet 7 inches tall, then the "error" is −2 inches. The nomenclature arose from random measurement errors in astronomy. It is as if the measurement of the man's height were an attempt to measure the population average, so that any difference between the man's height and the average would be a measurement error.


A residual, on the other hand, is an observable estimate of the unobservable error. The simplest case involves a random sample of n men whose heights are measured. The sample average is used as an estimate of the population average. Then we have:

  • The difference between the height of each man in the sample and the unobservable population average is an error, and
  • The difference between the height of each man in the sample and the observable sample average is a residual.
Residuals are observable; errors are not.

Note that the sum of the residuals within a random sample is necessarily zero, and thus the residuals are necessarily not independent. The sum of the errors need not be zero; the errors are independent random variables if the individuals are chosen from the population independently.

Errors are often independent of each other; residuals are usually not independent of each other.

An example, with some of the mathematical theory

If we assume a normally distributed population with mean μ and standard deviation σ, and choose individuals independently, then we have

X_1, \dots, X_n\sim N(\mu,\sigma^2)

and the sample mean is a random variable distributed thus:

\overline{X}\sim N(\mu, \sigma^2/n).

The errors are then

\varepsilon_i=X_i-\mu,

whereas the residuals are

\widehat{\varepsilon}_i=X_i-\overline{X}.

(As is often done, the "hat" over the letter ε indicates an observable estimate of an unobservable quantity called ε.)


The sum of squares of the errors, divided by σ2, has a chi-square distribution with n degrees of freedom:

\sum_{i=1}^n \left(X_i-\mu\right)^2/\sigma^2\sim\chi^2_n.

This quantity, however, is not observable. The sum of squares of the residuals, on the other hand, is observable. The quotient of that sum by σ2 has a chi-square distribution with only n − 1 degrees of freedom:

\sum_{i=1}^n \left(\,X_i-\overline{X}\,\right)^2/\sigma^2\sim\chi^2_{n-1}.

It is remarkable that this random variable and the sample mean can be shown to be independent of each other. That fact and the normal and chi-square distributions given above form the basis of confidence interval calculations relying on Student's t-distribution. The σ appears in both the numerator and the denominator in those calculations and cancels. That is fortunate because in practice one would not know the value of σ2.


See also

Studentized residual


  Results from FactBites:
 
Work from home Australia residual income opportunities (627 words)
Earn residual income from 21+ countries on full automatic..
Our networks are now racing into China, Japan, South Korea..
Global residual income from 80+ countries is just a few clicks away for you.
Fuel oil - Wikipedia, the free encyclopedia (1103 words)
Fuel oil is a fraction obtained from petroleum distillation, either as a distillate or a residue.
Residual fuel oils are sometimes called light when they have been mixed with distillate fuel oil, while distillate fuel oils are called heavy when they have been mixed with residual fuel oil.
Residual fuel oil is less useful because it is so viscous that it has to be heated, which requires a special heating system, before use and it contains relatively high amounts of pollutants, particularly sulfur, which forms sulfur dioxide upon combustion.
  More results at FactBites »

 

COMMENTARY     


Share your thoughts, questions and commentary here
Your name
Your location
Your comments
Please enter the 5-letter protection code


Lesson Plans | Student Area | Student FAQ | Reviews | Press Releases |  Feeds | Contact
The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.
Usage implies agreement with terms.