|
ANCOVA, or analysis of covariance is an old-fashioned name for a linear regression model with one continuous explanatory variable and one or more factors. The name exists for historical reasons, but there is no particular reason to distinguish the method from the general purpose linear model. ANCOVA is a statistical technique of controlling extraneous variables in correlational studies. In probability theory and statistics, the covariance between two real-valued random variables X and Y, with expected values and is defined as: where E is the expected value. ...
Extraneous variables are variables other than the independent variable that may bear any effect on the behavior of the person being studied. ...
ANCOVA is a merger of ANOVA and regression for continuous variables. ANCOVA tests whether certain factors have an effect after controlling for quantitative predictors. The inclusion of covariates increases statistical power because it accounts for some of the variability. In statistics, analysis of variance (ANOVA) is a collection of statistical models and their associated procedures which compare means by splitting the overall observed variance into different parts. ...
The power of a statistical test is the probability that the test will reject a false null hypothesis, or in other words that it will not make a Type II error. ...
Equations
One-factor ANCOVA analysis One factor analysis is appropriate when dealing with more than 3 populations; k populations. The single factor has k levels equal to the k populations. n samples from each population are chosen random from their respective population.
Calculating the sum of squared deviates for the independent variable X and the dependent variable Y The sum of squared deviates (SS): SSTy, SSTry, and SSEy must be calculated using the following equations for the dependent variable, Y. The SS for the covariate must also be calculated, the two necessary values are SSTx and SSEx. Least squares or ordinary least squares (OLS) is a mathematical optimization technique which, when given a series of measured data, attempts to find a function which closely approximates the data (a best fit). It attempts to minimize the sum of the squares of the ordinate differences (called residuals) between points...
The total sum of squares determines the variability of all the samples. nT represents the total number of samples:  The sum of squares for treatments determines the variability between populations or factors. nk represents the number of factors: The sum of squares for error determines the variability within each population or factor. nn represents the number of samples with a given population: The total sum of squares is equal to the sum of the sum of squares for treatments and the sum of squares for error:  Calculating the covariance of X and Y The total sum of square covariates determines the covariance of X and Y within the all the data samples: Adjusting SSTy The correlation between X and Y is . Linear correlations between 1000 pairs of numbers. ...
  The proportion of covariance is subtracted from the dependent, SSy values:  - SSTryadj = SSTyadj − SSEyadj
Adjusting the means of each population k The mean of each population is adjusted in the following manner:  Analysis using adjusted sum of squares values Mean squares for treatments where dfTr is equal to NT − k − 1. dfTr is one less than in ANOVA to account for the covariance and dfE = k − 1:   The F statistic is In statistics and probability, the F-distribution is a continuous probability distribution. ...
 External links |