Ordinary least squares estimator

The ordinary least squares (OLS) estimator is the most widely used estimator in econometrics and is often at least the good starting point before exploring more complex tools to perform regression analysis.

In a regression analysis, we consider a variable \(y\), called the response or the endogenous or the explained variable and the model seeks to explain the variations of this variable from one observation to another. To achieve this goal, a set of other variables \(x\) called the covariates or explanatory or exogenous variables are introduced. These covariates are not only correlated to the response, but they are assumed to have a causal effect on the response. This means that a variation of \(x\) causes a change on the value of \(y\), as the opposite is not true. For example, wages are correlated with education levels. More precisely, a variation of education has a causal effect on wage, which means that wage is the response and education is a covariate.

The dictionary definition of regression is “backward movement, a retreat, a return to an earlier state of development”.1 Regression analysis has nothing to do with this definition. The term was first used in the statistical literature by Francis Galton who studied the relationship between the height of children and the height of parents. His result is that tall parents have tall children, but that “there was a tendency for children’s heights to converge toward the average”. Galton called this a “regression of children’s heights toward the average”. Actually, this result is a statistical artifact, called a regression fallacy.

Chapter 1 is devoted to the derivation of the simple linear model, which means that we’ll consider a unique covariate. Then, Chapter 2 will investigate the statistical properties of the simple linear model. Chapter 3 describes the multiple linear model, used when there is more than one covariate. Finally, Chapter 4 is devoted to the interpretation of the estimators.

Throughout this part, we’ll present a real econometric analysis, which has three components:

Maddala, G. S., and Kajal Lahiri. 2009. Introduction to Econometrics. 4th ed. Wiley.

  1. This paragraph is based on Maddala and Lahiri (2009), pages 59-60 and 102-105.↩︎