## Coefficient Table Use #3: Making Predictions

When we use regular least-squares regression, we obtain a linear equation that is intended to predict the expected (average) value of the dependent variable given the values of the -variables. In notation, we obtain the equation for the least-squares regression line:

If we want to make a prediction of the value of for a given set of values for the variables, we can just plug the -values and the regression coefficients (the ‘s) into the regression equation.

Using the least-squares regression example based on the Kid Creative data discussed in Part 1, suppose I wanted to predict the Household Income for a person with the following characteristics:

- Gender Male: IsFemale =
- Married: IsMarried =
- College Educated: HasCollege =
- Not a Professional: IsProfessional =
- Not Retired: IsRetired =
- Employed: Unemployed =
- Five years of Residency in Current City: ResLength =
- Dual Income: Dual =
- Has Children: Minors =
- Rents Home: Own =
- Lives in a house: House =
- Race is white: White =
- First language is English: English =
- No previous purchases: PrevParent = and PrevChild =

To predict the Household Income for this person, I simply plug these -values into the least-squares regression equation. To do so, I need the regression coefficients. The table below shows the regression coefficients pulled from the coefficient output table for the KidCreative Household Income least-squares regression example (click here to see the entire regression output).

Using these regression coefficients and the particular -values given above, the predicted household income is:

I certainly do not want to type all of this into a calculator, so I am going to compute the prediction using Excel. Here is the section of the Excel worksheet that I used:

Thus, the expected household income predicted by the least-square regression for a person with the -variable values listed above is about $42,300.

Prediction is one of the most important uses of the regression coefficient table. In the last part of this background series, I will discuss assessing the statistical uncertainty with respect to the regression coefficient and will briefly touch on the uncertainty with respect to predictions. Click here to proceed to Part 5.

**Questions or Comments?**

Any questions or comments? Please feel free to comment below. I am always wanting to improve this material.

I have had to implement a very simple “captcha” field because of spam comments, so be a bit careful about that. Enter your answer as a number (not a word). Also, save your comment before you submit it in case you make a mistake. You can use cntrl-A and then cntrl-C to save a copy on the clipboard. Particularly if your comment is long, I would hate for you to lose it.

In the example shown there is a negative effect from the coefficient relating to being a female (which is not activated since it’s multiplied by 0). However the positive effect of being male (as shown in previous examples) is not present as it has not been chosen as an X criteria. My question is, would you normally choose to have both mutually exclusive X criteria in the equation and if not how do you decide which to include?