Chapter 9 Regression on Dummy Explanatory Variables
9.1 The Nature of Dummy Variables
⚫ 1. Concept: Dummy variables (also called indicator variables, binary variables, categorical variables, or dichotomous variables) represent qualitative variables in a regression model, for example sex, race, color, religion, nationality, or marital status.
Qualitative variables can be quantified by constructing artificial variables that take on values of 1 or 0:
0: indicating the absence of an attribute
1: indicating the presence (or possession) of that attribute
Dummy variable (D): a variable that assumes values such as 0 and 1.
⚫ 2. ANOVA model: Regression models that contain only dummy explanatory variables are called analysis-of-variance (ANOVA) models.
Y_i = B_1 + B_2 D_i + u_i
ANOVA models are commonly used in fields such as sociology, psychology, education, and market research.
(1) Dummy variables generally take on values of 1 or 0 and are nonstochastic; that is, their values are fixed.
(2) Estimation: Dummy explanatory variables pose no new estimation problems. Under the assumptions of the CLRM, the customary OLS method can be used to estimate the parameters of models that contain dummy variables.
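As a rough illustration of the estimation point, the sketch below fits the ANOVA model by OLS using NumPy. The data values and variable names are invented for illustration and are not from the text. In a model with an intercept and a single dummy, the OLS intercept equals the mean of the base group (D = 0), and the dummy coefficient equals the difference between the two group means.

```python
import numpy as np

# Hypothetical data (not from the text): six observations of an outcome y,
# with D = 1 when the attribute is present and D = 0 when it is absent.
y = np.array([10.0, 12.0, 14.0, 20.0, 22.0, 24.0])
D = np.array([0.0, 0.0, 0.0, 1.0, 1.0, 1.0])

# Design matrix: a column of ones for the intercept B_1, then the dummy for B_2.
Z = np.column_stack([np.ones_like(D), D])

# Ordinary least squares via NumPy's least-squares solver.
coef = np.linalg.lstsq(Z, y, rcond=None)[0]
b1, b2 = coef

# Group means, for comparison with the OLS estimates.
mean_base = y[D == 0].mean()
mean_other = y[D == 1].mean()

print("b1 (intercept):", b1, " base-group mean:", mean_base)
print("b2 (dummy):    ", b2, " difference in means:", mean_other - mean_base)
```

With these made-up numbers the group means are 12 and 22, so the estimates should come out as approximately 12 and 10.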
9.2 Regression with One Quantitative Variable and One Qualitative Variable with Two Categories: ANCOVA Models
Y_i = B_1 + B_2 D_i + B_3 X_i + u_i (9.6)
Features:
1. If a qualitative variable has m categories, introduce (m - 1) dummy variables. If there are only two categories, use only one dummy variable.
2. The assignment of 1 and 0 values to two categories, such as male and female, is arbitrary.
3. The category that is assigned the value of 0 is often referred to as the base, benchmark, control, comparison, or omitted category.
4. The coefficient B_2 attached to the dummy variable D can be called the differential intercept coefficient because it tells by how much the intercept of the category that receives the value of 1 differs from the intercept of the base category.
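The differential intercept interpretation can be sketched numerically. The example below simulates data from a model of the form (9.6) and estimates it by OLS with NumPy. The "true" coefficients (2, 3, 0.5), the sample size, and the random seed are arbitrary assumptions chosen for illustration, not values from the text.

```python
import numpy as np

# Simulated data under assumed coefficients B_1 = 2, B_2 = 3, B_3 = 0.5.
rng = np.random.default_rng(0)
n = 200
D = rng.integers(0, 2, size=n).astype(float)  # dummy: 0 = base category, 1 = other
X = rng.uniform(0.0, 10.0, size=n)            # quantitative regressor
u = rng.normal(0.0, 1.0, size=n)              # error term
Y = 2.0 + 3.0 * D + 0.5 * X + u

# Design matrix with columns for the intercept, the dummy, and X.
Z = np.column_stack([np.ones(n), D, X])
coef = np.linalg.lstsq(Z, Y, rcond=None)[0]
b1, b2, b3 = coef

# Both categories share the slope b3; only the intercepts differ.
intercept_base = b1        # category with D = 0
intercept_one = b1 + b2    # category with D = 1

print("base intercept:", intercept_base)
print("D = 1 intercept:", intercept_one)
print("differential intercept b2:", b2)
print("common slope b3:", b3)
```

The two fitted regression lines are parallel: they have the same slope b3 and are separated vertically by b2.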
9.3 Regression on a Quantitative Variable and a Qualitative Variable with More Than Two Classes or Categories: Introduce m - 1 Dummy Variables
⚫ 1. Model:
Y_i = B_1 + B_2 D_{2i} + B_3 D_{3i} + B_4 X_i + u_i (9.13)
E(Y_i | D_{2i} = 0, D_{3i} = 0, X_i) = B_1 + B_4 X_i (9.14)
E(Y_i | D_{2i} = 1, D_{3i} = 0, X_i) = (B_1 + B_2) + B_4 X_i (9.15)
E(Y_i | D_{2i} = 0, D_{3i} = 1, X_i) = (B_1 + B_3) + B_4 X_i (9.16)
⚫ 2. Estimate
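One way model (9.13) might be estimated is sketched below for a qualitative variable with m = 3 categories. The category codes, data values, and coefficients (1, 2, 4, 0.5) are hypothetical and noise-free, so that the fit can be checked by eye. The sketch also shows why only m - 1 dummies are used: adding a dummy for every category alongside the intercept makes the design matrix rank-deficient, because the dummies sum to the intercept column.

```python
import numpy as np

# Hypothetical data: category codes 0, 1, 2, with category 0 as the base.
cat = np.array([0, 0, 0, 1, 1, 1, 2, 2, 2])
X = np.array([1.0, 2.0, 3.0, 2.0, 3.0, 4.0, 3.0, 4.0, 5.0])

# m - 1 = 2 dummies; the base category gets 0 on both.
D2 = (cat == 1).astype(float)
D3 = (cat == 2).astype(float)

# Noise-free outcome built from assumed coefficients, for illustration only.
Y = 1.0 + 2.0 * D2 + 4.0 * D3 + 0.5 * X

# Design matrix for (9.13): intercept, D2, D3, X.
Z = np.column_stack([np.ones(len(X)), D2, D3, X])
coef = np.linalg.lstsq(Z, Y, rcond=None)[0]
b1, b2, b3, b4 = coef

# Intercepts implied by (9.14), (9.15), and (9.16).
intercepts = {"base": b1, "category 2": b1 + b2, "category 3": b1 + b3}
print(intercepts, "common slope:", b4)

# Using all m dummies plus an intercept gives 5 columns but only rank 4.
D1 = (cat == 0).astype(float)
Z_all = np.column_stack([np.ones(len(X)), D1, D2, D3, X])
rank_all = np.linalg.matrix_rank(Z_all)
print("columns:", Z_all.shape[1], "rank:", rank_all)
```

Here b2 and b3 measure how far the intercepts of the second and third categories lie from the base-category intercept b1.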