Implications

The Gauss-Markov theorem shows that the least squares estimate β̂ is a good choice, but if the errors are correlated or have unequal variance, there will be better estimators. Even if the errors are well-behaved but non-normal, then non-linear or biased estimates may work better in some sense. So this theorem does not tell one to use least squares all the time; it just strongly suggests it unless there is some strong reason to do otherwise.

Situations where estimators other than ordinary least squares should be considered are

1. When the errors are correlated or have unequal variance, generalized least squares should be used.

2. When the error distribution is long-tailed, then robust estimates might be used. Robust estimates are typically not linear in y.

3. When the predictors are highly correlated (collinear), then biased estimators such as ridge regression might be preferable.

2.9 Mean and Variance of β̂

Now $\hat\beta = (X^TX)^{-1}X^Ty$, so

    Mean: $E\hat\beta = (X^TX)^{-1}X^TX\beta = \beta$  (unbiased)

    Variance: $\operatorname{var}\hat\beta = (X^TX)^{-1}X^T \sigma^2 I \, X (X^TX)^{-1} = (X^TX)^{-1}\sigma^2$

Note that since β̂ is a vector, $(X^TX)^{-1}\sigma^2$ is a variance-covariance matrix. Sometimes you want the standard error for a particular component, which can be picked out as in $se(\hat\beta_i) = \sqrt{\big((X^TX)^{-1}\big)_{ii}}\,\hat\sigma$.

2.10 Estimating σ²

Recall that the residual sum of squares was $\hat\varepsilon^T\hat\varepsilon = y^T(I - H)y$. Now after some calculation, one can show that $E(\hat\varepsilon^T\hat\varepsilon) = \sigma^2(n - p)$, which shows that

    $\hat\sigma^2 = \frac{\hat\varepsilon^T\hat\varepsilon}{n - p}$

is an unbiased estimate of σ². n − p is the degrees of freedom of the model. Actually, a theorem parallel to the Gauss-Markov theorem shows that it has the minimum variance among all quadratic unbiased estimators of σ².
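To make these results concrete, here is a small simulation sketch. It is not part of the text; the design matrix, true β and σ below are invented purely for illustration. It checks numerically that β̂ is unbiased with variance $(X^TX)^{-1}\sigma^2$ and that σ̂² = RSS/(n − p) is unbiased for σ²:

set.seed(1)
n <- 50; p <- 3; sigma <- 2
X <- cbind(1, runif(n), runif(n))            # invented n x p design matrix with an intercept column
beta <- c(1, 2, -1)                          # invented true coefficients
XtXi <- solve(t(X) %*% X)
nreps <- 2000
bhat <- matrix(0, nreps, p)
s2hat <- numeric(nreps)
for (i in 1:nreps) {
  y <- X %*% beta + rnorm(n, sd = sigma)     # generate a response under the model
  b <- XtXi %*% t(X) %*% y                   # least squares estimate (X'X)^{-1} X'y
  bhat[i, ] <- b
  s2hat[i] <- sum((y - X %*% b)^2)/(n - p)   # RSS/(n - p)
}
colMeans(bhat)       # close to beta = (1, 2, -1): unbiasedness of beta-hat
mean(s2hat)          # close to sigma^2 = 4: unbiasedness of the variance estimate
var(bhat)            # close to XtXi * sigma^2
XtXi * sigma^2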
2.11 Goodness of Fit

How well does the model fit the data? One measure is R², the so-called coefficient of determination or percentage of variance explained:

    $R^2 = 1 - \frac{\sum_i (\hat y_i - y_i)^2}{\sum_i (y_i - \bar y)^2} = 1 - \frac{RSS}{\text{Total SS (corrected for mean)}}$

[Figure 2.2: Variation in the response y when x is known is denoted by dotted arrows while variation in y when x is unknown is shown with the solid arrows.]

The range is 0 ≤ R² ≤ 1, with values closer to 1 indicating better fits. For simple linear regression, R² = r², where r is the correlation between x and y. An equivalent definition is

    $R^2 = \frac{\sum_i (\hat y_i - \bar y)^2}{\sum_i (y_i - \bar y)^2}$

The graphical intuition behind R² is shown in Figure 2.2. Suppose you want to predict y. If you don't know x, then your best prediction is ȳ, but the variability in this prediction is high. If you do know x, then your prediction will be given by the regression fit. This prediction will be less variable provided there is some relationship between x and y. R² is one minus the ratio of the sum of squares for these two predictions. Thus for perfect predictions the ratio will be zero and R² will be one.

Warning: R² as defined here doesn't make any sense if you do not have an intercept in your model. This is because the denominator in the definition of R² has a null model with an intercept in mind when the sum of squares is calculated. Alternative definitions of R² are possible when there is no intercept, but the same graphical intuition is not available and the R²'s obtained should not be compared to those for models with an intercept. Beware of high R²'s reported from models without an intercept.

What is a good value of R²? It depends on the area of application. In the biological and social sciences, variables tend to be more weakly correlated and there is a lot of noise. We'd expect lower values for R² in these areas: a value of 0.6 might be considered good. In physics and engineering, where most data come from closely controlled experiments, we expect to get much higher R²'s, and a value of 0.6 would be considered low. Of course, I generalize excessively here, so some experience with the particular area is necessary for you to judge your R²'s well.
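The equivalence between these descriptions of R² is easy to verify numerically. The following sketch is not from the text and uses invented simulated data; it shows that, for a simple linear regression with an intercept, the R² reported by lm() equals the squared correlation between x and y and equals 1 − RSS/TSS:

x <- runif(40)                                # invented data for illustration
y <- 1 + 2*x + rnorm(40, sd=0.3)
fit <- lm(y ~ x)
summary(fit)$r.squared                        # R^2 as reported by lm()
cor(x, y)^2                                   # r^2: the same value
1 - sum(resid(fit)^2)/sum((y - mean(y))^2)    # 1 - RSS/TSS: the same value again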
An alternative measure of fit is σ̂. This quantity is directly related to the standard errors of the estimates of β and of predictions. The advantage is that σ̂ is measured in the units of the response and so may be directly interpreted in the context of the particular dataset. This may also be a disadvantage in that one must judge the practical significance of this measure for the data at hand, whereas R², being unitless, is easy to understand.

2.12 Example

Now let's look at an example concerning the number of species of tortoise on the various Galapagos Islands. There are 30 cases (islands) and 7 variables in the dataset. We start by reading the data into R and examining it:

> data(gala)
> gala
          Species Endemics  Area Elevation Nearest Scruz Adjacent
Baltra         58       23 25.09       346     0.6   0.6     1.84
Bartolome      31       21  1.24       109     0.6  26.3   572.33
--- cases deleted ---
Tortuga        16        8  1.24       186     6.8  50.9    17.95
Wolf           21       12  2.85       253    34.1 254.7     2.33

The variables are

Species    The number of species of tortoise found on the island
Endemics   The number of endemic species
Area       The area of the island (km²)
Elevation  The highest elevation of the island (m)
Nearest    The distance from the nearest island (km)
Scruz      The distance from Santa Cruz island (km)
Adjacent   The area of the adjacent island (km²)

The data were presented by Johnson and Raven (1973) and also appear in Weisberg (1985). I have filled in some missing values for simplicity (see Chapter 14 for how this can be done). Fitting a linear model in R is done using the lm() command. Notice the syntax for specifying the predictors in the model. This is the so-called Wilkinson-Rogers notation. In this case, since all the variables are in the gala data frame, we must use the data= argument:

> gfit <- lm(Species ~ Area + Elevation + Nearest + Scruz + Adjacent,
             data=gala)
> summary(gfit)

Call:
lm(formula = Species ~ Area + Elevation + Nearest + Scruz + Adjacent,
    data = gala)

Residuals:
    Min      1Q  Median      3Q     Max
-111.68  -34.90   -7.86   33.46  182.58
Coefficients:
             Estimate Std. Error t value Pr(>|t|)
(Intercept)   7.06822   19.15420    0.37   0.7154
Area         -0.02394    0.02242   -1.07   0.2963
Elevation     0.31946    0.05366    5.95  3.8e-06
Nearest       0.00914    1.05414    0.01   0.9932
Scruz        -0.24052    0.21540   -1.12   0.2752
Adjacent     -0.07480    0.01770   -4.23   0.0003

Residual standard error: 61 on 24 degrees of freedom
Multiple R-Squared: 0.766,     Adjusted R-squared: 0.717
F-statistic: 15.7 on 5 and 24 degrees of freedom,   p-value: 6.84e-07

We can identify several useful quantities in this output. Other statistical packages tend to produce output quite similar to this. One useful feature of R is that it is possible to directly calculate quantities of interest. Of course, it is not necessary here because the lm() function does the job, but it is very useful when the statistic you want is not part of the pre-packaged functions.

First we make the X-matrix:

> x <- cbind(1, gala[,-c(1,2)])

and here's the response y:

> y <- gala$Species

Now let's construct $X^TX$: t() does transpose and %*% does matrix multiplication:

> t(x) %*% x
Error: %*% requires numeric matrix/vector arguments

This gives a somewhat cryptic error. The problem is that matrix arithmetic can only be done with numeric values, but x here derives from the data frame type. Data frames are allowed to contain character variables, which would disallow matrix arithmetic. We need to force x into the matrix form:

> x <- as.matrix(x)
> t(x) %*% x

Inverses can be taken using the solve() command:

> xtxi <- solve(t(x) %*% x)
> xtxi

A somewhat more direct way to get $(X^TX)^{-1}$ is as follows:

> gfit <- lm(Species ~ Area + Elevation + Nearest + Scruz + Adjacent,
             data=gala)
> gs <- summary(gfit)
> gs$cov.unscaled
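As an aside (not in the text), the design matrix that lm() actually used, including the intercept column, can be extracted with the standard model.matrix() function rather than built by hand; the inverse computed from it should agree with xtxi and gs$cov.unscaled apart from row and column names:

> X <- model.matrix(gfit)     # design matrix used by lm(), with intercept column
> solve(t(X) %*% X)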
The names() command is the way to see the components of an R object; you can see that there are other useful quantities that are directly available:

> names(gs)
> names(gfit)

In particular, the fitted (or predicted) values and residuals are

> gfit$fit
> gfit$res

We can get β̂ directly:

> xtxi %*% t(x) %*% y
          [,1]
[1,]  7.068221
[2,] -0.023938
[3,]  0.319465
[4,]  0.009144
[5,] -0.240524
[6,] -0.074805

or in a computationally more efficient and stable manner:

> solve(t(x) %*% x, t(x) %*% y)
          [,1]
[1,]  7.068221
[2,] -0.023938
[3,]  0.319465
[4,]  0.009144
[5,] -0.240524
[6,] -0.074805

We can estimate σ using the estimator in the text:

> sqrt(sum(gfit$res^2)/(30-6))
[1] 60.975

Compare this to the residual standard error in the summary output above.

We may also obtain the standard errors for the coefficients (diag() returns the diagonal of a matrix):

> sqrt(diag(xtxi))*60.975
[1] 19.154139  0.022422  0.053663  1.054133  0.215402  0.017700

Finally we may compute R²:

> 1-sum(gfit$res^2)/sum((y-mean(y))^2)
[1] 0.76585
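As a cross-check (not part of the text), the same quantities computed by hand above are stored as components of the summary object, so they can be compared directly with what lm() produced:

> gs$sigma          # residual standard error; compare with 60.975
> gs$r.squared      # R^2; compare with 0.76585
> gs$coef[,2]       # standard errors of the coefficients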