当前位置：和泉文库 > 统计 > 浏览文档

《实用非参数统计》课程教学资源（阅读材料）回归与方差分析 Practical Regression and Anova using R（共16章）Faraway-PRA

Chapter 1 Introduction Chapter 2 Estimation Chapter 3 Inference Chapter 4 Errors in Predictors Chapter 5 Generalized Least Squares Chapter 6 Testing for Lack of Fit Chapter 7 Diagnostics Chapter 8 Transformation Chapter 9 Scale Changes, Principal Components and Collinearity Chapter 10 Variable Selection Chapter 11 Statistical Strategy and Model Uncertainty Chapter 12 Chicago Insurance Redlining - a complete example Chapter 13 Robust and Resistant Regression Chapter 14 Missing Data Chapter 15 Analysis of Covariance Chapter 16 ANOVA

文件格式：PDF，文件大小：0.99MB，售价：39.42元

共212页，可试读40页，点击往前阅读 ↑↑

文档详细内容（约212页）

3.4.CONFIDENCE INTERVALS FOR B 36 be very easy to get statistically significant results,but the actual effects may be unimportant.Would we really care if test scores were 0.1%higher in one state than another?Or that some medication reduced pain by 2%?Confidence intervals on the parameter estimates are a better way of assessing the size of an effect.There are useful even when the null hypothesis is not rejected because they tell us how confident we are that the true effect or value is close to the null. Even so,hypothesis tests do have some value,not least because they impose a check on unreasonable conclusions which the data simply does not support. 3.4 Confidence Intervals for B Confidence intervals provide an alternative way of expressing the uncertainty in our estimates.Even so,they are closely linked to the tests that we have already constructed.For the confidence intervals and regions that we will consider here,the following relationship holds.For a 100(1-a)%confidence region,any point that lies within the region represents a null hypothesis that would not be rejected at the 1000%level while every point outside represents a null hypothesis that would be rejected.So,in a sense,the confidence region provides a lot more information than a single hypothesis test in that it tells us the outcome of a whole range of hypotheses about the parameter values.Of course,by selecting the particular level of confidence for the region,we can only make tests at that level and we cannot determine the p-value for any given test simply from the region.However,since it is dangerous to read too much into the relative size of p-values (as far as how much evidence they provide against the null),this loss is not particularly important. The confidence region tells us about plausible values for the parameters in a way that the hypothesis test cannot.This makes it more valuable. As with testing,we must decide whether to form confidence regions for parameters individually or simultaneously.Simultaneous regions are preferable but for more than two dimensions they are difficult to display and so there is still some value in computing the one-dimensional confidence intervals. We start with the simultaneous regions.Some results from multivariate analysis show that (B-B)xX(B-B)x σ2 and (n-p)62 02 XH-P and these two quantities are independent.Hence B-B)IXTX(B-B)。X/P P62 Xh-p/(n-p) 三Fp,m-p So to form a 100(l-)%confidence region forβ，takeβsuch that B-B)'xTxB-B)≤pG2r0p These regions are ellipsoidally shaped.Because these ellipsoids live in higher dimensions,they cannot easily be visualized. Alternatively,one could consider each parameter individually which leads to confidence intervals which take the general form of estimate+critical value x s.e.of estimate

3.4. CONFIDENCE INTERVALS FOR β 36 be very easy to get statistically significant results, but the actual effects may be unimportant. Would we really care if test scores were 0.1% higher in one state than another? Or that some medication reduced pain by 2%? Confidence intervals on the parameter estimates are a better way of assessing the size of an effect. There are useful even when the null hypothesis is not rejected because they tell us how confident we are that the true effect or value is close to the null. Even so, hypothesis tests do have some value, not least because they impose a check on unreasonable conclusions which the data simply does not support. 3.4 Confidence Intervals for β Confidence intervals provide an alternative way of expressing the uncertainty in our estimates. Even so, they are closely linked to the tests that we have already constructed. For the confidence intervals and regions that we will consider here, the following relationship holds. For a 100 ✁ 1 α ✂ % confidence region, any point that lies within the region represents a null hypothesis that would not be rejected at the 100α% level while every point outside represents a null hypothesis that would be rejected. So, in a sense, the confidence region provides a lot more information than a single hypothesis test in that it tells us the outcome of a whole range of hypotheses about the parameter values. Of course, by selecting the particular level of confidence for the region, we can only make tests at that level and we cannot determine the p-value for any given test simply from the region. However, since it is dangerous to read too much into the relative size of p-values (as far as how much evidence they provide against the null), this loss is not particularly important. The confidence region tells us about plausible values for the parameters in a way that the hypothesis test cannot. This makes it more valuable. As with testing, we must decide whether to form confidence regions for parameters individually or simultaneously. Simultaneous regions are preferable but for more than two dimensions they are difficult to display and so there is still some value in computing the one-dimensional confidence intervals. We start with the simultaneous regions. Some results from multivariate analysis show that ✁ ˆβ β ✂ TX TX ✁ ˆβ β ✂ σ2 χ 2 p and ✁ n p ✂ σˆ 2 σ2 χ 2 n p and these two quantities are independent. Hence ✁ ˆβ β ✂ TX TX ✁ ˆβ β ✂ pσˆ 2 χ 2 p p χ 2 n p ✁ n p ✂✁ Fp ✂ n p So to form a 100 ✁ 1 α ✂ % confidence region for β, take β such that ✁ ˆβ β ✂ TX TX ✁ ˆβ β ✂ pσˆ 2F α ✁ p ✂ n p These regions are ellipsoidally shaped. Because these ellipsoids live in higher dimensions, they cannot easily be visualized. Alternatively, one could consider each parameter individually which leads to confidence intervals which take the general form of estimate ✂ critical value s ✁ e ✁ of estimate

3.4. CONFIDENCE INTERVALS FOR β 37 or specifically in this case: ˆβi ✂ t α 2 ✁ n p σˆ ✁ XTX ✂✁1 ii It’s better to consider the joint confidence intervals when possible, especially when the ˆβ are heavily correlated. Consider the full model for the savings data. The . in the model formula stands for “every other variable in the data frame” which is a useful abbreviation. > g <- lm(sr ˜ ., savings) > summary(g) Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) 28.566087 7.354516 3.88 0.00033 pop15 -0.461193 0.144642 -3.19 0.00260 pop75 -1.691498 1.083599 -1.56 0.12553 dpi -0.000337 0.000931 -0.36 0.71917 ddpi 0.409695 0.196197 2.09 0.04247 Residual standard error: 3.8 on 45 degrees of freedom Multiple R-Squared: 0.338, Adjusted R-squared: 0.28 F-statistic: 5.76 on 4 and 45 degrees of freedom, p-value: 0.00079 We can construct individual 95% confidence intervals for the regression parameters of pop75: > qt(0.975,45) [1] 2.0141 > c(-1.69-2.01*1.08,-1.69+2.01*1.08) [1] -3.8608 0.4808 and similarly for growth > c(0.41-2.01*0.196,0.41+2.01*0.196) [1] 0.01604 0.80396 Notice that this confidence interval is pretty wide in the sense that the upper limit is about 50 times larger than the lower limit. This means that we are not really that confident about what the exact effect of growth on savings really is. Confidence intervals often have a duality with two-sided hypothesis tests. A 95% confidence interval contains all the null hypotheses that would not be rejected at the 5% level. Thus the interval for pop75 contains zero which indicates that the null hypothesis H0 : βpop75 ☎ 0 would not be rejected at the 5% level. We can see from the output above that the p-value is 12.5% — greater than 5% — confirming this point. In contrast, we see that the interval for ddpi does not contain zero and so the null hypothesis is rejected for its regression parameter. Now we construct the joint 95% confidence region for these parameters. First we load in a ”library” for drawing confidence ellipses which is not part of base R: > library(ellipse) and now the plot:

3.5.CONFIDENCE INTERVALS FOR PREDICTIONS 39 can be explained by realizing that two negatively correlated predictors are attempting to the perform the same job.The more work one does,the less the other can do and hence the positive correlation in the coefficients 3.5 Confidence intervals for predictions Given a new set of predictors,xo what is the predicted response?Easy-just o=xB.However,we need to distinguish between predictions of the future mean response and predictions of future observations.To make the distinction,suppose we have built a regression model that predicts the selling price of homes in a given area that is based on predictors like the number of bedrooms,closeness to a major highway etc.There are two kinds of predictions that can be made for a given xo. 1.Suppose a new house comes on the market with characteristics Its selling price will be Since Es=0,the predicted price isxB but in assessing the variance of this prediction,we must include the variance of e. 2.Suppose we ask the question-"What would the house with characteristics xo"sell for on average. This selling price is xB and is again predicted by xB but now only the variance in B needs to be taken into account. Most times,we will want the first case which is called "prediction of a future value"while the second case, called "prediction of the mean response"is less common. Now var (xoB)=x(+xoo2 A future observation is predicted to be xB(where we don't what the future will turn out to be). So a 100(1-)%confidence interval for a single future response is 士26V1y6w0p0 If on the other hand,you want a confidence interval for the average of the responses for given xo then use o±26VxW0+0 We return to the Galapagos data for this example. g <lm(Species Area+Elevation+Nearest+Scruz+Adjacent,data=gala) Suppose we want to predict the number of species(oftortoise)on an island with predictors 0.08,93,6.0,12.0,0.34 (same order as in the dataset).Of course it is difficult to see why in practice we would want to do this be- cause a new island is unlikely to present itself.For a dataset like this interest would center on the structure of the model and relative importance of the predictors,so we should regard this more as a"what if?"exercise. Do it first directly from the formula: >x0<-c(1,0.08,93,6.0,12.0,0.34) y0 <-sum(x0*gscoef) >y0 [1]33.92

3.5. CONFIDENCE INTERVALS FOR PREDICTIONS 39 can be explained by realizing that two negatively correlated predictors are attempting to the perform the same job. The more work one does, the less the other can do and hence the positive correlation in the coefficients. 3.5 Confidence intervals for predictions Given a new set of predictors, x0 what is the predicted response? Easy — just yˆ0 ☎ x T 0 ˆβ. However, we need to distinguish between predictions of the future mean response and predictions of future observations. To make the distinction, suppose we have built a regression model that predicts the selling price of homes in a given area that is based on predictors like the number of bedrooms, closeness to a major highway etc. There are two kinds of predictions that can be made for a given x0. 1. Suppose a new house comes on the market with characteristics x0. Its selling price will be x T 0 β ε. Since Eε ☎ 0, the predicted price is x T 0 ˆβ but in assessing the variance of this prediction, we must include the variance of ε. 2. Suppose we ask the question — “What would the house with characteristics x0” sell for on average. This selling price is x T 0 β and is again predicted by x T 0 ˆβ but now only the variance in ˆβ needs to be taken into account. Most times, we will want the first case which is called “prediction of a future value” while the second case, called “prediction of the mean response” is less common. Now var ✁ x T 0 ˆβ ✂ ☎ x T 0 ✁ X TX ✂ 1 x0σ 2 . A future observation is predicted to be x T 0 ˆβ ε (where we don’t what the future ε will turn out to be). So a 100 ✁ 1 α ✂ % confidence interval for a single future response is yˆ0 ✂ t α 2 ✁ n p σˆ 1 x T 0 ✁ XTX ✂ 1x0 If on the other hand, you want a confidence interval for the average of the responses for given x0 then use yˆ0 ✂ t α 2 ✁ n p σˆ x T 0 ✁ XTX ✂ 1x0 We return to the Galapagos data for this example. > g <- lm(Species ˜ Area+Elevation+Nearest+Scruz+Adjacent,data=gala) Suppose we want to predict the number of species (of tortoise) on an island with predictors 0.08,93,6.0,12.0,0.34 (same order as in the dataset). Of course it is difficult to see why in practice we would want to do this because a new island is unlikely to present itself. For a dataset like this interest would center on the structure of the model and relative importance of the predictors, so we should regard this more as a ”what if?” exercise. Do it first directly from the formula: > x0 <- c(1,0.08,93,6.0,12.0,0.34) > y0 <- sum(x0*g$coef) > y0 [1] 33.92

点击进入文档下载页（PDF格式）

共212页，试读已结束，阅读完整版请下载

您可能感兴趣的文档

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录