8 Comparing the AIC to the BIC AIC BI_中国高校课件下载中心

点击下载：《多元统计分析》课程教学资源（阅读材料）Model selection in linear regression

正在加载图片...

8 Comparing the AIC to the BIC AIC BIC .BIC tends to have models that are "too ●AIC tends to have models that are“too big",good for prediction (perhaps),but small",but still have optimal predictive properties.Will not exaggerate impor- not so good for understanding whether tance of covariates (sometimes too con- specific covariates are important or not. servative). ●Tends to overfit. Tends to underfit (but see next point!). Not consistent:Even with an infinite Consistent,asymptotically will return sample size,will not necessarily converge to correct model,tends to remain too big. the correct model. Performs better for simple models, Performs better for complex models "Truth is low dimensional". "Truth is high dimensional" Usable for non-nested models. Usable for non-nested models. Not related to any significant tests. Equivalent to a stepwise procedure with a critical value of 15.7%. Asymptotically equivalent to a Bayes Factor,so easily leads to model probabil- Not equivalent to a Bayes Factor,so no natural way to get model probabilities,so ities,so easy to perform model averaging no model averaging. (done automatically in BIC software for R). Example of using the R BMA package for Bayesian model selection Here is an example of using the BMA model selection program First,we will generate some data: x <-rnorm(1000,mean=0,sd=4) z <-rnorm(1000,mean=5,sd=20) w<-rnorm(1000,mean=-3,sd=4) y <rnorm(1000,mean 3*x 2*z,sd=25) Y will be our independent variable,and we will create a matrix for the other variables as follows: data.matrix <-matrix(c(x,w,z),byrow=F,nrow=1000)8 Comparing the AIC to the BIC AIC BIC • AIC tends to have models that are “too big”, good for prediction (perhaps), but not so good for understanding whether specific covariates are important or not. • Tends to overfit. • Not consistent: Even with an infinite sample size, will not necessarily converge to correct model, tends to remain too big. • Performs better for complex models “Truth is high dimensional”. • Usable for non-nested models. • Equivalent to a stepwise procedure with a critical value of 15.7%. • Not equivalent to a Bayes Factor, so no natural way to get model probabilities, so no model averaging. • BIC tends to have models that are “too small”, but still have optimal predictive properties. Will not exaggerate importance of covariates (sometimes too conservative). • Tends to underfit (but see next point!). • Consistent, asymptotically will return the correct model. • Performs better for simple models, “Truth is low dimensional”. • Usable for non-nested models. • Not related to any significant tests. • Asymptotically equivalent to a Bayes Factor, so easily leads to model probabilities, so easy to perform model averaging (done automatically in BIC software for R). Example of using the R BMA package for Bayesian model selection # Here is an example of using the BMA model selection program # First, we will generate some data: x <- rnorm(1000, mean=0, sd=4) z <- rnorm(1000, mean=5, sd=20) w <- rnorm(1000, mean=-3, sd=4) y <- rnorm(1000, mean = 3*x + 2*z, sd=25) # Y will be our independent variable, and we will create a matrix for # the other variables as follows: data.matrix <- matrix(c(x,w,z), byrow=F, nrow=1000)

<<向上翻页向下翻页>>

点击下载：《多元统计分析》课程教学资源（阅读材料）Model selection in linear regression