• 论文 •

### 适用于大数据集的广义可加模型

• 出版日期:2016-04-15 发布日期:2016-04-05

### Generalized additive models for large data sets

Xu Yipin&Ni Ping

• Online:2016-04-15 Published:2016-04-05

Abstract: We consider an application in electricity grid load prediction, where generalized additive models are appropriate, but where the data set’s size can make their use practically intractable with existing methods. We therefore develop practical generalized additive model fitting methods for large data sets in the case in which the smooth terms in the model are represented by using penalized regression splines. The methods use iterative update schemes to obtain factors of the model matrix while requiring only subblocks of the model matrix to be computed at any one time. We show that efficient smoothing parameter estimation can be carried out in a well-justified manner. The grid load prediction problem requires updates of the model fit, as new data become available, and some means for dealing with residual auto-correlation in grid load. Methods are provided for these problems and parallel implementation is covered. The methods allow estimation of generalized additive models for large data sets by using modest computer hardware, and the grid load prediction problem illustrates the utility of reduced rank spline smoothing methods for dealing with complex modelling problems.