统计研究 ›› 2018, Vol. 35 ›› Issue (9): 103-114.doi: 10.19343/j.cnki.11-1302/c.2018.09.009

• • 上一篇    下一篇

基尼加权回归分析:概念、方法及应用

戴平生   

  • 出版日期:2018-09-25 发布日期:2018-09-25

Weighted Gini Regression: Concept, Method and its Application

Dai Pingsheng   

  • Online:2018-09-25 Published:2018-09-25

摘要: 普通最小二乘法是进行回归分析最常用的基本方法,但该方法要求满足若干经典假设,对于小样本或在与收入相关回归分析的参数估计中易受奇异值、高收入群体的影响。本文试图利用基尼加权回归弥补以上不足。基尼加权回归可分为参数方法与非参数方法两类,参数方法基于样本残差的基尼平均差最小原则对参数进行估计;非参数方法则是直接由两点间的斜率加权得到。基尼加权回归分析可以进行参数假设检验并定义拟合优度,其中的假设检验在实际应用中采用Jackknife重抽样方法估计方差。文中提出的样本拓展基尼平均差算法,弥补了现有算法对样本数据只能提供近似计算的不足,极大简化相应的计算公式。利用我国2015年省域截面数据、1994至2015年总量时间序列数据分别讨论入境旅游收入对收入基尼系数的影响,发现使用基尼加权回归的结果不仅符合理论预期,而且可以通过不平等厌恶参数的变化反映入境旅游收入对不同群体收入公平性的影响。

关键词: 基尼加权回归, 基尼平均差, 参数估计, 非参数估计

Abstract: The method of ordinary least squares (OLS) is one of the most common one for regression analysis. OLS relies on several classical assumptions, and estimators are affected easily by extreme values, high income groups in regression analysis with related to income or small sample size. This paper promotes the weighted Gini regression as an alternative way, which consists of parameter estimating and non-parameter estimating. Parameter estimator is based on minimum of Gini mean difference of sample residues; non-parameter estimator comes from weighted value of slopes. Hypothesis test and R-squared calculating are carried in weighted Gini regression, resampling Jackknife technology is used to estimate variance for hypothesis test. It promotes a new algorithm of sample extend Gini mean difference, which can cover the shortage of approximate treatment of sample data. It discusses about how inbound tourism receipt influences income Gini coefficients by using 2015 provincial cross-sectional data and 1994-2015 total time series data in China. The results from weighted Gini regression line with expectations of relationship between variables, and they can reflect effects of inbound tourism receipt on income equity of different groups by changing inequality preference.

Key words: Weighted Gini regression, Gini’s mean difference, Parameter Estimation, Non-parameter estimation