统计研究 ›› 2022, Vol. 39 ›› Issue (5): 134-145.doi: 10.19343/j.cnki.11–1302/c.2022.05.010

• • 上一篇    下一篇

一种基于机器学习的宏观经济数据融合方法

黄恒君 高海燕 韩 君   

  • 出版日期:2022-05-25 发布日期:2022-05-26

A Machine Learning Approach for Macroeconomic Data Fusion

Huang Hengjun Gao Haiyan Han Jun   

  • Online:2022-05-25 Published:2022-05-26

摘要: 大数据和机器学习正在改变经济统计学的研究范式与方法。宏观经济数据作为统计产品,用于描述一定范围内的经济状态或联系。与微观多源异构数据一样,宏观经济数据也具有融合二次开发的潜质,且具备更好的数据质量保障。本文在梳理机器学习数据融合方法的基础上,指出一类宏观经济数据融合任务,提出一种宏观经济数据融合方法,旨在提高预测能力。首先,通过论证经济状态数据、经济关联数据的可融合形式特征,给出提取不同类型数据共同特征的模型化表示方法;进而提出一种数据融合模型,给出模型求解的交替迭代求解算法,该模型可以统一处理数据融合基础上的无监督学习、监督学习和半监督学习任务。并且,本文基于2017年中国统计年鉴、2017年中国投入产出表和2017—2018年中国经济景气月报数据开展数据融合应用,结果表明,与非融合方法相比,数据融合方法提高了预测精度。

关键词: 数据融合, 经济状态, 经济关联, 机器学习

Abstract: Big data and machine learning are changing the research paradigm and methods of economic statistics. As a statistical product, macroeconomic data can describe economic status and economic connections in a certain range. Similar to integrated utilization of multi-source heterogeneous micro-data, macroeconomic data also has the potential for fusion, with its better data quality assurance. Based on an overview of data fusion methods of machine learning, this paper indicates a type of macroeconomic data fusion tasks, and proposes a data fusion approach, which aims to improve the prediction ability. Specifically, the potential merging characteristics of economic status data and economic connection data are demonstrated and a model representation method for extracting common feature information of these data is given. Then a fusion model is proposed, which can uniformly handle the task of unsupervised learning, supervised learning and semi-supervised learning on the basis of data fusion. An alternative iterative algorithm is followed for this model. The application of fusing the data from China Statistical Yearbooks (2017), Input-output Tables of China (2017), and China’s economic prosperity monthly reports (2017—2018) proves that our algorithm of data fusion has a better prediction accuracy compared with the non-fusion methods.

Key words: Data Fusion, Economic Status, Economic Connections, Machine Learning