Statistical Research (统计研究), 2020, Vol. 37, Issue 11: 68-79. doi: 10.19343/j.cnki.11-1302/c.2020.11.006

Research on the Prediction Model of the Direction of Financial Market Returns: Based on the Text Big Data Method

Gu Wentao, Wang Ru, Zheng Suhao, Yang Yongwei

  • Online: 2020-11-25 Published: 2020-11-24

Abstract: The development of the financial market bears on a country's economic lifeline, and because the stock market is an important part of the financial market, research on stock returns has long been an active academic topic. Financial news is widely regarded as rich in information, and the sentiment it carries, as one of the important factors influencing investors' decisions, also has some effect on stock returns. This paper therefore constructs a financial news sentiment lexicon suited to the financial investment field and uses it for text analysis of financial news. It also proposes a new prediction model: the sentiment contained in financial news text is quantified as a sentiment index and combined with a time-varying density function to obtain a time-varying weighted density model. On this basis, several prediction models are combined, with each model's score used as its weight, to build a scoring weighted model for predicting stock returns. The results show that adding the sentiment index effectively improves the models' predictive ability, and that the scoring weighted model improves on this further, performing essentially best on both accuracy and the scoring rules.

Key words: Direction Prediction, Sentiment Index, Scoring Weighted
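
The score-weighted combination described in the abstract can be illustrated with a minimal sketch. The abstract does not state which scoring rule or normalization the authors use, so everything specific here is an assumption: each model is taken to output a probability that the return is positive, the Brier score stands in for the scoring rule, and a weight is a model's `1 - Brier` skill divided by the sum of skills. The function names (`brier_score`, `score_weights`, `combine`, `predict_direction`) are hypothetical and chosen only for this illustration.

```python
from typing import List, Sequence


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared gap between P(up) forecasts and 0/1 outcomes (lower is better)."""
    if len(probs) != len(outcomes) or len(probs) == 0:
        raise ValueError("probs and outcomes must be non-empty and equally long")
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


def score_weights(history: Sequence[Sequence[float]],
                  outcomes: Sequence[int]) -> List[float]:
    """Convert each model's past score into a weight; weights sum to 1.

    Assumption: skill = 1 - Brier score, which lies in [0, 1] for
    probability forecasts. The paper's actual rule may differ.
    """
    skills = [1.0 - brier_score(p, outcomes) for p in history]
    total = sum(skills)
    if total == 0.0:
        # Every model scored the worst possible value: fall back to equal weights.
        return [1.0 / len(skills)] * len(skills)
    return [s / total for s in skills]


def combine(weights: Sequence[float], new_probs: Sequence[float]) -> float:
    """Weighted average of the models' current P(up) forecasts."""
    return sum(w * p for w, p in zip(weights, new_probs))


def predict_direction(p_up: float) -> int:
    """Return 1 (up) if the combined probability exceeds 0.5, else 0 (down)."""
    return 1 if p_up > 0.5 else 0


if __name__ == "__main__":
    # Made-up past forecasts from two hypothetical models and realized directions.
    past = [[0.9, 0.1, 0.8, 0.2], [0.5, 0.5, 0.5, 0.5]]
    realized = [1, 0, 1, 0]
    w = score_weights(past, realized)
    p = combine(w, [0.7, 0.4])
    print(w, p, predict_direction(p))
```

In this sketch a model that scored better in the past window receives a larger share of the combined forecast, which is the general idea of weighting by model score; the window length and the choice of scoring rule would need to follow the paper itself.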