统计研究

• 论文 • 上一篇    

大数据分析仍需要统计思想——以ARGO模型为例

林存洁 李扬   

  • 出版日期:2016-11-15 发布日期:2016-11-11

Analysis on Big Data Needs Statistical Thinking:Taking ARGO Model as An Example

Lin Cunjie & Li Yang   

  • Online:2016-11-15 Published:2016-11-11

摘要: 在大数据时代,传统的统计学是否还有用武之地成为很多人的争议。本文以ARGO模型为案例,介绍了统计方法在大数据分析中的应用和取得的成果,并从统计学的角度出发,提出改进的措施与方法。通过ARGO模型的分析结果发现,大数据分析的很多根本性问题仍然是统计问题,而数据中的统计规律仍然是数据分析要挖掘的最大价值,这也意味着统计思想在大数据分析中只能越来越重要。而对于结构复杂、来源多样的大数据来说,统计学方法也需要新的探索和尝试,这将是统计学所面临的机遇和挑战。

关键词: 流感预测, 时间序列, 变量选择

Abstract: In the era of big data, people argue that the traditional statistic has lost its superiority in data analysis. In this paper, we take ARGO model as an example to introduce the applications and achievements of statistical methods in big data analysis, and put forward the potential improvements from statistics’ point of view. The analysis of ARGO model shows that many intrinsic problems in big data can be resolved by statistical methods and the statistical law contained in the data is still the greatest value of data mining, which means that the statistical thinking can only become more and more important in big data analysis. However, in the face of big data with complex structure and diversity of sources, statistical methods also need further exploration and try to seize the new opportunities for development and rise to new challenges.

Key words: Prediction of Flue Trend, Time Series, Variable Selection