统计研究 ›› 2007, Vol. 24 ›› Issue (4): 8-10.

• 论文 • 上一篇    下一篇

数据挖掘中基于可辨识矩阵的连续属性离散化方法

刘云霞 曾五一   

  1. 厦门大学计划统计系
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2007-04-15 发布日期:2007-04-15

Discretization of Continuous Attributes Based on Discernibility Matrix in Data Mining

LIU Yun-xia;ZENG Wu-yi   

  • Received:1900-01-01 Revised:1900-01-01 Online:2007-04-15 Published:2007-04-15

摘要: 连续属性的离散化在数据挖掘中有着非常重要的作用。本文基于可辨识矩阵提出一种连续属性离散化的方法,并利用平均互信息量对离散化结果进行修正。该方法通过统计模拟取得了良好的效果。

关键词: 数据挖掘, 可辨识矩阵, 连续属性, 离散化

Abstract: The discretization of continuous attributes is important in Data Mining. The paper put forward a method of discretization of continuous attributes based on discernibility matrix and revised the discrete result by average mutual information. This method gained good effect by statistical simulation.

 

Key words: Data Mining, Discernibility Matrix, Continuous Attributes, Discretization