基于多属性优势的松弛函数依赖的数据质量检测

DATA QUALITY DETECTION OF RELAXED FUNCTION DEPENDENCY BASED ON MULTI-ATTRIBUTE ADVANTAGE

  • 摘要: 为了同时兼顾松弛函数依赖检测效果,并降低计算复杂度,提出一种基于多属性优势的松弛函数依赖的数据质量检测。提出一种松弛函数依赖的数据质量检测算法,该算法通过多属性决策理论中的优势概念将多属性值之间的距离表示为指标,从而能够自动推断相似性阈值;引入一个效用函数可以根据松弛函数依赖的基数和阈值对其进行排序。最后,对真实数据集进行了实验评估,结果表明该方法具有较高的检测精度与较低的计算复杂度。

     

    Abstract: In order to take into account the effect of relaxation function dependency detection and reduce the computational complexity, a data quality detection based on multi-attribute dominance and relaxation function dependency is proposed. A data quality detection algorithm for relaxed functional dependencies was proposed. The algorithm could automatically infer the similarity threshold by expressing the distance between multiple attribute values as an index through the advantage concept in multiple attribute decision-making theory. A utility function was introduced, which could be sorted according to the cardinality and threshold of the relaxation function. The experimental evaluation on the real data set shows that the proposed method has high detection accuracy and low computational complexity.

     

/

返回文章
返回