面向缺损敏感属性的加权信息熵匿名算法

A WEIGHTED INFORMATION ENTROPY ANONYMOUS ALGORITHM FOR DEFECT SENSITIVE ATTRIBUTES

  • 摘要: 提出一种(γ,l-p,k)-匿名模型,模型引入了信息熵作为计算缺损数据的属性距离,通过敏感属性度高低分为不同的敏感级别,并设置相应的权重ω值,同时让等价类元组的不同敏感级别个数满足阈值γ。接着根据模型设计了加权信息熵匿名算法(Weighted Information Entropy Anonymous Algorithm based on Defect-Sensitive Attributes,WISA*)对缺损型数据集进行匿名化。实验结果表明,该算法不仅可以减少等价类信息损失,同时提高了敏感属性的多样性,从而降低了数据隐私泄露风险且复杂度较低。

     

    Abstract: This paper proposes a (γ,l-p,k)-anonymity model. In the anonymous model, information entropy was introduced as the attribute distance to calculate the missing data, and the sensitive attribute degree was divided into different sensitive levels, and the corresponding weight ω value was set. At the same time, we let the number of different sensitivity levels of equivalence class tuples meet the threshold γ. According to the model, a weighted information entropy anonymity algorithm WISA* was designed to anonymize the defective dataset. The experimental results show that the proposed algorithm WISA* not only reduces the loss of equivalent information, but also improves the performance of sensitive attributes on diversity, thereby descending the risk of privacy leakage of the data with a low complexity.

     

/

返回文章
返回