Jiang Sijia, Tan Zijing. CONDITIONAL FUNCTIONAL DEPENDENCIES DISCOVERY WITH STRUCTURE LEARNING IN PROBABILISTIC GRAPHICAL MODELS[J]. Computer Applications and Software, 2025, 42(2): 280-286. DOI: 10.3969/j.issn.1000-386x.2025.02.038

CONDITIONAL FUNCTIONAL DEPENDENCIES DISCOVERY WITH STRUCTURE LEARNING IN PROBABILISTIC GRAPHICAL MODELS

  • Conditional functional dependencies (CFDs) generalize functional dependencies and are widely used in data quality management and data cleaning. Existing CFD discovery methods typically find all CFDs that hold on the data, yet only the small number of CFDs that detect the errors users care about are actually used in data cleaning. The result is a large number of meaningless CFDs, and an expensive post-processing step is further required to select the relevant ones. In fact, CFD discovery corresponds to structure learning in a probabilistic graphical model, which can be solved as a sparse regression problem. By transforming the dirty dataset, estimating the inverse covariance of the transformed dataset, and decomposing it to obtain the autoregression matrix, the method captures the conditional functional dependencies that characterize the distribution of the dataset. Experiments show that this method effectively finds a small number of CFDs that can be used for error detection and is more effective than state-of-the-art CFD discovery methods.
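The three steps named in the abstract (transform, estimate the inverse covariance, decompose into an autoregression matrix) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and several choices are assumptions: the abstract does not specify the transform, so `pairwise_equality` uses a hypothetical row-versus-shifted-row equality indicator; a ridge-regularized matrix inverse stands in for a sparse inverse covariance estimator; and the decomposition assumes a linear model with a fixed attribute ordering, so that the precision matrix factors as (I - B) D (I - B)^T with B strictly upper triangular. All function names are illustrative.

```python
import numpy as np


def pairwise_equality(rows, shift=1):
    """Hypothetical transform: 1.0 where a row agrees with the row
    `shift` positions later (wrapping around), else 0.0."""
    data = np.asarray(rows)
    return (data == np.roll(data, -shift, axis=0)).astype(float)


def estimate_precision(samples, ridge=1e-2):
    """Ridge-regularized inverse covariance (a stand-in for a sparse
    estimator such as the graphical lasso)."""
    cov = np.cov(samples, rowvar=False)
    return np.linalg.inv(cov + ridge * np.eye(cov.shape[0]))


def autoregression_matrix(theta):
    """Factor theta = (I - B) D (I - B)^T with B strictly upper triangular.

    Returns B and the diagonal of D. Assumes theta is symmetric
    positive definite and that the column order is the variable order.
    """
    # Reversing rows and columns turns NumPy's lower Cholesky factor
    # into an upper triangular one for the original ordering.
    lower = np.linalg.cholesky(theta[::-1, ::-1])
    upper = lower[::-1, ::-1]            # theta == upper @ upper.T
    scale = np.diag(upper)
    unit_upper = upper / scale           # divide column j by scale[j]
    B = np.eye(theta.shape[0]) - unit_upper
    return B, scale ** 2


if __name__ == "__main__":
    # Toy table: the third column is determined by the first.
    rng = np.random.default_rng(0)
    a = rng.integers(0, 3, size=500)
    b = rng.integers(0, 4, size=500)
    table = np.column_stack([a, b, a * 10])
    theta = estimate_precision(pairwise_equality(table))
    B, _ = autoregression_matrix(theta)
    print(np.round(B, 2))
```

Under these assumptions, a large-magnitude entry B[i, j] suggests that attribute i helps predict attribute j, which is the kind of signal a dependency-selection step could threshold; the thresholding rule itself is not given in the abstract and is left out here.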
