时间权重驱动的日志模式循环迭代挖掘算法

TIME-WEIGHT DRIVEN LOG PATTERN MINING WITH ITERATIVE PROCESS

  • 摘要: 日志数据是运维人员监控软件系统的重要依据,自动化挖掘日志模式能够有效地帮助运维人员理解系统的行为。然而,相同任务与不同任务的日志交错会干扰日志模式的挖掘。为此,提出一种时间权重驱动的日志模式循环迭代挖掘算法TSP-Miner。该算法提取关联日志的合理时间间隔作为时间权重,准确识别日志间的关联性并挖掘日志模式,无需任务特征信息。通过循环迭代挖掘的策略,TSP-Miner持续替换日志序列中符合已有日志模式的子序列,简化其结构,即使在交错干扰下也能有效挖掘日志模式。基于真实和模拟数据集的实验结果均表明,TSP-Miner挖掘出的日志模式质量优于已有算法。

     

    Abstract: Logs are crucial data for maintenance experts to monitor software systems, and automating log pattern mining can significantly aid in understanding system behavior. However, the interleaving of logs from the same or different tasks can interfere with mining log patterns. To address this issue, a time-weighted iterative log pattern mining algorithm, TSP-Miner, is proposed. This algorithm extracted reasonable time intervals between correlated logs as time weights to accurately identify log correlations and mine log patterns, without requiring task-specific information. By employing a cyclic iteration strategy, TSP-Miner continuously replaced subsequences in a log sequence that match existing patterns. This process simplified the structure of the log sequence, thereby effectively mining log patterns even in the presence of interleaving interference. Experimental results on both real and simulated datasets demonstrate that TSP-Miner outperforms existing algorithms in terms of the quality of mined log patterns.

     

/

返回文章
返回