PSSketch: Finding Persistent and Sparse Flow with High Accuracy and Efficiency

📅 2025-05-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Persistent-sparse (PS) flows—characterized by high persistence yet low traffic density—pose significant challenges for early network threat warning, as they evade detection by conventional heavy-flow or persistent-flow identification methods. Method: This paper introduces, for the first time, an “anomalous boundary” dual-threshold criterion for precise PS-flow identification. We design a high-accuracy hierarchical Sketch structure featuring dynamic-bit-length counters and cross-layer overflow protection to jointly estimate flow persistence and density, augmented with memory-aware optimizations. Results: Experiments demonstrate that our approach reduces memory overhead by an order of magnitude, improves F1-score by up to 2.94×, decreases absolute relative error (ARE) by one to two orders of magnitude, and achieves higher throughput than state-of-the-art methods.

Technology Category

Application Category

📝 Abstract
Finding persistent sparse (PS) flow is critical to early warning of many threats. Previous works have predominantly focused on either heavy or persistent flows, with limited attention given to PS flows. Although some recent studies pay attention to PS flows, they struggle to establish an objective criterion due to insufficient data-driven observations, resulting in reduced accuracy. In this paper, we define a new criterion"anomaly boundary"to distinguish PS flows from regular flows. Specifically, a flow whose persistence exceeds a threshold will be protected, while a protected flow with a density lower than a threshold is reported as a PS flow. We then introduce PSSketch, a high-precision layered sketch to find PS flows. PSSketch employs variable-length bitwise counters, where the first layer tracks the frequency and persistence of all flows, and the second layer protects potential PS flows and records overflow counts from the first layer. Some optimizations have also been implemented to reduce memory consumption further and improve accuracy. The experiments show that PSSketch reduces memory consumption by an order of magnitude compared to the strawman solution combined with existing work. Compared with SOTA solutions for finding PS flows, it outperforms up to 2.94x in F1 score and reduces ARE by 1-2 orders of magnitude. Meanwhile, PSSketch achieves a higher throughput than these solutions.
Problem

Research questions and friction points this paper is trying to address.

Identifying persistent sparse flows for threat detection
Establishing objective criteria to distinguish PS flows
Improving accuracy and efficiency in PS flow detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Defines anomaly boundary for PS flow detection
Uses layered sketch with variable-length counters
Optimizes memory and accuracy with two-layer design
🔎 Similar Papers
No similar papers found.
J
Jiayao Wang
National University of Defense Technology, Changsha, China
Qilong Shi
Qilong Shi
Ph.D. student, Tsinghua University
Computer NetworkSketchNetwork MeasurementData MiningLLM
X
Xiyan Liang
Nankai University, Tianjin, China
H
Han Wang
Peking University, Beijing, China
W
Wenjun Li
Peng Cheng Laboratory, Shenzhen, China
Z
Ziling Wei
National University of Defense Technology, Changsha, China
Weizhe Zhang
Weizhe Zhang
Professor of Peng Cheng Laboratory & Harbin Institute of Technology
Parallel and Distributed SystemCloud ComputingRealtime SchedulingComputer Network
S
Shuhui Chen
National University of Defense Technology, Changsha, China