The workflow motif: a widely-userful performance diagnosis abstraction for distributed applications

📅 2025-05-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Distributed application performance diagnosis suffers from a severe semantic gap between high-level application logic and low-level observability tools. To bridge this gap, we propose “workflow motifs”—a novel abstraction that formally characterizes frequent, structured behavioral patterns in request execution, thereby aligning application semantics with system traces. We establish the first generalizable and mineable theoretical model of workflow motifs and rigorously formalize its correspondence with frequent subgraph mining (e.g., gSpan). Based on this foundation, we implement the first prototype system for recommending HDFS performance optimization opportunities. Evaluated on real-world HDFS deployments, our approach precisely identifies critical bottleneck paths—including cache misses and redundant RPCs—demonstrating the motif abstraction’s effectiveness, interpretability, and practical utility in performance diagnosis.

Technology Category

Application Category

📝 Abstract
Diagnosing problems in deployed distributed applications continues to grow more challenging. A significant reason is the extreme mismatch between the powerful abstractions developers have available to build increasingly complex distributed applications versus the simple ones engineers have available to diagnose problems in them. To help, we present a novel abstraction, the workflow motif, instantiations of which represent characteristics of frequently-repeating patterns within and among request executions. We argue that workflow motifs will benefit many diagnosis tasks, formally define them, and use this definition to identify which frequent-subgraph-mining algorithms are good starting points for mining workflow motifs. We conclude by using an early version of workflow motifs to suggest performance-optimization points in HDFS.
Problem

Research questions and friction points this paper is trying to address.

Diagnosing issues in complex distributed applications is increasingly difficult
Mismatch exists between development abstractions and diagnostic tools for distributed systems
Workflow motifs help identify performance patterns in request executions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces workflow motif abstraction
Leverages frequent-subgraph-mining algorithms
Applies motifs to diagnose distributed systems
🔎 Similar Papers
No similar papers found.