🤖 AI Summary
Alluvial diagrams for multivariate data visualization suffer from disordered category ordering and inconsistent coloring, impairing structural clarity and perceptual discriminability. Method: We formalize two combinatorial optimization objectives—minimizing flow edge crossings to enhance layout coherence and maximizing inter-class color contrast to improve readability—and adapt the NeighborNet algorithm from evolutionary biology into a novel bi-objective optimization framework. An efficient approximation strategy is designed, enabling fully automated, semantics-aware ordering and coloring implemented in R and integrated into the open-source R package *wompwomp* (GitHub). Contribution/Results: Experiments on multiple real-world multivariate datasets demonstrate significant reductions in visual clutter and substantial improvements in pattern recognition efficiency. This work provides the first systematic, reproducible solution for joint layout and coloring optimization of alluvial diagrams.
📝 Abstract
Alluvial plots can be effective for visualization of multivariate data, but rely on ordering of alluvia that can be non-trivial to arrange. We formulate two optimization problems that formalize the challenge of ordering and coloring partitions in alluvial plots. While solving these optimization problems is challenging in general, we show that the NeighborNet algorithm from phylogenetics can be adapted to provide excellent results in typical use cases. Our methods are implemented in a freely available R package available on GitHub at https://github.com/pachterlab/wompwomp