Multi-view biclustering via non-negative matrix tri-factorisation

📅 2025-02-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the heterogeneity in feature–sample compatibility across multiple views, this paper proposes ResNMTF—a novel multi-view biclustering framework that automatically determines the number of clusters. ResNMTF jointly identifies sample subsets and their view-specific feature subsets, while supporting arbitrary shared-dimension configurations across views. Methodologically, it integrates nonnegative matrix tri-factorization with multi-view co-clustering and introduces the bisilhouette score—a first-of-its-kind evaluation metric specifically designed for biclustering structures—enabling both hyperparameter self-selection and result visualization. Extensive experiments on synthetic and real-world datasets demonstrate that ResNMTF accurately recovers overlapping and incomplete biclusters. The bisilhouette score exhibits strong agreement with external evaluation metrics (average correlation > 0.92), confirming its validity and practical utility for multi-view biclustering.

Technology Category

Application Category

📝 Abstract
Multi-view data is ever more apparent as methods for production, collection and storage of data become more feasible both practically and fiscally. However, not all features are relevant to describe the patterns for all individuals. Multi-view biclustering aims to simultaneously cluster both rows and columns, discovering clusters of rows as well as their view-specific identifying features. A novel multi-view biclustering approach based on non-negative matrix factorisation is proposed (ResNMTF). Demonstrated through extensive experiments on both synthetic and real datasets, ResNMTF successfully identifies both overlapping and non-exhaustive biclusters, without pre-existing knowledge of the number of biclusters present, and is able to incorporate any combination of shared dimensions across views. Further, to address the lack of a suitable bicluster-specific intrinsic measure, the popular silhouette score is extended to the bisilhouette score. The bisilhouette score is demonstrated to align well with known extrinsic measures, and proves useful as a tool for hyperparameter tuning as well as visualisation.
Problem

Research questions and friction points this paper is trying to address.

Identifies overlapping and non-exhaustive biclusters without prior knowledge
Clusters rows and view-specific features simultaneously in multi-view data
Extends silhouette score to bisilhouette for bicluster evaluation and tuning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Non-negative matrix tri-factorisation for multi-view biclustering
ResNMTF identifies overlapping non-exhaustive biclusters
Extended silhouette score to bisilhouette for evaluation
🔎 Similar Papers
No similar papers found.
E
Ella S. C. Orme
Department of Mathematics, Imperial College London, London, SW7 2AZ, United Kingdom
T
Theodoulos Rodosthenous
Department of Mathematics, Imperial College London, London, SW7 2AZ, United Kingdom
Marina Evangelou
Marina Evangelou
Imperial College London
StatisticsMachine Learning