Semi-supervised Vertex Hunting, with Applications in Network and Text Analysis

๐Ÿ“… 2025-10-26
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This paper addresses the semi-supervised vertex hunting (SSVH) problem: estimating vertices of a latent simplex when partial barycentric coordinates of data points are availableโ€”but corrupted by an unknown orthogonal transformation. We propose a novel framework leveraging properties of orthogonal projection matrices to incorporate these noisy, partially observed barycentric constraints into an unsupervised vertex hunting objective. The resulting optimization algorithm admits an explicit error bound and achieves faster convergence than purely unsupervised methods. Theoretically, the estimation error decays with the number of labeled samples, and the algorithm scales linearly in time complexity. Extensive experiments on mixed-membership network inference and text topic modeling demonstrate its efficiency, scalability, and superior performance over state-of-the-art baselines.

Technology Category

Application Category

๐Ÿ“ Abstract
Vertex hunting (VH) is the task of estimating a simplex from noisy data points and has many applications in areas such as network and text analysis. We introduce a new variant, semi-supervised vertex hunting (SSVH), in which partial information is available in the form of barycentric coordinates for some data points, known only up to an unknown transformation. To address this problem, we develop a method that leverages properties of orthogonal projection matrices, drawing on novel insights from linear algebra. We establish theoretical error bounds for our method and demonstrate that it achieves a faster convergence rate than existing unsupervised VH algorithms. Finally, we apply SSVH to two practical settings, semi-supervised network mixed membership estimation and semi-supervised topic modeling, resulting in efficient and scalable algorithms.
Problem

Research questions and friction points this paper is trying to address.

Estimating simplex from noisy data with partial supervision
Leveraging orthogonal projections for semi-supervised vertex hunting
Improving convergence rates for network and text analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semi-supervised vertex hunting with partial barycentric coordinates
Method leveraging orthogonal projection matrix properties
Achieves faster convergence than unsupervised algorithms
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yicong Jiang
Department of Statistics, Harvard University
Zheng Tracy Ke
Zheng Tracy Ke
Harvard University
Statistics