Cross-Model Semantics in Representation Learning

📅 2025-08-05
📈 Citations: 0
Influential: 0
🤖 AI Summary
Neural networks' internal representations are often unstable and inconsistent across architectures, hindering knowledge transfer and modular deployment. To address this, we propose a structured regularization framework comprising linear shaping operators and corrective path constraints, which explicitly encode inductive biases to improve the geometric alignment of representations across architectures. Through theoretical analysis, controlled transfer experiments, and a novel representation alignment metric, we demonstrate that structural priors significantly enhance semantic consistency among heterogeneous models. Our method improves downstream task performance in model distillation and modular learning by up to 12.3%, offering an interpretable and scalable paradigm for building robust, composable deep learning systems.
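The summary cites a novel representation alignment metric without defining it. As a minimal sketch, assuming the metric behaves like linear centered kernel alignment (CKA), a standard way to score geometric similarity between two models' activations (the function name and the CKA choice are stand-ins, not the paper's actual metric):

```python
import numpy as np

def linear_cka(X: np.ndarray, Y: np.ndarray) -> float:
    """Linear CKA between two activation matrices.

    X: (n_samples, d_a) activations from model A at some layer.
    Y: (n_samples, d_b) activations from model B at a comparable layer.
    Returns a similarity in [0, 1]; higher means more aligned geometry.
    NOTE: an assumed stand-in for the paper's unspecified alignment metric.
    """
    # Center each feature so the comparison ignores constant offsets.
    X = X - X.mean(axis=0, keepdims=True)
    Y = Y - Y.mean(axis=0, keepdims=True)

    # Linear-kernel HSIC terms: cross-similarity over self-similarities.
    cross = np.linalg.norm(Y.T @ X, "fro") ** 2
    return float(cross / (np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro")))
```

A score near 1 between corresponding layers of two differently built networks would support the summary's claim of cross-architectural semantic consistency.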

📝 Abstract
The internal representations learned by deep networks are often sensitive to architecture-specific choices, raising questions about the stability, alignment, and transferability of learned structure across models. In this paper, we investigate how structural constraints, such as linear shaping operators and corrective paths, affect the compatibility of internal representations across different architectures. Building on insights from prior studies on structured transformations and convergence, we develop a framework for measuring and analyzing representational alignment across networks with distinct but related architectural priors. Through a combination of theoretical insights, empirical probes, and controlled transfer experiments, we demonstrate that structural regularities induce representational geometry that is more stable under architectural variation. This suggests that certain forms of inductive bias not only support generalization within a model but also improve the interoperability of learned features across models. We conclude with a discussion of the implications of representational transferability for model distillation, modular learning, and the principled design of robust learning systems.
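The abstract mentions controlled transfer experiments but does not specify their design. A plausible, hedged reading is a stitching-style probe: map one model's frozen features into another's space with a least-squares fit, then score a classifier trained in the target space. All names below, and the logistic-regression probe, are illustrative assumptions:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def transfer_probe(feats_a, feats_b, labels, train_idx, test_idx):
    """Stitching-style cross-model transfer probe (assumed design).

    feats_a, feats_b: (n_samples, d) activations from models A and B.
    Fits a linear map from A's features into B's space, then scores a
    classifier trained on B's features when fed the mapped A features.
    """
    # Least-squares stitching map W so that feats_a @ W approximates feats_b.
    W, *_ = np.linalg.lstsq(feats_a[train_idx], feats_b[train_idx], rcond=None)

    # Linear probe trained entirely in model B's representation space.
    clf = LogisticRegression(max_iter=1000)
    clf.fit(feats_b[train_idx], labels[train_idx])

    # Accuracy on mapped A features measures cross-model compatibility.
    return clf.score(feats_a[test_idx] @ W, labels[test_idx])
```

A small accuracy drop relative to evaluating on B's own features would indicate the stable, transferable geometry the abstract describes.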
Problem

Research questions and friction points this paper is trying to address.

Study stability of learned representations across different architectures
Analyze impact of structural constraints on representation compatibility
Improve interoperability of learned features with inductive biases
Innovation

Methods, ideas, or system contributions that make the work stand out.

Structural constraints enhance cross-model representation compatibility (a sketch of one such constraint follows this list)
Framework measures representational alignment across networks with distinct but related architectural priors
Inductive biases improve the interoperability of learned features across models
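The paper's concrete formulation of the shaping operators is not reproduced on this page. A minimal sketch, assuming the constraint penalizes the residual between a learned linear map of one model's features and another model's features (the class name, the L2 penalty, and the PyTorch framing are all assumptions):

```python
import torch
import torch.nn as nn

class ShapingRegularizer(nn.Module):
    """Hypothetical linear shaping operator with an L2 corrective-path penalty.

    Assumes a constraint of the form lambda * ||S h_a - h_b||^2, where S is
    a learned linear operator pulling model A's features toward model B's.
    An illustrative reading of "linear shaping operators", not the paper's
    verified method.
    """

    def __init__(self, dim_a: int, dim_b: int, weight: float = 0.1):
        super().__init__()
        self.shape_op = nn.Linear(dim_a, dim_b, bias=False)  # shaping operator S
        self.weight = weight  # regularization strength lambda

    def forward(self, h_a: torch.Tensor, h_b: torch.Tensor) -> torch.Tensor:
        # Residual between shaped A-features and target B-features; this
        # term would be added to the task loss during joint training.
        residual = self.shape_op(h_a) - h_b
        return self.weight * residual.pow(2).mean()
```

Adding such a term to each model's objective would encode the inductive bias the bullets describe, nudging both networks toward linearly compatible representations.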