TensorHub: Rethinking AI Model Hub with Tensor-Centric Compression

📅 2026-04-18
📈 Citations: 0
Influential: 0
📄 PDF

career value

227K/year
🤖 AI Summary
The rapid proliferation of large-scale AI models has imposed significant storage and distribution burdens on model repositories. This work proposes a tensor-centric, unsupervised deduplication and compression method that accurately identifies cross-model redundancies without requiring labeled data, leveraging tensor-level fingerprint extraction and clustering. By enabling fine-grained deduplication, the approach substantially reduces storage overhead while preserving model performance and usability. Experimental results demonstrate that the method achieves high compression efficiency on real-world model repositories with minimal runtime overhead.

Technology Category

Application Category

📝 Abstract
Modern AI models are growing rapidly in size and redundancy, leading to significant storage and distribution challenges in model hubs. We present TensorHub, a tensor-centric system for reducing storage overhead through fine-grained deduplication and compression. TensorHub leverages tensor-level fingerprinting and clustering to identify redundancy across models without requiring annotations. Our design enables efficient storage reduction while preserving model usability and performance. Experiments on real-world model repositories demonstrate substantial storage savings with minimal overhead.
Problem

Research questions and friction points this paper is trying to address.

AI model hub
model redundancy
storage overhead
model distribution
tensor compression
Innovation

Methods, ideas, or system contributions that make the work stand out.

tensor-centric
fine-grained deduplication
model compression
fingerprinting
AI model hub