🤖 AI Summary
The LoRA model ecosystem lacks structured representations, and existing retrieval methods rely on unreliable user-provided descriptions or biased popularity metrics.
Method: We propose the first scalable, metadata-free representation framework, leveraging CLIP embedding differences derived from generative behavior—specifically, multi-prompt and multi-seed sampling—to disentangle three semantic features: directionality, intensity, and consistency. This enables semantic-driven precise matching and quality-aware filtering.
Contribution/Results: We innovatively establish quantifiable links between the tripartite representation and legal concepts of “substantial similarity” and “autonomy” from copyright law. Evaluated on 650+ LoRA models, our method significantly outperforms text-based baselines. Both automated and human evaluations confirm its high accuracy, robust stability across prompts and seeds, and capacity for copyright-compliance analysis—demonstrating practical utility for responsible model discovery and governance.
📝 Abstract
The rapid proliferation of generative components, such as LoRAs, has created a vast but unstructured ecosystem. Existing discovery methods depend on unreliable user descriptions or biased popularity metrics, hindering usability. We present CARLoS, a large-scale framework for characterizing LoRAs without requiring additional metadata. Analyzing over 650 LoRAs, we employ them in image generation over a variety of prompts and seeds, as a credible way to assess their behavior. Using CLIP embeddings and their difference to a base-model generation, we concisely define a three-part representation: Directions, defining semantic shift; Strength, quantifying the significance of the effect; and Consistency, quantifying how stable the effect is. Using these representations, we develop an efficient retrieval framework that semantically matches textual queries to relevant LoRAs while filtering overly strong or unstable ones, outperforming textual baselines in automated and human evaluations. While retrieval is our primary focus, the same representation also supports analyses linking Strength and Consistency to legal notions of substantiality and volition, key considerations in copyright, positioning CARLoS as a practical system with broader relevance for LoRA analysis.