๐ค AI Summary
This work addresses the challenging problem of complex controllable image editing under unpaired data conditions. We propose the Collaborative-Competitive Multi-Agent (CCMA) framework, which jointly integrates collaborative and adversarial mechanisms. Collaborative agents perform fine-grained editing via latent-space co-optimization and diffusion-based policy networks; competitive agents enforce a balance between diversity and consistency through adversarial reward modeling and multi-agent reinforcement learning, enabling iterative refinement over multiple rounds. Crucially, CCMA is entirely training-data-freeโrequiring neither paired image-text annotations nor any supervised training data. Evaluated on multiple image editing benchmarks, it achieves state-of-the-art performance. User studies further demonstrate its superior naturalness and intent fidelity compared to existing methods, empirically validating the effectiveness and practicality of the proposed dual-mechanism design.