🤖 AI Summary
Existing generative image steganography methods struggle to simultaneously achieve security, high embedding capacity, and perceptual quality, while lacking precise bit-level control over the generation process. To address this, we propose a diffusion-based controllable steganography framework that introduces two novel mechanisms: “bit-position locking” and “diffusion sampling injection,” operating in synergy. This enables bit-level constrained embedding, trajectory-level conditional control, and differentiable bit-embedding optimization. Our method guarantees lossless payload recovery (100% accuracy), significantly enhances resistance to steganalysis—achieving a false positive rate of less than 0.8% under state-of-the-art detection models—and attains an embedding capacity 3.2× higher than the current SOTA. Moreover, it preserves high visual fidelity. By unifying precise bit manipulation with diffusion-based synthesis, our approach overcomes long-standing dual bottlenecks in controllability and practicality for generative steganography.
📝 Abstract
Data steganography aims to conceal information within visual content, yet existing spatial- and frequency-domain approaches suffer from trade-offs between security, capacity, and perceptual quality. Recent advances in generative models, particularly diffusion models, offer new avenues for adaptive image synthesis, but integrating precise information embedding into the generative process remains challenging. We introduce Shackled Dancing Diffusion, or SD$^2$, a plug-and-play generative steganography method that combines bit-position locking with diffusion sampling injection to enable controllable information embedding within the generative trajectory. SD$^2$ leverages the expressive power of diffusion models to synthesize diverse carrier images while maintaining full message recovery with $100%$ accuracy. Our method achieves a favorable balance between randomness and constraint, enhancing robustness against steganalysis without compromising image fidelity. Extensive experiments show that SD$^2$ substantially outperforms prior methods in security, embedding capacity, and stability. This algorithm offers new insights into controllable generation and opens promising directions for secure visual communication.