Kavli Affiliate: Jia Liu | First 5 Authors: Shuliang Liu | Summary: Vision-language models demand watermarking solutions that protect intellectual property without compromising multimodal coherence. Existing text watermarking methods disrupt visual-textual alignment through biased token selection and static strategies, leaving semantic-critical concepts vulnerable. We propose VLA-Mark, a vision-aligned framework that embeds […]
Continue reading: VLA-Mark: A cross modal watermark for large vision-language alignment model