Kavli Affiliate: Jing Wang | First 5 Authors: Lei Yu, Yechao Zhang, Ziqi Zhou, Yang Wu, Wei Wan | Summary: With the rapid development of the Vision-Language Model (VLM), significant progress has been made in Visual Question Answering (VQA) tasks. However, existing VLM often generate inaccurate answers due to a lack of up-to-date knowledge. To […]
Continue.. Spa-VLM: Stealthy Poisoning Attacks on RAG-based VLM