Kavli Affiliate: Zhuo Li | First 5 Authors: Xuying Li, Zhuo Li, Yuji Kosuga, Yasuhiro Yoshida, Victor Bian | Summary: Large language models (LLMs) have demonstrated remarkable capabilities, but they also pose risks related to the generation of toxic or harmful content. This work introduces Precision Knowledge Editing (PKE), an advanced technique that builds upon […]
Precision Knowledge Editing: Enhancing Safety in Large Language Models