Kavli Affiliate: Feng Wang | First 5 Authors: Mingcong Lu, Jiangcai Zhu, Wang Hao, Zheng Li, Shusheng Zhang | Summary: Multi-turn dialogues are a key interaction method between humans and Large Language Models (LLMs), as conversations extend over multiple rounds, keeping LLMs’ high generation quality and low latency is a challenge. Mainstream LLMs can be […]
Continue.. Intermittent Semi-working Mask: A New Masking Paradigm for LLMs