Kavli Affiliate: Zhuo Li | First 5 Authors: Zhuo Li, Yuhao Du, Xiaoqi Jiao, Yiwen Guo, Yuege Feng | Summary: Selecting high-quality and diverse training samples from extensive datasets plays a crucial role in reducing training overhead and enhancing the performance of Large Language Models (LLMs). However, existing studies fall short in assessing the overall […]
Continue.. Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm