Kavli Affiliate: Feng Wang | First 5 Authors: Jiaxing Li, Chi Xu, Feng Wang, Isaac M von Riedemann, Cong Zhang | Summary: Large Language Models (LLMs) have become increasingly popular, transforming a wide range of applications across various domains. However, the real-world effectiveness of their query cache systems has not been thoroughly investigated. In this […]
SCALM: Towards Semantic Caching for Automated Chat Services with Large Language Models