Kavli Affiliate: Feng Wang | First 5 Authors: Feng Wang, Yiding Sun, Jiaxin Mao, Wei Xue, Danqing Xu | Summary: Large language models (LLMs) have demonstrated remarkable capabilities across various professional domains, with their performance typically evaluated through standardized benchmarks. However, the development of financial RAG benchmarks has been constrained by data confidentiality issues and […]
FinS-Pilot: A Benchmark for Online Financial System