Kavli Affiliate: Cheng Peng | First 5 Authors: Amin Dada, Aokun Chen, Cheng Peng, Kaleb E Smith, Ahmad Idrissi-Yaghir | Summary: Traditionally, large language models have been either trained on general web crawls or domain-specific data. However, recent successes of generative large language models, have shed light on the benefits of cross-domain datasets. To examine […]
Continue.. On the Impact of Cross-Domain Data on German Language Models