SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction

Kavli Affiliate: Wei Gao | First 5 Authors: Xuan Zhang, Cunxiao Du, Chao Du, Tianyu Pang, Wei Gao | Summary: Recent advancements in large language models (LLMs) have extended their capabilities to handle long contexts. However, increasing the number of model layers and the length of input sequences significantly escalates the memory required to store […]


Continue.. SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction

Machine-Learning Analysis of Radiative Decays to Dark Matter at the LHC

Kavli Affiliate: Marcela Carena | First 5 Authors: Ernesto Arganda, Marcela Carena, Martín de los Rios, Andres D. Perez, Duncan Rocha | Summary: The search for weakly interacting matter particles (WIMPs) is one of the main objectives of the High Luminosity Large Hadron Collider (HL-LHC). In this work we use Machine Learning (ML) techniques to […]


Continue.. Machine-Learning Analysis of Radiative Decays to Dark Matter at the LHC

The Affleck-Dine Curvaton

Kavli Affiliate: Gordan Krnjaic | First 5 Authors: Aurora Ireland, Gordan Krnjaic, Takuya Okawa, , | Summary: The Standard Model of particle physics does not explain the origin of the universe’s baryon asymmetry or its primordial fluctuations. The Affleck-Dine mechanism is a well-motivated scenario for generating the baryon asymmetry through the post-inflationary dynamics of a […]


Continue.. The Affleck-Dine Curvaton