SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction

Kavli Affiliate: Wei Gao | First 5 Authors: Xuan Zhang, Cunxiao Du, Chao Du, Tianyu Pang, Wei Gao | Summary: Recent advancements in large language models (LLMs) have extended their capabilities to handle long contexts. However, increasing the number of model layers and the length of input sequences significantly escalates the memory required to store […]


LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation

Kavli Affiliate: Wei Gao | First 5 Authors: Xuan Zhang, Fengzhuo Zhang, Cunxiao Du, Chao Du, Tianyu Pang | Summary: Scaling language models to handle longer contexts introduces substantial memory challenges due to the growing cost of key-value (KV) caches. Motivated by the efficiency gains of hybrid models and the broad availability of pretrained large […]


On “Inconsistencies of metalens performance and comparison with conventional diffractive optics”

Kavli Affiliate: Andrei Faraon | First 5 Authors: Amir Arbabi, Andrei Faraon | Summary: It was recently claimed [1] that reported focusing efficiency values of high numerical aperture metalenses are inconsistent with a theoretical bound, and their measurement results are incorrectly interpreted. We review the article and conclude that these claims are not well […]


NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models

Kavli Affiliate: Xiang Zhang | First 5 Authors: Han Han, Tong Zhu, Xiang Zhang, Mengsong Wu, Hao Xiong | Summary: Large language models (LLMs) combined with tool learning have gained impressive results in real-world applications. During tool learning, LLMs may call multiple tools in nested orders, where the latter tool call may take the former […]


Bandwidth-tunable Telecom Single Photons Enabled by Low-noise Optomechanical Transduction

Kavli Affiliate: Simon Groblacher | First 5 Authors: Liu Chen, Alexander Rolf Korsch, Cauê Moreno Kersul, Rodrigo Benevides, Yong Yu | Summary: Single-photon sources are of fundamental importance to emergent quantum technologies. Nano-structured optomechanical crystals provide an attractive platform for single photon generation due to their unique engineering freedom and compatibility with on-chip silicon fabrication. […]


Expansion properties of the young supernova type Iax remnant Pa 30 revealed

Kavli Affiliate: David Charbonneau | First 5 Authors: Tim Cunningham, Ilaria Caiazzo, Nikolaus Z. Prusinski, James Fuller, John C. Raymond | Summary: The recently discovered Pa 30 nebula, the putative type Iax supernova remnant associated with the historical supernova of 1181 AD, shows puzzling characteristics that make it unique among known supernova remnants. In particular, […]


NT-LLM: A Novel Node Tokenizer for Integrating Graph Structure into Large Language Models

Kavli Affiliate: Dan Luo | First 5 Authors: Yanbiao Ji, Chang Liu, Xin Chen, Yue Ding, Dan Luo | Summary: Graphs are a fundamental data structure for representing relationships in real-world scenarios. With the success of Large Language Models (LLMs) across various natural language processing (NLP) tasks, there has been growing interest in integrating LLMs […]


Application of zero-noise extrapolation-based quantum error mitigation to a silicon spin qubit

Kavli Affiliate: Giordano Scappucci | First 5 Authors: Hanseo Sohn, Jaewon Jung, Jaemin Park, Hyeongyu Jang, Lucas E. A. Stehouwer | Summary: As quantum computing advances towards practical applications, reducing errors remains a crucial frontier for developing near-term devices. Errors in the quantum gates and quantum state readout could result in noisy circuits, which would […]


The spin lifetime of an individual atomic nucleus investigated via local-probe single-shot readout

Kavli Affiliate: Sander Otte | First 5 Authors: Evert W. Stolte, Jinwon Lee, Hester Vennema, Rik Broekhoven, Esther Teng | Summary: Nuclear spins owe their long-lived magnetic states to their excellent isolation from their environment. At the same time, a limited degree of interaction with their surroundings is necessary for reading and writing the spin […]

