Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning

Kavli Affiliate: Cheng Peng | First 5 Authors: Shuyao Xu, Cheng Peng, Jiangxuan Long, Weidi Xu, Wei Chu | Summary: Recent advances in model distillation demonstrate that data from advanced reasoning models (e.g., DeepSeek-R1, OpenAI’s o1) can effectively transfer complex reasoning abilities to smaller, efficient student models. However, standard practices employ rejection sampling, discarding incorrect […]


Continue.. Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning

Measuring topological invariants of even-dimensional non-Hermitian systems through quench dynamics

Kavli Affiliate: Long Zhang | First 5 Authors: Xiao-Dong Lin, Long Zhang, , , | Summary: The accurate determination of non-Hermitian (NH) topological invariants plays a central role in the study of NH topological phases. In this work, we propose a general framework for directly measuring NH topological invariants in even-dimensional systems through quench dynamics. […]


Continue.. Measuring topological invariants of even-dimensional non-Hermitian systems through quench dynamics

Measuring topological invariants of even-dimensional non-Hermitian systems through quench dynamics

Kavli Affiliate: Long Zhang | First 5 Authors: Xiao-Dong Lin, Long Zhang, , , | Summary: The accurate determination of non-Hermitian (NH) topological invariants plays a central role in the study of NH topological phases. In this work, we propose a general framework for directly measuring NH topological invariants in even-dimensional systems through quench dynamics. […]


Continue.. Measuring topological invariants of even-dimensional non-Hermitian systems through quench dynamics

RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

Kavli Affiliate: Zheng Zhu | First 5 Authors: Liu Liu, Xiaofeng Wang, Guosheng Zhao, Keyu Li, Wenkang Qin | Summary: Imitation Learning has become a fundamental approach in robotic manipulation. However, collecting large-scale real-world robot demonstrations is prohibitively expensive. Simulators offer a cost-effective alternative, but the sim-to-real gap make it extremely challenging to scale. Therefore, […]


Continue.. RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

Theory of chiral-phonon-activated spin Seebeck effect

Kavli Affiliate: Mamoru Matsuo | First 5 Authors: Naoki Nishimura, Takumi Funato, Mamoru Matsuo, Takeo Kato, | Summary: We theoretically explore the generation of spin current driven by a temperature gradient in a junction between a chiral insulator and a normal metal. Based on the gyromagnetic effect caused by microscopic rotation due to phonons, we […]


Continue.. Theory of chiral-phonon-activated spin Seebeck effect

A Reduction-Driven Local Search for the Generalized Independent Set Problem

Kavli Affiliate: Yi Zhou | First 5 Authors: Yiping Liu, Yi Zhou, Zhenxiang Xu, Mingyu Xiao, Jin-Kao Hao | Summary: The Generalized Independent Set (GIS) problem extends the classical maximum independent set problem by incorporating profits for vertices and penalties for edges. This generalized problem has been identified in diverse applications in fields such as […]


Continue.. A Reduction-Driven Local Search for the Generalized Independent Set Problem

Multi-Mode Process Control Using Multi-Task Inverse Reinforcement Learning

Kavli Affiliate: Biao Huang | First 5 Authors: Runze Lin, Junghui Chen, Biao Huang, Lei Xie, Hongye Su | Summary: In the era of Industry 4.0 and smart manufacturing, process systems engineering must adapt to digital transformation. While reinforcement learning offers a model-free approach to process control, its applications are limited by the dependence on […]


Continue.. Multi-Mode Process Control Using Multi-Task Inverse Reinforcement Learning

Hybrid RIS-Enhanced ISAC Secure Systems: Joint Optimization in the Presence of an Extended Target

Kavli Affiliate: Long Zhang | First 5 Authors: Yu Yao, Junhao Zhang, Pu Miao, Long Zhang, Gaojie Chen | Summary: Unlike the conventional fully-passive and fully-active reconfigurable intelligent surfaces (RISs), a hybrid RIS consisting of active and passive reflection units has recently been concerned, which can exploit their integrated advantages to alleviate the RIS-induced path […]


Continue.. Hybrid RIS-Enhanced ISAC Secure Systems: Joint Optimization in the Presence of an Extended Target

Disentangling hierarchical relaxations in glass formers via dynamic eigenmodes

Kavli Affiliate: Yi Zhou | First 5 Authors: Wensi Sun, Yanshuang Chen, Wencheng Ji, Yi Zhou, Hua Tong | Summary: Hierarchical dynamics in glass-forming systems span multiple timescales, from fast vibrations to slow structural rearrangements, appearing in both supercooled fluids and glassy states. Understanding how these diverse processes interact across timescales remains a central challenge. […]


Continue.. Disentangling hierarchical relaxations in glass formers via dynamic eigenmodes

DocMMIR: A Framework for Document Multi-modal Information Retrieval

Kavli Affiliate: Yi Zhou | First 5 Authors: Zirui Li, Siwei Wu, Xingyu Wang, Yi Zhou, Yizhi Li | Summary: The rapid advancement of unsupervised representation learning and large-scale pre-trained vision-language models has significantly improved cross-modal retrieval tasks. However, existing multi-modal information retrieval (MMIR) studies lack a comprehensive exploration of document-level retrieval and suffer from […]


Continue.. DocMMIR: A Framework for Document Multi-modal Information Retrieval