SPPO: Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading

Kavli Affiliate: Wei Gao | First 5 Authors: Qiaoling Chen, Shenggui Li, Wei Gao, Peng Sun, Yonggang Wen | Summary: In recent years, Large Language Models (LLMs) have exhibited remarkable capabilities, driving advancements in real-world applications. However, training LLMs on increasingly long input sequences imposes significant challenges due to high GPU memory and computational demands. […]


Why Does Your CoT Prompt (Not) Work? Theoretical Analysis of Prompt Space Complexity, its Interaction with Answer Space During CoT Reasoning with LLMs: A Recurrent Perspective

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Juntai Cao, Jiaqi Wei, Chenyu You, Dujian Ding | Summary: Despite the remarkable successes of Large Language Models (LLMs), their fundamental Transformer architecture possesses inherent theoretical limitations that restrict their capability to handle reasoning tasks with increasing computational complexity. Chain-of-Thought (CoT) prompting has emerged as […]


Superconductivity in tin telluride films grown by molecular beam epitaxy

Kavli Affiliate: David A. Muller | First 5 Authors: Antonio Gonzalez, Samuel J. Poage, Bernardo Langa, Jr., Deepak Sapkota, Salva Salmani-Rezaie | Summary: The intersection of superconductivity and ferroelectricity hosts a wide range of exotic quantum phenomena. Here, we report on the observation of superconductivity in high-quality tin telluride films grown by molecular beam epitaxy. […]


The dearth of high-mass hydrogen-atmosphere metal-polluted white dwarfs within 40 pc

Kavli Affiliate: David Charbonneau | First 5 Authors: Tim Cunningham, Pier-Emmanuel Tremblay, Mairi O’Brien, Evan B. Bauer, Mark A. Hollands | Summary: We present a population synthesis model which addresses the different mass distributions of the metal-polluted and non-metal-polluted hydrogen-atmosphere white dwarfs identified in volume-limited samples. Specifically, metal-pollution has been observed to be rare in […]


The role of effective mass and long-range interactions in the band-gap renormalization of photo-excited semiconductors

Kavli Affiliate: Scott K. Cushing | First 5 Authors: Cian C. Reeves, Scott K. Cushing, Vojtech Vlcek | Summary: Understanding how to control changes in electronic structure and related dynamical renormalizations by external driving fields is the key for understanding ultrafast spectroscopy and applications in electronics. Here we focus on the band-gap’s modulation by […]

