Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts

Kavli Affiliate: Dan Luo | First 5 Authors: Shun Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu | Summary: Zero-shot text-to-speech (TTS) synthesis aims to clone any unseen speaker’s voice without adaptation parameters. By quantizing speech waveform into discrete acoustic tokens and modeling these tokens with the language model, recent language model-based TTS models […]



Improving HEVC Encoding of Rendered Video Data Using True Motion Information

Kavli Affiliate: David Muller | First 5 Authors: Christian Herglotz, David Müller, Andreas Weinlich, Frank Bauer, Michael Ortner | Summary: This paper shows that motion vectors representing the true motion of an object in a scene can be exploited to improve the encoding process of computer generated video sequences. Therefore, a set of sequences is […]



In operando cryo-STEM of pulse-induced charge density wave switching in TaS$_2$

Kavli Affiliate: Lena F. Kourkoutis | First 5 Authors: James L Hart, Saif Siddique, Noah Schnitzer, Stephen D. Funni, Lena F. Kourkoutis | Summary: The charge density wave (CDW) material 1T-TaS$_2$ exhibits a pulse-induced insulator-to-metal transition, which shows promise for next-generation electronics such as memristive memory and neuromorphic hardware. However, the rational design of TaS$_2$ […]



Evaluating the Efficacy of Supervised Learning vs Large Language Models for Identifying Cognitive Distortions and Suicidal Risks in Chinese Social Media

Kavli Affiliate: Dan Luo | First 5 Authors: Hongzhi Qi, Qing Zhao, Changwei Song, Wei Zhai, Dan Luo | Summary: Large language models, particularly those akin to the rapidly progressing GPT series, are gaining traction for their expansive influence. While there is keen interest in their applicability within medical domains such as psychology, tangible explorations […]



Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social Media

Kavli Affiliate: Dan Luo | First 5 Authors: Hongzhi Qi, Qing Zhao, Changwei Song, Wei Zhai, Dan Luo | Summary: In the realm of social media, users frequently convey personal sentiments, with some potentially indicating cognitive distortions or suicidal tendencies. Timely recognition of such signs is pivotal for effective interventions. In response, we introduce two […]



Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social Media

Kavli Affiliate: Dan Luo | First 5 Authors: Hongzhi Qi, Qing Zhao, Jianqiang Li, Changwei Song, Wei Zhai | Summary: On social media, users often express their personal feelings, which may exhibit cognitive distortions or even suicidal tendencies on certain specific topics. Early recognition of these signs is critical for effective psychological intervention. In this […]



Phonon-Dressed Third-Harmonic Generation in Diamond

Kavli Affiliate: Jeffrey Moses | First 5 Authors: Jiaoyang Zheng, Guru Khalsa, Jeffrey Moses | Summary: We demonstrate a strong, elastic, frequency-upshifting optical polarizability that accompanies sum-frequency driving of Raman phonons and two-photon absorption, extending the duality of structural and optical property modification recently observed in light-driven phononics. The effect, phonon-dressed third-harmonic generation (THG), […]



Phonon-Mediated Third-Harmonic Generation in Diamond

Kavli Affiliate: Jeffrey Moses | First 5 Authors: Jiaoyang Zheng, Guru Khalsa, Jeffrey Moses | Summary: We observe strongly anisotropic third-harmonic generation mediated by resonant sum-frequency driving of Raman phonons with THz light, extending light-induced dual control of structural and optical properties in solids. Either strong enhancement or strong suppression of the third harmonic […]



Giant elastocaloric effect at low temperatures in TmVO$_4$ and implications for cryogenic cooling

Kavli Affiliate: Brad J. Ramshaw | First 5 Authors: Mark P. Zic, Matthias S. Ikeda, Pierre Massat, Patrick M. Hollister, Linda Ye | Summary: Adiabatic decompression of para-quadrupolar materials has significant potential as a cryogenic cooling technology. We focus on TmVO$_4$, an archetypal material that undergoes a continuous phase transition to a ferroquadrupole-ordered state at […]



Enhancing Psychological Counseling with Large Language Model: A Multifaceted Decision-Support System for Non-Professionals

Kavli Affiliate: Dan Luo | First 5 Authors: Guanghui Fu, Qing Zhao, Jianqiang Li, Dan Luo, Changwei Song | Summary: In the contemporary landscape of social media, an alarming number of users express negative emotions, some of which manifest as strong suicidal intentions. This situation underscores a profound need for trained psychological counselors who can […]

