GneissWeb: Preparing High Quality Data for LLMs at Scale

Kavli Affiliate: Yi Zhou | First 5 Authors: Hajar Emami Gohari, Hajar Emami Gohari, , , | Summary: Data quantity and quality play a vital role in determining the performance of Large Language Models (LLMs). High-quality data, in particular, can significantly boost the LLM’s ability to generalize on a wide range of downstream tasks. Large […]


Continue.. GneissWeb: Preparing High Quality Data for LLMs at Scale

BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages

Kavli Affiliate: Yi Zhou | First 5 Authors: Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, Jan Philip Wahle, Terry Ruas | Summary: People worldwide use language in subtle and complex ways to express emotions. While emotion recognition — an umbrella term for several NLP tasks — significantly impacts different applications in NLP and other fields, […]


Continue.. BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages

BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages

Kavli Affiliate: Yi Zhou | First 5 Authors: Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, Jan Philip Wahle, Terry Ruas | Summary: People worldwide use language in subtle and complex ways to express emotions. While emotion recognition — an umbrella term for several NLP tasks — significantly impacts different applications in NLP and other fields, […]


Continue.. BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages

BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages

Kavli Affiliate: Yi Zhou | First 5 Authors: Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, Jan Philip Wahle, Terry Ruas | Summary: People worldwide use language in subtle and complex ways to express emotions. Although emotion recognition–an umbrella term for several NLP tasks–impacts various applications within NLP and beyond, most work in this area has […]


Continue.. BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages

Exploring Large Language Models in Healthcare: Insights into Corpora Sources, Customization Strategies, and Evaluation Metrics

Kavli Affiliate: Zheng Zhu | First 5 Authors: Shuqi Yang, Mingrui Jing, Shuai Wang, Jiaxin Kou, Manfei Shi | Summary: This study reviewed the use of Large Language Models (LLMs) in healthcare, focusing on their training corpora, customization techniques, and evaluation metrics. A systematic search of studies from 2021 to 2024 identified 61 articles. Four […]


Continue.. Exploring Large Language Models in Healthcare: Insights into Corpora Sources, Customization Strategies, and Evaluation Metrics

Active Solids: Defect Self-Propulsion Without Flow

Kavli Affiliate: Mark J. Bowick | First 5 Authors: Fridtjof Brauns, Myles O’Leary, Arthur Hernandez, Mark J. Bowick, M. Cristina Marchetti | Summary: The self-propulsion of +1/2 topological defects is a hallmark of active nematic fluids, where the defects are advected by the flow field they themselves generate. In this paper we propose a minimal […]


Continue.. Active Solids: Defect Self-Propulsion Without Flow

Active Solids: Topological Defect Self-Propulsion Without Flow

Kavli Affiliate: Mark J. Bowick | First 5 Authors: Fridtjof Brauns, Fridtjof Brauns, , , | Summary: The self-propulsion of +1/2 topological defects is a hallmark of active nematic fluids, where the defects are advected by the flow field they themselves generate. In this paper we propose a minimal model for defect self-propulsion in a […]


Continue.. Active Solids: Topological Defect Self-Propulsion Without Flow

Enabling High-Bandwidth Coherent Modulation Through Scalable Lithium Niobate Resonant Devices

Kavli Affiliate: John E. Bowers | First 5 Authors: Sadra Rahimi Kari, Paolo Pintus, John E. Bowers, Matt Robbins, Nathan Youngblood | Summary: We present a compact, resonant-based coherent modulator on a thin-film lithium niobate (TFLN) platform, addressing the growing demand for high-speed, energy-efficient modulators in modern telecommunications. The design incorporates Mach-Zehnder Interferometers (MZIs) with […]


Continue.. Enabling High-Bandwidth Coherent Modulation Through Scalable Lithium Niobate Resonant Devices

Dissecting supergraviton six-point function with lightcone limits and chiral algebra

Kavli Affiliate: Xinan Zhou | First 5 Authors: Vasco Goncalves, Maria Nocchi, Xinan Zhou, , | Summary: We develop a bootstrap strategy to obtain the six-point function of supergravitons in $AdS_5times S^5$ from symmetry constraints and consistency conditions. Compared to previous bootstrap algorithms, a novel feature is the use of lightcone OPEs together with the […]


Continue.. Dissecting supergraviton six-point function with lightcone limits and chiral algebra

SessionRec: Next Session Prediction Paradigm For Generative Sequential Recommendation

Kavli Affiliate: Long Zhang | First 5 Authors: Lei Huang, Hao Guo, Linzhi Peng, Long Zhang, Xiaoteng Wang | Summary: We introduce SessionRec, a novel next-session prediction paradigm (NSPP) for generative sequential recommendation, addressing the fundamental misalignment between conventional next-item prediction paradigm (NIPP) and real-world recommendation scenarios. Unlike NIPP’s item-level autoregressive generation that contradicts actual […]


Continue.. SessionRec: Next Session Prediction Paradigm For Generative Sequential Recommendation