Counting Ability of Large Language Models and Impact of Tokenization

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Juntai Cao, Chenyu You | Summary: Transformers, the backbone of modern large language models (LLMs), face inherent architectural limitations that impede their reasoning capabilities. Unlike recurrent networks, Transformers lack recurrent connections, confining them to constant-depth computation. This restriction places them in the complexity class […]



Modeling the Superlattice Phase Diagram of Transition Metal Intercalation in Bilayer 2H-TaS$_2$

Kavli Affiliate: David T. Limmer | First 5 Authors: Isaac M. Craig, B. Junsuh Kim, David T. Limmer, D. Kwabena Bediako, Sinéad M. Griffin | Summary: Van der Waals hosts intercalated with transition metal (TM) ions exhibit a range of magnetic properties strongly influenced by the structural order of the intercalants. However, predictive computational models […]



Semi-supervised Chinese Poem-to-Painting Generation via Cycle-consistent Adversarial Networks

Kavli Affiliate: Feng Wang | First 5 Authors: Zhengyang Lu, Tianhao Guo, Feng Wang | Summary: Classical Chinese poetry and painting represent the epitome of artistic expression, but the abstract and symbolic nature of their relationship poses a significant challenge for computational translation. Most existing methods rely on large-scale paired datasets, which are scarce […]



Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors

Kavli Affiliate: Xiang Zhang | First 5 Authors: Tianchun Wang, Yuanzhou Chen, Zichuan Liu, Zhanwen Chen, Haifeng Chen | Summary: The advent of large language models (LLMs) has revolutionized the field of text generation, producing outputs that closely mimic human-like writing. Although academic and industrial institutions have developed detectors to prevent the malicious usage of […]



Foundation Models in Electrocardiogram: A Review

Kavli Affiliate: Xiang Zhang | First 5 Authors: Yu Han, Xiaofeng Liu, Xiang Zhang, Cheng Ding | Summary: The electrocardiogram (ECG) is ubiquitous across various healthcare domains, such as cardiac arrhythmia detection and sleep monitoring, making ECG analysis critically essential. Traditional deep learning models for ECG are task-specific, with a narrow scope of functionality and […]



LEO-based Positioning: Foundations, Signal Design, and Receiver Enhancements for 6G NTN

Kavli Affiliate: Feng Wang | First 5 Authors: Harish K. Dureppagari, Chiranjib Saha, Harikumar Krishnamurthy, Xiao Feng Wang, Alberto Rico-Alvariño | Summary: The integration of non-terrestrial networks (NTN) into 5G new radio (NR) has opened up the possibility of developing a new positioning infrastructure using NR signals from Low-Earth Orbit (LEO) satellites. LEO-based cellular positioning […]



Miniature magneto-oscillatory wireless sensor for magnetic field and gradient measurements

Kavli Affiliate: Felix Fischer | First 5 Authors: Felix Fischer, Moonkwang Jeong, Tian Qiu | Summary: Magneto-oscillatory devices have recently been developed as very potent wireless miniature position trackers and sensors with exceptional accuracy and sensing distance for surgical and robotic applications. However, it is still unclear to which extent a mechanically resonating […]



Magneto-oscillatory localization for small-scale robots

Kavli Affiliate: Felix Fischer | First 5 Authors: Felix Fischer, Christian Gletter, Moonkwang Jeong, Tian Qiu | Summary: Magnetism is widely used for the wireless localization and actuation of robots and devices for medical procedures. However, current static magnetic localization methods suffer from large required magnets and are limited to only five degrees of freedom […]



Supervised Chain of Thought

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Dujian Ding | Summary: Large Language Models (LLMs) have revolutionized natural language processing and hold immense potential for advancing Artificial Intelligence. However, the core architecture of most mainstream LLMs, the Transformer, has inherent limitations in computational depth, rendering them theoretically incapable […]



Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Kai Wan, Hua Sun, Shiqiang Wang, Mingyue Ji | Summary: Secure aggregation is concerned with the task of securely uploading the inputs of multiple users to an aggregation server without letting the server know the inputs beyond their summation. It finds broad applications in distributed […]

