YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework

Kavli Affiliate: Ting Xu | First 5 Authors: Dahang Wan, Rongsheng Lu, Yang Fang, Xianli Lang, Shuangbao Shu | Summary: Multispectral object detection, which integrates information from multiple bands, can enhance detection accuracy and environmental adaptability, holding great application potential across various fields. Although existing methods have made progress in cross-modal interaction, low-light conditions, and […]


Curriculum Learning for Biological Sequence Prediction: The Case of De Novo Peptide Sequencing

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Jiaqi Wei, Zijie Qiu, Sheng Xu, Nanqing Dong | Summary: Peptide sequencing, the process of identifying amino acid sequences from mass spectrometry data, is a fundamental task in proteomics. Non-Autoregressive Transformers (NATs) have proven highly effective for this task, outperforming traditional methods. Unlike autoregressive models, which generate […]


Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models

Kavli Affiliate: Xiang Zhang | First 5 Authors: Zongyu Wu, Minhua Lin, Zhiwei Zhang, Fali Wang, Xianren Zhang | Summary: Large vision-language models (LVLMs) have demonstrated outstanding performance in many downstream tasks. However, LVLMs are trained on large-scale datasets, which can pose privacy risks if training images contain sensitive information. Therefore, it is important to […]


The Throughput Gain of Hypercycle-level Resource Reservation for Time-Triggered Ethernet

Kavli Affiliate: Feng Wang | First 5 Authors: Peng Wang, Suman Sourav, Binbin Chen, Hongyan Li, Feng Wang | Summary: Time-Triggered Communication is a key technology for many safety-critical systems, with applications spanning the areas of aerospace and industrial control. Such communication relies on time-triggered flows, with each flow consisting of periodic packets originating from […]


Electron-magnon coupling at the interface of a “twin-twisted” antiferromagnet

Kavli Affiliate: Jeffrey B. Neaton | First 5 Authors: Yue Sun, Fanhao Meng, Sijia Ke, Kun Xu, Hongrui Zhang | Summary: We identify a "twin-twist" angle in orthorhombic two-dimensional magnets that maximizes interlayer orbital overlap and enables strong interfacial coupling. Focusing on the van der Waals antiferromagnet CrSBr, we show that this twist angle, near […]


S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder

Kavli Affiliate: Xiang Zhang | First 5 Authors: Yu Pan, Yuguang Yang, Yanni Hu, Jianhao Ye, Xiang Zhang | Summary: Multilingual speech-to-speech translation (S2ST) aims to directly convert spoken utterances from multiple source languages into fluent and intelligible speech in a target language. Despite recent progress, several critical challenges persist: 1) achieving high-quality and low-latency […]


S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation

Kavli Affiliate: Xiang Zhang | First 5 Authors: Yu Pan, Yuguang Yang, Yanni Hu, Jianhao Ye, Xiang Zhang | Summary: Multilingual speech-to-speech translation (S2ST) aims to directly convert spoken utterances from multiple source languages into fluent and intelligible speech in a target language. Despite recent progress, several critical challenges persist: 1) achieving high-quality S2ST remains […]


Enhancing quantum noise characterization via extra energy levels

Kavli Affiliate: Irfan Siddiqi | First 5 Authors: Senrui Chen, Akel Hashim, Noah Goss, Alireza Seif, Irfan Siddiqi | Summary: Noise is a major challenge for building practical quantum computing systems. Precise characterization of quantum noise is crucial for developing effective error mitigation and correction schemes. However, state preparation and measurement (SPAM) errors on many […]


MagCache: Fast Video Generation with Magnitude-Aware Cache

Kavli Affiliate: Feng Wang | First 5 Authors: Zehong Ma, Longhui Wei, Feng Wang, Shiliang Zhang, Qi Tian | Summary: Existing acceleration techniques for video diffusion models often rely on uniform heuristics or time-embedding variants to skip timesteps and reuse cached features. These approaches typically require extensive calibration with curated prompts and risk inconsistent outputs […]


Coherent phonon motions and ordered vacancy compound mediated quantum path interference in Cu-poor CuIn$_{x}$Ga$_{(1-x)}$Se$_2$ (CIGS) with attosecond transient absorption

Kavli Affiliate: Peidong Yang | First 5 Authors: Hugo Laurell, Jonah R. Adelman, Elizaveta Yakovleva, Carl Hägglund, Kostiantyn Sopiha | Summary: In this study, coherent phonon motion is observed in bandgap excited CuIn$_{x}$Ga$_{(1-x)}$Se$_2$ (CIGS) utilizing extreme ultraviolet (XUV) attosecond transient absorption spectroscopy across the Se M$_{4,5}$ absorption edge. Two frequencies of coherent phonon motion are […]

