Berkeley – Kavli Institute Pre-Print Publications

YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework

Posted by dbos June 17, 2025 | Berkeley

Kavli Affiliate: Ting Xu | First 5 Authors: Dahang Wan, Rongsheng Lu, Yang Fang, Xianli Lang, Shuangbao Shu | Summary: Multispectral object detection, which integrates information from multiple bands, can enhance detection accuracy and environmental adaptability, holding great application potential across various fields. Although existing methods have made progress in cross-modal interaction, low-light conditions, and […]

Curriculum Learning for Biological Sequence Prediction: The Case of De Novo Peptide Sequencing

Posted by dbos June 16, 2025 | June 23, 2025 | Berkeley

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Jiaqi Wei, Zijie Qiu, Sheng Xu, Nanqing Dong | Summary: Peptide sequencing, the process of identifying amino acid sequences from mass spectrometry data, is a fundamental task in proteomics. Non-Autoregressive Transformers (NATs) have proven highly effective for this task, outperforming traditional methods. Unlike autoregressive models, which generate […]

Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models

Posted by dbos June 14, 2025 | Berkeley

Kavli Affiliate: Xiang Zhang | First 5 Authors: Zongyu Wu, Minhua Lin, Zhiwei Zhang, Fali Wang, Xianren Zhang | Summary: Large vision-language models (LVLMs) have demonstrated outstanding performance in many downstream tasks. However, LVLMs are trained on large-scale datasets, which can pose privacy risks if training images contain sensitive information. Therefore, it is important to […]

The Throughput Gain of Hypercycle-level Resource Reservation for Time-Triggered Ethernet

Posted by dbos June 13, 2025 | June 23, 2025 | Berkeley

Kavli Affiliate: Feng Wang | First 5 Authors: Peng Wang, Suman Sourav, Binbin Chen, Hongyan Li, Feng Wang | Summary: Time-Triggered Communication is a key technology for many safety-critical systems, with applications spanning the areas of aerospace and industrial control. Such communication relies on time-triggered flows, with each flow consisting of periodic packets originating from […]

Electron-magnon coupling at the interface of a “twin-twisted” antiferromagnet

Posted by dbos June 11, 2025 | June 23, 2025 | Berkeley

Kavli Affiliate: Jeffrey B. Neaton | First 5 Authors: Yue Sun, Fanhao Meng, Sijia Ke, Kun Xu, Hongrui Zhang | Summary: We identify a "twin-twist" angle in orthorhombic two-dimensional magnets that maximizes interlayer orbital overlap and enables strong interfacial coupling. Focusing on the van der Waals antiferromagnet CrSBr, we show that this twist angle, near […]

S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder

Posted by dbos June 11, 2025 | June 23, 2025 | Berkeley

Kavli Affiliate: Xiang Zhang | First 5 Authors: Yu Pan, Yuguang Yang, Yanni Hu, Jianhao Ye, Xiang Zhang | Summary: Multilingual speech-to-speech translation (S2ST) aims to directly convert spoken utterances from multiple source languages into fluent and intelligible speech in a target language. Despite recent progress, several critical challenges persist: 1) achieving high-quality and low-latency […]

S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation

Posted by dbos June 11, 2025 | Berkeley

Enhancing quantum noise characterization via extra energy levels

Posted by dbos June 10, 2025 | June 23, 2025 | Berkeley

Kavli Affiliate: Irfan Siddiqi | First 5 Authors: Senrui Chen, Akel Hashim, Noah Goss, Alireza Seif, Irfan Siddiqi | Summary: Noise is a major challenge for building practical quantum computing systems. Precise characterization of quantum noise is crucial for developing effective error mitigation and correction schemes. However, state preparation and measurement (SPAM) errors on many […]

MagCache: Fast Video Generation with Magnitude-Aware Cache

Posted by dbos June 10, 2025 | June 23, 2025 | Berkeley

Kavli Affiliate: Feng Wang | First 5 Authors: Zehong Ma, Longhui Wei, Feng Wang, Shiliang Zhang, Qi Tian | Summary: Existing acceleration techniques for video diffusion models often rely on uniform heuristics or time-embedding variants to skip timesteps and reuse cached features. These approaches typically require extensive calibration with curated prompts and risk inconsistent outputs […]

Coherent phonon motions and ordered vacancy compound mediated quantum path interference in Cu-poor CuIn$_{x}$Ga$_{(1-x)}$Se$_2$ (CIGS) with attosecond transient absorption

Posted by dbos June 5, 2025 | June 17, 2025 | Berkeley

Kavli Affiliate: Peidong Yang | First 5 Authors: Hugo Laurell, Jonah R. Adelman, Elizaveta Yakovleva, Carl Hägglund, Kostiantyn Sopiha | Summary: In this study, coherent phonon motion is observed in bandgap excited CuIn$_{x}$Ga$_{(1-x)}$Se$_2$ (CIGS) utilizing extreme ultraviolet (XUV) attosecond transient absorption spectroscopy across the Se M$_{4,5}$ absorption edge. Two frequencies of coherent phonon motion are […]