Berkeley – Page 3 – Kavli Institute Pre-Print Publications

EquiContact: A Hierarchical SE(3) Vision-to-Force Equivariant Policy for Spatially Generalizable Contact-rich Tasks

Posted by klaurent July 15, 2025July 28, 2025Berkeley

Kavli Affiliate: Xiang Zhang | First 5 Authors: Joohwan Seo, Joohwan Seo, , , | Summary: This paper presents a framework for learning vision-based robotic policies for contact-rich manipulation tasks that generalize spatially across task configurations. We focus on achieving robust spatial generalization of the policy for the peg-in-hole (PiH) task trained from a small […]

Continue..

Multi-modal Mutual-Guidance Conditional Prompt Learning for Vision-Language Models

Posted by klaurent July 11, 2025July 21, 2025Berkeley

Kavli Affiliate: Xiang Zhang | First 5 Authors: Shijun Yang, Shijun Yang, , , | Summary: Prompt learning facilitates the efficient adaptation of Vision-Language Models (VLMs) to various downstream tasks. However, it faces two significant challenges: (1) inadequate modeling of class embedding distributions for unseen instances, leading to suboptimal generalization on novel classes; (2) prevailing […]

Continue..

Gap reopening as signature of coupling between Majorana zero modes in Sn-(Bi,Sb)2(Te,S)3-based Josephson trijunctions

Posted by klaurent July 9, 2025July 21, 2025Berkeley

Kavli Affiliate: Xiang Zhang | First 5 Authors: Duolin Wang, Duolin Wang, , , | Summary: In the past two decades, enormous efforts have been made to search for possible platforms and schemes to implement topological quantum computation (TQC). In exploring the Fu-Kane scheme of TQC based on Josephson trijunctions constructed on topological insulators, the […]

Continue..

BlueLM-2.5-3B Technical Report

Posted by klaurent July 8, 2025July 21, 2025Berkeley

Kavli Affiliate: Feng Wang | First 5 Authors: Baojiao Xiong, Baojiao Xiong, , , | Summary: We present BlueLM-2.5-3B, a compact and unified dense Multimodal Large Language Model (MLLM) designed for efficient edge-device deployment, offering strong general-purpose and reasoning capabilities. To the best of our knowledge, this is the first 3B-scale MLLM to support both […]

Continue..

Noise-Canceling Quantum Feedback: non-Hermitian Dynamics with Applications to State Preparation and Magic State Distillation

Posted by klaurent July 8, 2025Berkeley

Kavli Affiliate: Birgitta Whaley | First 5 Authors: Tathagata Karmakar, Tathagata Karmakar, , , | Summary: Time-continuous quantum measurement allows for the tracking of a quantum system in real time via sequences of short, and individually weak, measurement intervals. Such measurements are necessarily invasive, imparting backaction to the system, and allowing the observer to update […]

Continue..

MP-ALOE: An r2SCAN dataset for universal machine learning interatomic potentials

Posted by klaurent July 8, 2025July 21, 2025Berkeley

Kavli Affiliate: Kristin A. Persson | First 5 Authors: Matthew C. Kuner, Matthew C. Kuner, , , | Summary: We present MP-ALOE, a dataset of nearly 1 million DFT calculations using the accurate r2SCAN meta-generalized gradient approximation. Covering 89 elements, MP-ALOE was created using active learning and primarily consists of off-equilibrium structures. We benchmark a […]

Continue..

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Posted by klaurent July 7, 2025Berkeley

Kavli Affiliate: Felix Fischer | First 5 Authors: Gheorghe Comanici, Gheorghe Comanici, , , | Summary: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA […]

Continue..

Clinical NLP with Attention-Based Deep Learning for Multi-Disease Prediction

Posted by klaurent July 2, 2025Berkeley

Kavli Affiliate: Ting Xu | First 5 Authors: Ting Xu, Ting Xu, , , | Summary: This paper addresses the challenges posed by the unstructured nature and high-dimensional semantic complexity of electronic health record texts. A deep learning method based on attention mechanisms is proposed to achieve unified modeling for information extraction and multi-label disease […]

Continue..

High-quality metalens enables minimally invasive CFB endoscopy

Posted by klaurent June 26, 2025Berkeley

Kavli Affiliate: Feng Wang | First 5 Authors: Ruixiang Song, Ruixiang Song, , , | Summary: Metalenses, owing to their ultra-thin planar structures, present a promising solution for reducing endoscopic invasiveness. However, achieving high-quality imaging with minimal invasiveness (short focal length of metalens) remains a critical challenge. This paper presents a deep learning assisted metalens […]

Continue..

Modulating task outcome value to mitigate real-world procrastination via noninvasive brain stimulation

Posted by klaurent June 26, 2025Berkeley

Kavli Affiliate: Ting Xu | First 5 Authors: Zhiyi Chen, Zhiyi Chen, , , | Summary: Procrastination represents one of the most prevalent behavioral problems affecting individual health and societal productivity. Although it is often conceptualized as a form of self-control failure, its underlying neurocognitive mechanisms are poorly understood. A leading model posits that procrastination […]

Continue..