“Industrial-Scale” Black Hole Selection without a Satellite

Kavli Affiliate: Subo Dong | First 5 Authors: Zexuan Wu, Zexuan Wu, , , | Summary: The forthcoming GRAVITY+ instrument promises to usher in an era of "industrial-scale" mass measurements of isolated black holes (BHs), with the potential to assemble a sample of many tens of BHs via interferometric microlensing over several years. A key […]


Continue.. “Industrial-Scale” Black Hole Selection without a Satellite

PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting

Kavli Affiliate: Li Xin Li | First 5 Authors: Linqing Wang, Linqing Wang, , , | Summary: Recent advancements in text-to-image (T2I) diffusion models have demonstrated remarkable capabilities in generating high-fidelity images. However, these models often struggle to faithfully render complex user prompts, particularly in aspects like attribute binding, negation, and compositional relationships. This leads […]


Continue.. PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting

PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting

Kavli Affiliate: Li Xin Li | First 5 Authors: Linqing Wang, Linqing Wang, , , | Summary: Recent advancements in text-to-image (T2I) diffusion models have demonstrated remarkable capabilities in generating high-fidelity images. However, these models often struggle to faithfully render complex user prompts, particularly in aspects like attribute binding, negation, and compositional relationships. This leads […]


Continue.. PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting

Integrated photonic ultrawideband real-time spectrum sensing for 6G wireless networks

Kavli Affiliate: Feng Yuan | First 5 Authors: Yuansheng Tao, Yuansheng Tao, , , | Summary: The sixth generation (6G) wireless networks require dynamic spectrum management to optimize the utilization of scarce spectral resources and support emerging integrated sensing and communication (ISAC) applications. This necessitates real-time spectrum sensing (RT-SS) capability with ultrawide measurement range, compact […]


Continue.. Integrated photonic ultrawideband real-time spectrum sensing for 6G wireless networks

Baryonic Ecosystem IN Galaxies (BEINGMgII) — III. Cool gas reservoirs at $0.3 le z le 1.6$ in the Dark Energy Survey

Kavli Affiliate: Luis C. Ho | First 5 Authors: Reena Chaudhary, Reena Chaudhary, , , | Summary: We investigate the origin of intervening cool MgII absorption detected in the spectra of background quasars and the nature of associated galaxies across a broad redshift range of $0.3 le z le 1.6$. Using nebular [O II] $lambdalambda$3727,3729 […]


Continue.. Baryonic Ecosystem IN Galaxies (BEINGMgII) — III. Cool gas reservoirs at $0.3 le z le 1.6$ in the Dark Energy Survey

AudioRWKV: Efficient and Stable Bidirectional RWKV for Audio Pattern Recognition

Kavli Affiliate: Jing Wang | First 5 Authors: Jiayu Xiong, Jiayu Xiong, , , | Summary: Recently, Transformers (e.g., Audio Spectrogram Transformers, AST) and state-space models (e.g., Audio Mamba, AuM) have achieved remarkable progress in audio modeling. However, the O(L^2) computational complexity of the Transformer architecture hinders efficient long-sequence processing, while the Mamba architecture tends […]


Continue.. AudioRWKV: Efficient and Stable Bidirectional RWKV for Audio Pattern Recognition

Kwai Keye-VL 1.5 Technical Report

Kavli Affiliate: Jing Wang | First 5 Authors: Biao Yang, Biao Yang, , , | Summary: In recent years, the development of Large Language Models (LLMs) has significantly advanced, extending their capabilities to multimodal tasks through Multimodal Large Language Models (MLLMs). However, video understanding remains a challenging area due to the dynamic and information-dense nature […]


Continue.. Kwai Keye-VL 1.5 Technical Report

Use ADAS Data to Predict Near-Miss Events: A Group-Based Zero-Inflated Poisson Approach

Kavli Affiliate: Li Xin Li | First 5 Authors: Xinbo Zhang, Xinbo Zhang, , , | Summary: Driving behavior big data leverages multi-sensor telematics to understand how people drive and powers applications such as risk evaluation, insurance pricing, and targeted intervention. Usage-based insurance (UBI) built on these data has become mainstream. Telematics-captured near-miss events (NMEs) […]


Continue.. Use ADAS Data to Predict Near-Miss Events: A Group-Based Zero-Inflated Poisson Approach

The Resurgence of GCG Adversarial Attacks on Large Language Models

Kavli Affiliate: Zhuo Li | First 5 Authors: Yuting Tan, Yuting Tan, , , | Summary: Gradient-based adversarial prompting, such as the Greedy Coordinate Gradient (GCG) algorithm, has emerged as a powerful method for jailbreaking large language models (LLMs). In this paper, we present a systematic appraisal of GCG and its annealing-augmented variant, T-GCG, across […]


Continue.. The Resurgence of GCG Adversarial Attacks on Large Language Models

Dino U-Net: Exploiting High-Fidelity Dense Features from Foundation Models for Medical Image Segmentation

Kavli Affiliate: Feng Yuan | First 5 Authors: Yifan Gao, Yifan Gao, , , | Summary: Foundation models pre-trained on large-scale natural image datasets offer a powerful paradigm for medical image segmentation. However, effectively transferring their learned representations for precise clinical applications remains a challenge. In this work, we propose Dino U-Net, a novel encoder-decoder […]


Continue.. Dino U-Net: Exploiting High-Fidelity Dense Features from Foundation Models for Medical Image Segmentation