Measurement of the branching fractions and longitudinal polarisations of $B^0_{(s)} \to K^{*0} \overline{K}{}^{*0}$ decays

Kavli Affiliate: Hsiaowen Chen| Summary: A time- and flavour-integrated amplitude analysis of $B^0$ and $B^0_s$ decays to the $(K^+\pi^-)(K^-\pi^+)$ final state in the $K^*(892)^0 \overline{K}{}^*(892)^0$ region is presented, using $pp$ collision data recorded with the LHCb detector in 2011–2018, corresponding to an integrated luminosity […]



SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL

Kavli Affiliate: Hsiaowen Chen| Summary: Vision Language Models (VLMs) demonstrate strong qualitative visual understanding, but struggle with the metrically precise spatial reasoning required for embodied applications. The agentic paradigm promises that VLMs can use a wide variety of tools that could augment these capabilities, such as depth estimators, segmentation […]



Learning Steerable Clarification Policies with Collaborative Self-play

Kavli Affiliate: Hsiaowen Chen| Summary: To handle underspecified or ambiguous queries, AI assistants need a policy for managing their uncertainty to determine (a) when to guess the user intent and answer directly, (b) when to enumerate and answer multiple possible intents, and (c) when to ask a clarifying […]



The Mass-Metallicity Relation and its Observational Effects at z~3-6

Kavli Affiliate: David W. Miller| First 5 Authors: Zach Lewis| Summary: The correlation between galaxy stellar mass and gas-phase metallicity, known as the mass-metallicity relation (MZR), gives key insights into the processes that govern galaxy evolution. However, unquantified observational and selection biases can result in systematic errors in attempts to recover […]



ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Kavli Affiliate: Hsiaowen Chen| Summary: Despite progress in video-to-audio generation, the field focuses predominantly on mono output, lacking spatial immersion. Existing binaural approaches remain constrained by a two-stage pipeline that first generates mono audio and then performs spatialization, often resulting in error accumulation and spatio-temporal inconsistencies. To address […]



DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Kavli Affiliate: Hsiaowen Chen| Summary: Understanding the dynamic physical world, characterized by its evolving 3D structure, real-world motion, and semantic content with textual descriptions, is crucial for human-agent interaction and enables embodied agents to perceive and act within real environments with human-like capabilities. However, existing datasets are often […]



$S^5$: Tidal Disruption in Crater 2 and Formation of Diffuse Dwarf Galaxies in the Local Group

Kavli Affiliate: Alexander Ji| First 5 Authors: Guilherme Limberg| Summary: We present results of a spectroscopic campaign around the diffuse dwarf galaxy Crater 2 (Cra2) and its tidal tails as part of the Southern Stellar Stream Spectroscopic Survey ($S^5$). Cra2 is a Milky Way dwarf spheroidal satellite with extremely cold kinematics, […]



Photon (Non)Conservation in the Reduced Speed of Light Approximation and How to (Almost) Fix It

Kavli Affiliate: Nickolay Gnedin| First 5 Authors: Nickolay Y. Gnedin| Summary: The "Reduced Speed of Light" (RSL) approximation is commonly used to speed up radiative transfer calculations in cosmological simulations. However, it has been shown previously that the RSL approximation leads to photon non-conservation in some regimes. I show that […]



TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Kavli Affiliate: Hsiaowen Chen| Summary: Unified multimodal models (UMMs) aim to jointly perform multimodal understanding and generation within a single framework. We present TUNA, a native UMM that builds a unified continuous visual representation by cascading a VAE encoder with a representation encoder. This unified representation space allows […]



Learning Visual Affordance from Audio

Kavli Affiliate: Hsiaowen Chen| Summary: We introduce Audio-Visual Affordance Grounding (AV-AG), a new task that segments object interaction regions from action sounds. Unlike existing approaches that rely on textual instructions or demonstration videos, which are often limited by ambiguity or occlusion, audio provides real-time, semantically rich, and visually independent […]

