Context-Aware Pseudo-Label Scoring for Zero-Shot Video Summarization

Kavli Affiliate: Long Zhang | First 5 Authors: Yuanli Wu | Summary: We propose a rubric-guided, pseudo-labeled, and prompt-driven zero-shot video summarization framework that bridges large language models with structured semantic reasoning. A small subset of human annotations is converted into high-confidence pseudo labels and organized into dataset-adaptive rubrics defining clear […]


Aria Gen 2 Pilot Dataset

Kavli Affiliate: Cheng Peng | First 5 Authors: Chen Kong | Summary: The Aria Gen 2 Pilot Dataset (A2PD) is an egocentric multimodal open dataset captured using the state-of-the-art Aria Gen 2 glasses. To facilitate timely access, A2PD is released incrementally with ongoing dataset enhancements. The initial release features Dia’ane, our […]


DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

Kavli Affiliate: Zheng Zhu | First 5 Authors: Weijie Wang | Summary: We present DriveGen3D, a novel framework for generating high-quality and highly controllable dynamic 3D driving scenes that addresses critical limitations in existing methodologies. Current approaches to driving scene synthesis either suffer from prohibitive computational demands for extended temporal generation, […]


WithAnyone: Towards Controllable and ID Consistent Image Generation

Kavli Affiliate: Cheng Peng | First 5 Authors: Hengyuan Xu | Summary: Identity-consistent generation has become an important focus in text-to-image research, with recent models achieving notable success in producing images aligned with a reference identity. Yet, the scarcity of large-scale paired datasets containing multiple images of the same individual forces […]


NTIRE 2025 Challenge on Low Light Image Enhancement: Methods and Results

Kavli Affiliate: Biao Huang | Summary: This paper presents a comprehensive review of the NTIRE 2025 Low-Light Image Enhancement (LLIE) Challenge, highlighting the proposed solutions and final outcomes. The objective of the challenge is to identify effective networks capable of producing brighter, clearer, and visually compelling images under diverse and challenging conditions. A remarkable total of […]


E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization

Kavli Affiliate: Yi Zhou | First 5 Authors: Wenpu Li | Summary: The estimation of optical flow and 6-DoF ego-motion, two fundamental tasks in 3D vision, has typically been addressed independently. For neuromorphic vision (e.g., event cameras), however, the lack of robust data association makes solving the two problems separately an […]


Spinons, solitons and random singlets in the spin-chain compound copper benzoate

Kavli Affiliate: Long Zhang | First 5 Authors: Ying Chen | Summary: The $S=1/2$ antiferromagnetic Heisenberg chain is a paradigmatic quantum system hosting exotic excitations such as spinons and solitons, and forming a random singlet state in the presence of quenched disorder. Realizing and distinguishing these excitations in a single material remains […]


R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation

Kavli Affiliate: Zheng Zhu | First 5 Authors: Xiuwei Xu | Summary: Towards the aim of generalized robotic manipulation, spatial generalization is the most fundamental capability, requiring the policy to work robustly under different spatial distributions of objects, the environment, and the agent itself. To achieve this, substantial human demonstrations need to […]


Dimension- and Facet-Dependent Altermagnetic Triferroics and Biferroics in CrSb

Kavli Affiliate: Long Zhang | First 5 Authors: Long Zhang | Summary: Altermagnets have recently garnered significant interest due to their vanishing net magnetic moment and non-relativistic momentum-dependent spin splitting. However, altermagnetic (AM) multiferroics, especially triferroics, remain scarce. We investigate the experimentally synthesized non-van der Waals CrSb as a model system […]

