Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers

Kavli Affiliate: Xiang Zhang | First 5 Authors: Divyansh Srivastava, Xiang Zhang, He Wen, Chenru Wen, Zhuowen Tu | Summary: We present Lay-Your-Scene (shorthand LayouSyn), a novel text-to-layout generation pipeline for natural scenes. Prior scene layout generation methods are either closed-vocabulary or use proprietary large language models for open-vocabulary generation, limiting their modeling capabilities and […]


Continue.. Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers

MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation

Kavli Affiliate: Feng Wang | First 5 Authors: Zilong Chen, Yikai Wang, Wenqiang Sun, Feng Wang, Yiwen Chen | Summary: In this paper, we introduce MeshGen, an advanced image-to-3D pipeline that generates high-quality 3D meshes with detailed geometry and physically based rendering (PBR) textures. Addressing the challenges faced by existing 3D native diffusion models, such […]


Continue.. MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation

DexCtrl: Towards Sim-to-Real Dexterity with Adaptive Controller Learning

Kavli Affiliate: Xiang Zhang | First 5 Authors: Shuqi Zhao, Ke Yang, Yuxin Chen, Chenran Li, Yichen Xie | Summary: Dexterous manipulation has seen remarkable progress in recent years, with policies capable of executing many complex and contact-rich tasks in simulation. However, transferring these policies from simulation to real world remains a significant challenge. One […]


Continue.. DexCtrl: Towards Sim-to-Real Dexterity with Adaptive Controller Learning

Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials

Kavli Affiliate: Kristin A. Persson | First 5 Authors: Peichen Zhong, Xinzhe Dai, Bowen Deng, Gerbrand Ceder, Kristin A. Persson | Summary: We present Crystal Host-Guided Generation (CHGGen), a diffusion-based framework for crystal structure prediction. Unconditional generation with diffusion models demonstrates limited efficacy in identifying symmetric crystals as the unit cell size increases. CHGGen addresses […]


Continue.. Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials

BAROC: Concealing Packet Losses in LSNs with Bimodal Behavior Awareness for Livecast Ingestion

Kavli Affiliate: Feng Wang | First 5 Authors: Haoyuan Zhao, Jianxin Shi, Guanzhen Wu, Hao Fang, Yi Ching Chou | Summary: The advent of Low-Earth Orbit satellite networks (LSNs), exemplified by initiatives like emph{Starlink}, emph{OneWeb} and emph{Kuiper}, has ushered in a new era of “Internet from Space" global connectivity. Recent studies have shown that LSNs […]


Continue.. BAROC: Concealing Packet Losses in LSNs with Bimodal Behavior Awareness for Livecast Ingestion

Accelerated discovery of cost-effective photoabsorber materials for near-infrared (λ=1600 nm) photodetector applications

Kavli Affiliate: Kristin A. Persson | First 5 Authors: Wayne Zhao, Ruo Xi Yang, Aaron D. Kaplan, Kristin A. Persson, | Summary: Current infrared sensing devices are based on costly materials with relatively few viable alternatives known. To identify promising candidate materials for infrared photodetection, we have developed a high-throughput screening methodology based on high-accuracy […]


Continue.. Accelerated discovery of cost-effective photoabsorber materials for near-infrared (λ=1600 nm) photodetector applications

An Error Mitigated Non-Orthogonal Quantum Eigensolver via Shadow Tomography

Kavli Affiliate: Birgitta Whaley | First 5 Authors: Hang Ren, Yipei Zhang, Wendy M. Billings, Rebecca Tomann, Nikolay V. Tkachenko | Summary: We present a shadow-tomography-enhanced Non-Orthogonal Quantum Eigensolver (NOQE) for more efficient and accurate electronic structure calculations on near-term quantum devices. By integrating shadow tomography into the NOQE, the measurement cost scales linearly rather […]


Continue.. An Error Mitigated Non-Orthogonal Quantum Eigensolver via Shadow Tomography

A Clinician-Friendly Platform for Ophthalmic Image Analysis Without Technical Barriers

Kavli Affiliate: Ting Xu | First 5 Authors: Meng Wang, Tian Lin, Qingshan Hou, Aidi Lin, Jingcheng Wang | Summary: Artificial intelligence (AI) shows remarkable potential in medical imaging diagnostics, but current models typically require retraining when deployed across different clinical centers, limiting their widespread adoption. We introduce GlobeReady, a clinician-friendly AI platform that enables […]


Continue.. A Clinician-Friendly Platform for Ophthalmic Image Analysis Without Technical Barriers

A Clinician-Friendly Platform for Ophthalmic Image Analysis Without Technical Barriers

Kavli Affiliate: Ting Xu | First 5 Authors: Meng Wang, Tian Lin, Qingshan Hou, Aidi Lin, Jingcheng Wang | Summary: Artificial intelligence (AI) shows remarkable potential in medical imaging diagnostics, yet most current models require retraining when applied across different clinical settings, limiting their scalability. We introduce GlobeReady, a clinician-friendly AI platform that enables fundus […]


Continue.. A Clinician-Friendly Platform for Ophthalmic Image Analysis Without Technical Barriers

AlignRAG: An Adaptable Framework for Resolving Misalignments in Retrieval-Aware Reasoning of RAG

Kavli Affiliate: Xiang Zhang | First 5 Authors: Jiaqi Wei, Hao Zhou, Xiang Zhang, Di Zhang, Zijie Qiu | Summary: Retrieval-augmented generation (RAG) has emerged as a foundational paradigm for knowledge-grounded text generation. However, existing RAG pipelines often fail to ensure that the reasoning trajectories align with the evidential constraints imposed by retrieved content. In […]


Continue.. AlignRAG: An Adaptable Framework for Resolving Misalignments in Retrieval-Aware Reasoning of RAG