Benchmarking of quantum and classical SDP relaxations for QUBO formulations of real-world logistics problems

Kavli Affiliate: David Gross | First 5 Authors: Birte Ostermann, Taylor Garnowski, Fabian Henze, Vaibhavnath Jha, Asra Dia | Summary: Quadratic unconstrained binary optimization problems (QUBOs) are intensively discussed in the realm of quantum computing and polynomial optimization. We provide a vast experimental study of semidefinite programming (SDP) relaxations of QUBOs using sums of squares […]


Continue.. Benchmarking of quantum and classical SDP relaxations for QUBO formulations of real-world logistics problems

A Hierarchical Shock Model of Ultra-High-Energy Cosmic Rays

Kavli Affiliate: Roger Blandford | First 5 Authors: Paul Simeon, NoĆ©mie Globus, Kirk S. S. Barrow, Roger Blandford, | Summary: We propose that a hierarchical shock model$unicode{x2014}$including supernova remnant shocks, galactic wind termination shocks, and accretion shocks around cosmic filaments and galaxy clusters$unicode{x2014}$can naturally explain the cosmic ray spectrum from ~1 GeV up to ~200 […]


Continue.. A Hierarchical Shock Model of Ultra-High-Energy Cosmic Rays

Cosmic Reionization on Computers: Statistical Properties of the Distributions of Mean Opacities

Kavli Affiliate: Nickolay Y. Gnedin | First 5 Authors: Ella Werre, David Robinson, Camille Avestruz, Nickolay Y. Gnedin, | Summary: Quasar absorption lines provide a unique window to the relationship between galaxies and the intergalactic medium during the Epoch of Reionization. In particular, high redshift quasars enable measurements of the neutral hydrogen content of the […]


Continue.. Cosmic Reionization on Computers: Statistical Properties of the Distributions of Mean Opacities

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

Kavli Affiliate: Li Xin Li | First 5 Authors: Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao | Summary: Existing large language model (LLM) evaluation benchmarks primarily focus on English, while current multilingual tasks lack parallel questions that specifically assess cross-linguistic reasoning abilities. This dual limitation makes it challenging to comprehensively assess LLMs’ […]


Continue.. MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

Kavli Affiliate: Li Xin Li | First 5 Authors: Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao | Summary: Traditional benchmarks struggle to evaluate increasingly sophisticated language models in multilingual and culturally diverse contexts. To address this gap, we introduce MMLU-ProX, a comprehensive multilingual benchmark covering 13 typologically diverse languages with approximately 11,829 […]


Continue.. MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading

Kavli Affiliate: Wei Gao | First 5 Authors: Qiaoling Chen, Shenggui Li, Wei Gao, Peng Sun, Yonggang Wen | Summary: In recent years, Large Language Models (LLMs) have exhibited remarkable capabilities, driving advancements in real-world applications. However, training LLMs on increasingly long input sequences imposes significant challenges due to high GPU memory and computational demands. […]


Continue.. SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading

Hybrid Agents for Image Restoration

Kavli Affiliate: Li Xin Li | First 5 Authors: Bingchen Li, Xin Li, , , | Summary: Existing Image Restoration (IR) studies typically focus on task-specific or universal modes individually, relying on the mode selection of users and lacking the cooperation between multiple task-specific/universal restoration modes. This leads to insufficient interaction for unprofessional users and […]


Continue.. Hybrid Agents for Image Restoration

Why Prompt Design Matters and Works: A Complexity Analysis of Prompt Search Space in LLMs

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Juntai Cao, Jiaqi Wei, Chenyu You, Dujian Ding | Summary: Despite the remarkable successes of large language models (LLMs), the underlying Transformer architecture has inherent limitations in handling complex reasoning tasks. Chain-of-thought (CoT) prompting has emerged as a practical workaround, but most CoT-based methods rely […]


Continue.. Why Prompt Design Matters and Works: A Complexity Analysis of Prompt Search Space in LLMs

Why Does Your CoT Prompt (Not) Work? Theoretical Analysis of Prompt Space Complexity, its Interaction with Answer Space During CoT Reasoning with LLMs: A Recurrent Perspective

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Juntai Cao, Jiaqi Wei, Chenyu You, Dujian Ding | Summary: Despite the remarkable successes of Large Language Models (LLMs), their fundamental Transformer architecture possesses inherent theoretical limitations that restrict their capability to handle reasoning tasks with increasing computational complexity. Chain-of-Thought (CoT) prompting has emerged as […]


Continue.. Why Does Your CoT Prompt (Not) Work? Theoretical Analysis of Prompt Space Complexity, its Interaction with Answer Space During CoT Reasoning with LLMs: A Recurrent Perspective

Speedy MASt3R

Kavli Affiliate: Cheng Peng | First 5 Authors: Jingxing Li, Yongjae Lee, Abhay Kumar Yadav, Cheng Peng, Rama Chellappa | Summary: Image matching is a key component of modern 3D vision algorithms, essential for accurate scene reconstruction and localization. MASt3R redefines image matching as a 3D task by leveraging DUSt3R and introducing a fast reciprocal […]


Continue.. Speedy MASt3R