dbos – Page 73 – Kavli Institute Pre-Print Publications

Benchmarking of quantum and classical SDP relaxations for QUBO formulations of real-world logistics problems

Posted by dbos March 13, 2025March 26, 2025UCSB

Kavli Affiliate: David Gross | First 5 Authors: Birte Ostermann, Taylor Garnowski, Fabian Henze, Vaibhavnath Jha, Asra Dia | Summary: Quadratic unconstrained binary optimization problems (QUBOs) are intensively discussed in the realm of quantum computing and polynomial optimization. We provide a vast experimental study of semidefinite programming (SDP) relaxations of QUBOs using sums of squares […]

Continue..

A Hierarchical Shock Model of Ultra-High-Energy Cosmic Rays

Posted by dbos March 13, 2025March 26, 2025Stanford

Kavli Affiliate: Roger Blandford | First 5 Authors: Paul Simeon, Noémie Globus, Kirk S. S. Barrow, Roger Blandford, | Summary: We propose that a hierarchical shock model$unicode{x2014}$including supernova remnant shocks, galactic wind termination shocks, and accretion shocks around cosmic filaments and galaxy clusters$unicode{x2014}$can naturally explain the cosmic ray spectrum from ~1 GeV up to ~200 […]

Continue..

Cosmic Reionization on Computers: Statistical Properties of the Distributions of Mean Opacities

Posted by dbos March 13, 2025March 26, 2025Chicago

Kavli Affiliate: Nickolay Y. Gnedin | First 5 Authors: Ella Werre, David Robinson, Camille Avestruz, Nickolay Y. Gnedin, | Summary: Quasar absorption lines provide a unique window to the relationship between galaxies and the intergalactic medium during the Epoch of Reionization. In particular, high redshift quasars enable measurements of the neutral hydrogen content of the […]

Continue..

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

Posted by dbos March 13, 2025Peking

Kavli Affiliate: Li Xin Li | First 5 Authors: Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao | Summary: Existing large language model (LLM) evaluation benchmarks primarily focus on English, while current multilingual tasks lack parallel questions that specifically assess cross-linguistic reasoning abilities. This dual limitation makes it challenging to comprehensively assess LLMs’ […]

Continue..

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

Posted by dbos March 13, 2025Peking

Kavli Affiliate: Li Xin Li | First 5 Authors: Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao | Summary: Traditional benchmarks struggle to evaluate increasingly sophisticated language models in multilingual and culturally diverse contexts. To address this gap, we introduce MMLU-ProX, a comprehensive multilingual benchmark covering 13 typologically diverse languages with approximately 11,829 […]

Continue..

SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading

Posted by dbos March 13, 2025March 26, 2025Caltech

Kavli Affiliate: Wei Gao | First 5 Authors: Qiaoling Chen, Shenggui Li, Wei Gao, Peng Sun, Yonggang Wen | Summary: In recent years, Large Language Models (LLMs) have exhibited remarkable capabilities, driving advancements in real-world applications. However, training LLMs on increasingly long input sequences imposes significant challenges due to high GPU memory and computational demands. […]

Continue..

Hybrid Agents for Image Restoration

Posted by dbos March 13, 2025March 31, 2025Peking

Kavli Affiliate: Li Xin Li | First 5 Authors: Bingchen Li, Xin Li, , , | Summary: Existing Image Restoration (IR) studies typically focus on task-specific or universal modes individually, relying on the mode selection of users and lacking the cooperation between multiple task-specific/universal restoration modes. This leads to insufficient interaction for unprofessional users and […]

Continue..

Why Prompt Design Matters and Works: A Complexity Analysis of Prompt Search Space in LLMs

Posted by dbos March 13, 2025Berkeley

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Juntai Cao, Jiaqi Wei, Chenyu You, Dujian Ding | Summary: Despite the remarkable successes of large language models (LLMs), the underlying Transformer architecture has inherent limitations in handling complex reasoning tasks. Chain-of-thought (CoT) prompting has emerged as a practical workaround, but most CoT-based methods rely […]

Continue..

Why Does Your CoT Prompt (Not) Work? Theoretical Analysis of Prompt Space Complexity, its Interaction with Answer Space During CoT Reasoning with LLMs: A Recurrent Perspective

Posted by dbos March 13, 2025March 26, 2025Berkeley

Kavli Affiliate: Xiang Zhang | First 5 Authors: Xiang Zhang, Juntai Cao, Jiaqi Wei, Chenyu You, Dujian Ding | Summary: Despite the remarkable successes of Large Language Models (LLMs), their fundamental Transformer architecture possesses inherent theoretical limitations that restrict their capability to handle reasoning tasks with increasing computational complexity. Chain-of-Thought (CoT) prompting has emerged as […]

Continue..

Speedy MASt3R

Posted by dbos March 13, 2025March 26, 2025UCAS

Kavli Affiliate: Cheng Peng | First 5 Authors: Jingxing Li, Yongjae Lee, Abhay Kumar Yadav, Cheng Peng, Rama Chellappa | Summary: Image matching is a key component of modern 3D vision algorithms, essential for accurate scene reconstruction and localization. MASt3R redefines image matching as a 3D task by leveraging DUSt3R and introducing a fast reciprocal […]

Continue..