Kavli Affiliate: Max Tegmark | First 5 Authors: Ziming Liu, Yizhou Liu, Jeff Gore, Max Tegmark, | Summary: Beyond neural scaling laws, little is known about the laws underlying large language models (LLMs). We introduce Neural Thermodynamic Laws (NTL) — a new framework that offers fresh insights into LLM training dynamics. On the theoretical side, […]
Continue.. Neural Thermodynamic Laws for Large Language Model Training