Kavli Affiliate: Xiang Zhang | First 5 Authors: Nan Huang, Christian Kümmerle, Xiang Zhang | Summary: Normalization techniques are crucial for enhancing Transformer models’ performance and stability in time series analysis tasks, yet traditional methods like batch and layer normalization often lead to issues such as token shift, attention shift, and sparse attention. We […]
UnitNorm: Rethinking Normalization for Transformers in Time Series