Simplify RLHF as Reward-Weighted SFT: A Variational Method

Kavli Affiliate: Zhuo Li | First 5 Authors: Yuhao Du, Zhuo Li, Pengyu Cheng, Zhihong Chen, Yuejiao Xie | Summary: Reinforcement Learning from Human Feedback (RLHF) is crucial for aligning Large Language Models (LLMs) with human values. However, RLHF has been continuously challenged by its high complexity in implementation and computation consumption. Even with recent […]


Systematic study of the composition of Type I X-ray burst ashes: Neutron star structure v.s. Reaction rate uncertainties

Kavli Affiliate: Renxin Xu | First 5 Authors: Guoqing Zhen, Helei Liu, Akira Dohi, Guoliang Lü, Nobuya Nishimura | Summary: In this study, we calculate for the first time the impacts of neutron star (NS) structure on the type I X-ray burst ashes using the MESA code. We find an increased mass fraction of the heavier […]


FuncGenFoil: Airfoil Generation and Editing Model in Function Space

Kavli Affiliate: Jing Wang | First 5 Authors: Jinouwen Zhang, Junjie Ren, Aobo Yang, Yan Lu, Lu Chen | Summary: Aircraft manufacturing is the jewel in the crown of industry, among which generating high-fidelity airfoil geometries with controllable and editable representations remains a fundamental challenge. While existing deep-learning-based methods rely on predefined parametric function families, […]


FEASTS Combined with Interferometry. III. The Low-column-density HI Around M51 and Possibility of Turbulent-mixing Gas Accretion

Kavli Affiliate: Luis C. Ho | First 5 Authors: Xuchen Lin, Jing Wang, Lister Staveley-Smith, Suoqing Ji, Dong Yang | Summary: With a new joint-deconvolution pipeline, we combine the single-dish and interferometric atomic hydrogen (HI) data of M51 observed by the FAST (FEASTS program) and VLA (THINGS). The product data cube has a typical line […]


The Pristine survey: XXVIII. The extremely metal-poor stream C-19 stretches over more than 100 degrees

Zhen Yuan, Tadafumi Matsuno, Tatyana Sitnova, Nicolas F. Martin, Rodrigo A. Ibata | Summary: The discovery of the most metal-poor stream, C-19, provides us with a fossil record of a stellar structure born very soon after the Big Bang. In this work, we search for new C-19 members over the whole sky by combining two […]

