Kavli Affiliate: Jia Liu | First 5 Authors: Tianchen Zhou, Jia Liu, Yang Jiao, Chaosheng Dong, Yetian Chen | Summary: Online learning to rank (ONL2R) is a foundational problem for recommender systems and has received increasing attention in recent years. Among the existing approaches for ONL2R, a natural modeling architecture is the multi-armed bandit framework […]
Continue.. Bandit Learning to Rank with Position-Based Click Models: Personalized and Equal Treatments