Kavli Affiliate: Xiang Zhang | First 5 Authors: Guanning Zeng, Guanning Zeng, , , | Summary: We propose YOLO-Count, a differentiable open-vocabulary object counting model that tackles both general counting challenges and enables precise quantity control for text-to-image (T2I) generation. A core contribution is the ‘cardinality’ map, a novel regression target that accounts for variations […]
Continue.. YOLO-Count: Differentiable Object Counting for Text-to-Image Generation