Kavli Affiliate: Liam Paninski, Nathaniel Sawtell
| Authors: Dan Biderman, Matthew R Whiteway, Cole Hurwitz, Nicholas R Greenspan, Robert S Lee, Ankit Vishnubhotla, Michael Schartner, Julia M Huntenburg, Anup Khanal, Guido T Meijer, Jean-Paul Noel, Alejandro Pan-Vazquez, Karolina Z Socha, Anne E Urai, The International Brain Laboratory, Richard Warren, Dillon Noone, Federico Pedraja, John Cunningham, Nathaniel B Sawtell and Liam Paninski
| Summary:
Contemporary pose estimation methods enable precise measurements of behavior via supervised deep learning with hand-labeled video frames. Although effective in many cases, the supervised approach requires extensive labeling and often produces outputs that are unreliable for downstream analyses. Here, we introduce “Lightning Pose,” an efficient pose estimation package with three algorithmic contributions. First, in addition to training on a few labeled video frames, we use many unlabeled videos and penalize the network whenever its predictions violate motion continuity, multiple-view geometry, and posture plausibility (semi-supervised learning). Second, we introduce a network architecture that resolves occlusions by predicting pose on any given frame using surrounding unlabeled frames. Third, we refine the pose predictions post-hoc by combining ensembling and Kalman smoothing. Together, these components render pose trajectories more accurate and scientifically usable. We release a cloud application that allows users to label data, train networks, and predict new videos directly from the browser.