Kavli Affiliate: Kristofer Bouchard
| Authors: Joel Ye, Fabio Rizzoglio, Adam Smoulder, Hongwei Mao, Xuan Ma, Patrick Marino, Raeed H Chowdhury, Dalton D Moore, Gary Blumenthal, Will Hockeimer, Nicolas G. Kunigk, J Patrick Mayo, Aaron P Batista, Steven M Chase, Adam G Rouse, Michael L. Boninger, Charles Greenspon, Andrew B. Schwartz, Nicholas Hatsopoulos, Lee E Miller, Kristofer Bouchard, Jennifer Collinger, Leila Wehbe and Robert Gaunt
| Summary:
Mapping the relationship between neural activity and motor behavior is a central aim of sensorimotor neuroscience and neurotechnology. While most progress toward this aim has relied on restricting complexity, the advent of foundation models instead proposes integrating a breadth of data as an alternative avenue for broadly advancing downstream modeling. We quantify this premise for motor decoding from intracortical microelectrode data, pretraining an autoregressive Transformer on 2000 hours of neural population spiking activity paired with diverse motor covariates from over 30 monkeys and humans. The resulting model is broadly useful, improving decoding on 8 downstream tasks and generalizing to a variety of neural distribution shifts. However, we also highlight that scaling autoregressive Transformers seems unlikely to resolve limitations stemming from sensor variability and output stereotypy in neural datasets. Code: https://github.com/joel99/ndt3