Kavli Affiliate: Jin Kang
| Authors: Minzhe Guo, Michael P. Morley, Yixin Wu, Yina Du, Shuyang Zhao, Andrew Wagner, Michal Kouril, Kang Jin, Nathan Gaddis, Joseph A. Kitzmiller, Kathleen Stewart, Maria C. Basil, Susan M. Lin, Yun Ying, Apoorva Babu, Kathryn A. Wikenheiser-Brokamp, Kyu Shik Mun, Anjaparavanda P. Naren, Sara Lin, Geremy Clair, Joshua N. Adkins, Gloria S. Pryhuber, Ravi S. Misra, Bruce J. Aronow, Timothy L. Tickle, Nathan Salomonis, Xin Sun, Edward E. Morrisey, Jeffrey A. Whitsett and Yan Xu
| Summary:
Accurate cell type identification is a key and rate-limiting step in single cell data analysis. Single cell references with comprehensive cell types, reproducible and functional validated cell identities, and common nomenclatures are much needed by the research community to optimize automated cell type annotation and facilitate data integration, sharing, and collaboration. In the present study, we developed a novel computational pipeline to utilize the LungMAP CellCards as a dictionary to consolidate single-cell transcriptomic datasets of 104 human lungs and 17 mouse lung samples and constructed “LungMAP CellRef” and “LungMAP CellRef Seed” for both normal human and mouse lungs. “CellRef Seed” has an equivalent prediction power and produces consistent cell annotation as does “CellRef” but improves computational efficiency and simplifies its utilization for fast automated cell type annotation and online visualization. This atlas set incorporates 48 human and 40 mouse well-defined lung cell types catalogued from diverse anatomic locations and developmental time points. Using independent datasets, we demonstrated the utility of our CellRefs for automated cell type annotation analysis of both normal and disease lungs. User-friendly web interfaces were developed to support easy access and maximal utilization of the LungMAP CellRefs. LungMAP CellRefs are freely available to the pulmonary research community through fast interactive web interfaces to facilitate hypothesis generation, research discovery, and identification of cell type alterations in disease conditions.