Integrated analysis of multimodal single-cell data
Updated April 5, 2023The simultaneous measurement of multiple modalities represents an exciting frontier for single-cell genomics and necessitates computational methods that can define cellular states based on multimodal data. Here, we introduce "weighted-nearest neighbor" analysis, an unsupervised framework to learn the relative utility of each data type in each cell, enabling an integrative analysis of multiple modalities. We apply our procedure to a CITE-seq dataset of 211,000 human peripheral blood mononuclear cells (PBMCs) with panels extending to 228 antibodies to construct a multimodal reference atlas of the circulating immune system. Multimodal analysis substantially improves our ability to resolve cell states, allowing us to identify and validate previously unreported lymphoid subpopulations. Moreover, we demonstrate how to leverage this reference to rapidly map new datasets and to interpret immune responses to vaccination and coronavirus disease 2019 (COVID-19). Our approach represents a broadly applicable strategy to analyze single-cell multimodal datasets and to look beyond the transcriptome toward a unified and multimodal definition of cellular identity.
To reference this project, please use the following link:
Downloaded and exported data is governed by the HCA Data Release Policy and licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). For more information please see our Data Use Agreement.
Analysis Portals
Project Label
humanBloodCiteSeq2Species
Sample Type
Anatomical Entity
Organ Part
Selected Cell Types
Disease Status (Specimen)
Disease Status (Donor)
Development Stage
Library Construction Method
Nucleic Acid Source
Paired End
falseAnalysis Protocol
analysis_protocol_1, analysis_protocol_2, analysis_protocol_3, analysis_protocol_4, analysis_protocol_5, analysis_protocol_6File Format
Cell Count Estimate
8.6kDonor Count
8