HCA Data Explorer

Integrated analysis of multimodal single-cell data

Updated April 5, 2023

The simultaneous measurement of multiple modalities represents an exciting frontier for single-cell genomics and necessitates computational methods that can define cellular states based on multimodal data. Here, we introduce "weighted-nearest neighbor" analysis, an unsupervised framework to learn the relative utility of each data type in each cell, enabling an integrative analysis of multiple modalities. We apply our procedure to a CITE-seq dataset of 211,000 human peripheral blood mononuclear cells (PBMCs) with panels extending to 228 antibodies to construct a multimodal reference atlas of the circulating immune system. Multimodal analysis substantially improves our ability to resolve cell states, allowing us to identify and validate previously unreported lymphoid subpopulations. Moreover, we demonstrate how to leverage this reference to rapidly map new datasets and to interpret immune responses to vaccination and coronavirus disease 2019 (COVID-19). Our approach represents a broadly applicable strategy to analyze single-cell multimodal datasets and to look beyond the transcriptome toward a unified and multimodal definition of cellular identity.

Rahul SatijaCenter for Genomics and Systems Biology, New York University, New York, NY 10003, USA; New York Genome Center, New York, NY 10013, USA. Electronic address: rsatija@nygenome.org.rsatija@nygenome.org
Yuhan Hao1
Stephanie Hao2
Erica Andersen-Nissen3
William M Mauck4
Shiwei Zheng1
Andrew Butler1
Maddie J Lee5
Aaron J Wilk5
Charlotte Darby4
Michael Zager6
Paul Hoffman4
Marlon Stoeckius2
Efthymia Papalexi1
Eleni P Mimitou2
Jaison Jain4
Avi Srivastava4
Tim Stuart4
Lamar M Fleming7
Bertrand Yeung8
Angela J Rogers5
Juliana M McElrath7
Catherine A Blish9
Raphael Gottardo7
Peter Smibert10
Rahul Satija (Principal Investigator)11
1Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA; New York Genome Center, New York, NY 10013, USA.
2Technology Innovation Lab, New York Genome Center, New York, NY 10013, USA.
3Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Cape Town HVTN Immunology Lab, Hutchinson Cancer Research Institute of South Africa, Cape Town 8001, South Africa.
4Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA.
5Department of Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA.
6Center for Data Visualization, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
7Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
8BioLegend Inc., San Diego, CA 92121, USA.
9Department of Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA; Chan Zuckerberg Biohub, San Francisco, CA 94063, USA.
10Technology Innovation Lab, New York Genome Center, New York, NY 10013, USA. Electronic address: smibertp@gmail.com.
11Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA; New York Genome Center, New York, NY 10013, USA. Electronic address: rsatija@nygenome.org.
Rachel Schwartz

To reference this project, please use the following link:

https://explore.data.humancellatlas.org/projects/3ce9ae94-c469-419a-9637-5d138a4e642f
None
GEO Series Accessions:INSDC Study Accessions:

Downloaded and exported data is governed by the HCA Data Release Policy and licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). For more information please see our Data Use Agreement.

Analysis Portals

CZ CELLxGENECZ CELLxGENE
UCSC Cell BrowserUCSC Cell Browser

Project Label

humanBloodCiteSeq2

Species

Homo sapiens

Sample Type

specimens

Anatomical Entity

blood

Organ Part

Unspecified

Selected Cell Types

peripheral blood mononuclear cell

Disease Status (Specimen)

HIV infectious disease

Disease Status (Donor)

HIV infectious disease

Development Stage

human adult stage

Library Construction Method

2 library construction methods

Nucleic Acid Source

single cell

Paired End

false

Analysis Protocol

analysis_protocol_1, analysis_protocol_2, analysis_protocol_3, analysis_protocol_4, analysis_protocol_5, analysis_protocol_6

File Format

4 file formats

Cell Count Estimate

8.6k

Donor Count

8
csv.gz2 file(s)h5ad1 file(s)tar1 file(s)xlsx2 file(s)