SEACells infers transcriptional and epigenomic cellular states from single-cell genomics data.

Title: SEACells infers transcriptional and epigenomic cellular states from single-cell genomics data.
Publication Type: Journal Article
Year of Publication: 2023
Authors: Persad S, Choo Z-N, Dien C, Sohail N, Masilionis I, Chaligne R, Nawy T, Brown CC, Sharma R, Pe'er I, Setty M, Pe'er D
Journal: Nat Biotechnol
Date Published: 2023 Mar 27
ISSN: 1546-1696
Abstract

Metacells are cell groupings derived from single-cell sequencing data that represent highly granular, distinct cell states. Here we present single-cell aggregation of cell states (SEACells), an algorithm for identifying metacells that overcome the sparsity of single-cell data while retaining heterogeneity obscured by traditional cell clustering. SEACells outperforms existing algorithms in identifying comprehensive, compact and well-separated metacells in both RNA and assay for transposase-accessible chromatin (ATAC) modalities across datasets with discrete cell types and continuous trajectories. We demonstrate the use of SEACells to improve gene-peak associations, compute ATAC gene scores and infer the activities of critical regulators during differentiation. Metacell-level analysis scales to large datasets and is particularly well suited for patient cohorts, where per-patient aggregation provides more robust units for data integration. We use our metacells to reveal expression dynamics and gradual reconfiguration of the chromatin landscape during hematopoietic differentiation and to uniquely identify CD4 T cell differentiation and activation states associated with disease onset and severity in a Coronavirus Disease 2019 (COVID-19) patient cohort.

DOI: 10.1038/s41587-023-01716-9
Alternate Journal: Nat Biotechnol
PubMed ID: 36973557
PubMed Central ID: 5762154
Grant List: R35 GM147125 / GM / NIGMS NIH HHS / United States
U54 CA209975 / CA / NCI NIH HHS / United States
U2C CA233284 / CA / NCI NIH HHS / United States
