New developments in high-throughput biological studies mean that the genes that are active in just a single cell can now be determined. However, analyzing the complex datasets that result can be challenging. Now, a team at Osaka University has developed CAPITAL, a new computational tool for comparing complex datasets from single cells (Fig. 1).
RNA sequencing provides information on the subset of the entire population of genes that are actively being expressed, or are “switched on.” As technology has advanced, it has become possible to sequence the RNA population of just a single cell. This can provide a great deal of information on the specific changes in gene expression involved when a large population of mixed cells undergoes dynamic, transitional processes, such as differentiation or cell death, because each individual cell can be specifically analyzed rather than all the different cell types being pooled.
CAPITAL is specifically designed to compare complex datasets from single cells undergoing transitional processes. These analyses are carried out by defining a “pseudotime trajectory,” which places the cells along a hypothetical path that reflects their progress through the transitional process. These trajectories are not always straightforward and linear; they can become very complex and branching. In the past, only linear trajectories could be aligned for comparison [1, 2], but the team’s innovation means that complex branching trajectories can now be accurately and automatically aligned and compared.
After developing the algorithm used for CAPITAL, which implements a method known as tree alignment , they tested it on both synthetic datasets and authentic datasets from bone marrow cells [4-6]. The results demonstrated that CAPITAL is statistically more accurate and robust than the computational algorithms that existed previously [7-9], showing major advances over these methods (Fig. 2).
Trajectory comparison is a powerful analysis that can, as an example, identify the gene expression dynamics between different species to provide information on evolutionary processes. “We showed in this study that CAPITAL can reveal the existence of different molecular patterns between humans and mice even when some of the expression patterns are similar and appear to be conserved”, says lead author Reiichi Sugihara (Fig. 3). “This will allow the identification of novel regulators that determine cell fates”. This technology is not limited to just this type of data, as senior author Yuki Kato explains: “Our novel computational tool can be applied to a wide range of high-throughput datasets, including pseudotemporal, spatial, and epigenetic data”.
This powerful new technique will allow the global comparison of single-cell trajectories, which may lead to the identification of novel disease-associated genes that could not be identified by earlier comparative methods. Thus, CAPITAL represents a significant advance in the field of single-cell biology.
- Alpert, A., Moore, L. S., Dubovik, T. & Shen-Orr, S. S. Alignment of single-cell trajectories to compare cellular expression dynamics. Nat. Methods 15, 267–270 (2018).
- Cacchiarelli, D. et al. Aligning single-cell developmental and reprogramming trajectories identifies molecular determinants of myogenic reprogramming outcome. Cell Syst. 7, 258–268 (2018).
- Jiang, T., Wang, L. & Zhang, K. Alignment of trees—an alternative to tree edit. Theor. Comput. Sci. 143, 137–148 (1995).
- Setty, M. et al. Characterization of cell fate probabilities in single-cell data with Palantir. Nat. Biotechnol. 37, 451–460 (2019).
- Velten, L. et al. Human haematopoietic stem cell lineage commitment is a continuous process. Nat. Cell Biol. 19, 271–281 (2017).
- Paul, F. et al. Transcriptional heterogeneity and lineage commitment in myeloid progenitors. Cell 163, 1663–1677 (2015).
- Lopez, R., Regier, J., Cole, M. B., Jordan, M. I. & Yosef, N. Deep generative modeling for single-cell transcriptomics. Nat. Methods 15, 1053–1058 (2018).
- Hie, B., Bryson, B. & Berger, B. Efficient integration of heterogeneous single-cell transcriptomes using Scanorama. Nat. Biotechnol. 37, 685–691 (2019).
- Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587 (2021).