, Suresh Prajapati1
, Mansi Patel2
, Reeshu Gupta1,2
1Parul Institute of Applied Sciences, Parul University, Gujarat, India
2Research and Development Cell, Parul University, Gujarat, India
© 2026 The Korean Liver Cancer Association.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Conflicts of Interest
The authors have no conflicts of interests to declare.
Ethics Statement
This review article is fully based on articles which have already been published and did not involve additional patient participants. Therefore, IRB approval is not necessary.
Funding Statement
Not applicable.
Data Availability
Not applicable.
Author Contributions
Conceptualization: CJ, RG
Data curation: CJ, SP
Formal analysis: CJ, SP
Investigation: CJ, MP
Methodology: CJ, SP, MP
Project administration: CJ
Resources: RG
Supervision: RG
Writing - original draft: SP, RG
Writing - review & editing: RG
| Methods | Mechanisms | Analysis tool | Advantages | Disadvantages | Applications |
|---|---|---|---|---|---|
| Single-cell tagged reverse transcription (STRT-seq)20 | In this oligo dT primers containing barcode and primer binding sequence is used for the reverse transcription. Consequently, moloney murine leukemia virus reverse transcriptase (MMLV RT) introduces the barcode sequence at the 5’-end of the synthesized cDNA. MMLV RT also adds common sequence at the 5’-end of c-DNA. Sequencing reads are then generated specifically from 5’-end-tagged region. This allows for accurate quantification of gene expression based on transcription start sites | STRTprep pipeline72 | Early barcoding supports multiplexing | Complex protocol | Especially useful for transcript counting and identifying TSS |
| This is followed for single cell downstream analysis on Seurat | Exact location of the 5’-end of transcripts | Low sensitivity | |||
| Low RNA input needed14 | It only sequences a short region near the 5’-end of each transcript | ||||
| cDNA synthesis starts at the 3’-end of RNA using an oligo dT primer, which can lead to inefficient capture of degraded mRNAs or exclude transcripts without intact poly(A) tails | |||||
| Switching mechanism at the 5’-end of RNA template sequencing (SMART-seq2)21 | In this MMLV RT adds few nucleotides at 3’- end of cDNA due to the terminal transferase activity of MMLV RT. In this TSO containing LNA is used to enhance the stability and efficiency of reverse transcription. Additionally, betaine is used to reduce secondary structure of transcript. This strategy enables capturing of the 5’ cap-proximal region, preserving fulllength information | ScPipe73 | Full-length coverage across transcripts | Barcoding is not done and therefore, high throughput multiplexing is not favoured | Suitable for isoforms, splicing, low expression genes |
| nf-core/scrnaseq74 | Detects low-abundance transcripts | Expensive | |||
| Require as little as 50 pg RNA | Labor intensive | ||||
| Use of LNA-modified TSO, optimized oligo dT, and betaine improves reverse transcription efficiency | Lack of strand specificity | ||||
| Unable to detect nonpolyadenylated (poly[A]-) RNA | |||||
| Cell expression level RNA sequencing (CEL-seq)17 | In this method primers containing oligo dT, barcode, an Illumina 5’ sequencing adaptor, and a T7 promoter are used for RT. Briefly, RNA is isolated after cell lysis, and then converted to cDNA with CEL-seq-primer. Once barcoded, the cDNAs from multiple cells are combined into one tube. After completion of second-strand cDNA synthesis, several samples are combined and subjected to IVT reactions | zUMIs75 | Minimize amplification bias because the technique uses IVT and not PCR | Captures just the 3’-ends of mRNA transcripts | Suitable for balance throughput and sensitivity |
| Early barcoding allows pooling of samples and thus reduces batch effects, and cost | Multiple enzymatic steps are used and thus it is more technically demanding and time-consuming | ||||
| Highly sensitive and capable of detecting a large number of genes per cell, even at low input RNA levels | Usually done in plate formats and thus lower throughput | ||||
| High reproducibility | |||||
| Indexed droplet sequencing (InDrop)9,19 | Briefly, in this method, individual cells are encapsulated into nanoliter-sized droplets containing lysis buffer, RT mix, and primers composed of poly(dT) sequences, UMIs, cell barcodes, sequencing adaptors, T7 RNA polymerase promoters, and photocleavable spacers. Following photocleavage, primers are released, enabling cDNA synthesis and the incorporation of cell-specific barcodes during reverse transcription. After generating the second cDNA strand, IVT is conducted to amplify the material. The droplets are subsequently disrupted, and the amplified RNA is fragmented via zinc-ion-mediated cleavage. These RNA fragments are then reverse-transcribed to produce a cDNA library suitable for next-generation sequencing. Finally, the cDNA libraries are sequenced on Illumina sequencing platforms | zUMIs75 | High throughput, low cost | Detects fewer transcripts per cell | |
| The method can index over 15,000 cells per hour, demonstrates minimal technical variability, and is highly adaptable for integration with other sequencing-based platforms | Captures mainly the 3’-end of mRNAs | ||||
| Requires microfluidic devices and specialized expertise | |||||
| Scalable and automated | Doublet’s formation | ||||
| Barcodes collision or uneven capture | |||||
| Geographical position sequencing (GEO-seq)15,18 | It combines LCM with SMART-seq2-based RNA-seq. In this cell are precisely isolated from tissues on the basis of their location. Following that RNA is extracted and reverse transcribed. Full length cDNA is amplified like SMART-seq2 | Standard scRNA-seq pipelines76,77 | Avoids biases introduced by enzymatic dissociation or cell sorting, preserving native cell states | Requires high-quality tissue sections | Study spatial heterogeneity |
| High-efficiency, high-resolution strategy for spatial transcriptome analysis | Laser capture microdissection is timeconsuming and labour-intensive | ||||
| Wide application potentials such as prospective cell fates, biological functions and gene regulatory networks | Requires specialized equipment (LCM system) and skilled operators, increasing cost and technical barriers | ||||
| Multiple annealing and tailing-based quantitative scRNA-seq (MATQ)18,22 | In this method instead of oligo dT, multiple random primers are used that anneal across RNA. Poly(C) tail is added at the 3’-end of first stand of cDNA by terminal deoxynucleotidyl transferase. Similar to SMART-seq, reverse transcriptase adds an Illumina Truseq adaptor to the 5’-end. Because of the random priming and optimized reverse transcription, MATQseq captures full-length transcripts, including non-coding and non-polyadenylated RNA | Standard full-length scRNA-seq pipelines76,77 | High sensitivity | Random priming can lead to nonspecific amplification | Capture non-coding and mitochondrial RNA |
| Captures both poly(A) and nonpoly(A) RNAs | Complex library preparation | ||||
| No built-in barcoding | |||||
| Lower throughput | |||||
| 10x Chromium Genomics9,16,18 | In this method, microfluidic chip is used to combine single cell, barcoded beads having millions of oligonucleotides, enzymes and reagents for RT. RT is initiated inside each droplet. Following that barcodes and UMIs are incorporated. Following this, the GEMs are disrupted to release the barcoded cDNAs, which are then pooled for subsequent amplification and library construction | CellRanger78,79 | High throughput | Captures only a fraction (to 10-20%) of total mRNA in a cell | Analyse thousands of cells |
| Seurat78,80 | Automated and reproducible | High cost | |||
| Low technical noise | Requires high-quality single-cell suspensions | ||||
| Time saving | Doublets or multiplets | ||||
| Supports immune profiling, chromatin accessibility, spatial transcriptomics | Only captures the 3’- or 5’-end of transcripts |
scRNA-seq, single-cell RNA sequencing; dT, deoxythymidine; cDNA, complementary DNA; STRTprep, Single-cell tagged reverse transcription-preparation; TSS, transcription start sites; TSO, template switching oligonucleotide; LNA, locked nucleic acid; IVT, in vitro transcription; PCR, polymerase chain reaction; RT, reverse transcription; UMIs, unique molecular identifiers; LCM, laser capture microdissection; GEMs, gel bead-in-emulsions.
| Methods | Mechanisms | Analysis tool | Advantages | Disadvantages |
|---|---|---|---|---|
| Reduced representation bisulfite sequencing (RRBS)9,26 | It utilizes MspI restriction enzyme-which cuts DNA at all CCGG sites, regardless of their DNA methylation status. After digestion, selected fragments are size-selected (typically 40-220 bp), enriching for CpG-rich regions such as promoters and CpG islands. Bisulphite convert unmethylated DNA to uracil (read as T in sequencing). In sequencing presence of T confirms unmethylation while presence of C confirms methylated site. It enables the measurement of DNA methylation levels at 5-10% of all CpG sites in the mammalian genome26 | RnBeads81,82 | Relatively low cost | Limited to regions near Msp1 sites, hence low coverage |
| Standardized and well-validated | Not suitable for global methylation patterns | |||
| Whole genome bisulfite sequencing (WGBS)83,84 | It involves fragmentation of DNA and did not use Msp1. Following this entire genome is then sequenced after bisulphite treatment85 | Bismark81,86 | Covers nearly all CpG sites | Low library complexity |
| MethPipe87 | Can detect non-CpG methylation (e.g., CHH, CHG) | More input DNA required than reduced methods | ||
| MethylDackel88 | ||||
| MethylKit89 | ||||
| DSS90 | ||||
| RnBeads81,82 | ||||
| CpG island sequencing (CGI-seq)9,83,91 | In this after fragmentation of DNA, methylated or unmethylated DNA is enriched either using MBD proteins or using specific restriction enzymes respectively. Following that enriched DNA is sequenced using standard next-generation sequencing platforms | MEDIPS81,92 | High efficiency, simplified procedure | Inconsistent and/or low coverage |
| MethylKit89 | Targeted enrichment | Biased toward CpG islands | ||
| Assay for transposase-accessible chromatin using sequencing (ATAC-seq)9,83,93 | In this Tn5 transposase cut and tags accessible DNA. Tn5 inserts adapters preferentially to nucleosome free regions. Following that tagged fragments are PCR amplified and sequenced | nf-core/atacseq94 | High sensitivity for detecting open chromatin | Low recovery of DNA fragments |
| ENCODE-ATAC95 | Useful for mapping transcription factor footprints | Tn5 sequence bias | ||
| Low input can lead to noisy data | ||||
| DNase-seq9,28 | DNase I cuts DNA at accessible site. The cleaved fragments are sequenced after that. Sequences that bound to regulatory proteins are protected from DNase l digestion. Deep sequencing provides identify accurate location of regulatory proteins in the genome | ENCODE DNase-seq96 | Simplicity | Large amount of material needed, high error rate |
| Can detect open chromatin4 | DNase l is sequence-specific and hypersensitive sites might not account for the entire genome6 | |||
| No prior knowledge of the sequence or binding protein is required | DNA loss through the multiple purification steps limits sensitivity | |||
| Technically challenging, including optimization of DNase digestion | ||||
| chromatin immunoprecipitation followed by sequencing (ChIP-seq)27,83 | Cells are treated with formaldehyde to crosslink DNA and proteins. A specific antibody is used to pull down DNA-protein complex. After that cross-linking is reversed and DNA is sequenced | ENCODE ChIP-seq96 | High resolution, high coverage | Highly dependent on the quality of antibody |
| nf-core/chipseq pipeline74 | Reveals regulatory landscapes | Requires high-quality, specific antibodies | ||
| DROMPAplus27 | Works with various cell types and conditions | Crosslinking and immunoprecipitation can be inefficient or biased | ||
| High input requirement | ||||
| High background noise | ||||
| Droplet chromatin immunoprecipitation followed by sequencing (Drop-ChIP)24 | Individual nuclei of cells, barcoded beads, and reagents are co-encapsulated in droplets using a microfluidic device. ChIP is performed using antibodies targeting specific histone modifications DNA fragments are captured on barcoded beads. Following that emulsions are broken, and DNA is purified, amplified, and sequenced | Standard tools for single-cell epigenomic data are used97,98 | High throughput | Low coverage |
| High specificity | Technically challenging, requires microfluidic devices | |||
| Provides greater resolution of chromatin state heterogeneity in complex tissues | Limited to histone marks | |||
| Single-nucleus methylcytosine sequencing (snmC-seq)23,83 | Unlike standard WGBS, snmC-seq uses post-bisulfite adapter tagging to prevent DNA degradation and loss during bisulfite treatment. Following that DNA is sequenced to read methylation patterns | Bismark86 | Ultra-low-input DNA requirement | Suffers from DNA degradation |
| PCR bias can be introduced during amplification | ||||
| single-cell genome and epigenome transfer sequencing (scGET-seq1)99 | The method utilizes a recombinant transposase (TnH), created by fusing Tn5 with the chromodomain of HP1-α, which imparts binding affinity for H3K9me3-enriched heterochromatin. The combined use of Tn5 and TnH allows for comprehensive analysis of accessible and compacted chromatin states and their dynamic alterations | Snakemake100,101 | High resolution | Technically complex protocol |
| Requires advanced bioinformatics tools to integrate ATAC-seq and histone signal properly | ||||
| Dependent on quality of antibody | ||||
| Single nucleus barcoding assay for transposase-accessible chromatin sequencing (SNuBar-ATAC)29 | A single oligonucleotide adaptor containing unique barcodes is used during the tagmentation step, where a transposase enzyme inserts DNA fragments into open chromatin regions. Libraries are prepared for sequencing, and the barcodes are used to identify the origin of each fragment | SNuBar-ATAC pipeline29 | Robust chromatin accessibility profiling even from frozen or difficult tissues, and it is well-suited for tissues where nuclei isolation is more feasible than whole-cell isolation | Requires nuclei isolation optimization |
| Multiplexing capability | Each nucleus provides limited signal, requiring deep sequencing | |||
| Scalable | ||||
| Single-cell chromatin integration labelling sequencing (scChIL-seq)83,102 | Tn5 transposase is fused to protein A (pA-Tn5) or introduced via secondary antibodies, so it binds to the target histone mark. Transposition occurs in situ at the antibody-bound regions, integrating sequencing adapters directly at the chromatin site. The resulting DNA fragments are then amplified and sequenced | nf-core/chipseq pipeline74 | No need for nuclei isolation or bulk chromatin fragmentation | Antibody-dependent |
| Suitable for clinical or frozen samples | Require specialized bioinformatics pipelines | |||
| Single-cell cleavage under targets and tagmentation (scCUT&Tag)103 | In this method cells/nuclei are permeabilized. Primary anti-bodies bind to the target protein/epigenetic mark (e.g., H3K27me3, H3K4me1). A secondary antibody is used to tether a fusion protein of Protein A/G and Tn5 transposase (pA-Tn5) to the chromatin at the mark. Upon activation, Tn5 integrates sequencing adapters into nearby DNA. In scCUT&Tag, this is done on a single-cell platform. DNA fragments are barcoded per cell and sequenced | ChromHMMand104 | Low input requirement | Dependent on antibody quality |
| Segway104 | High signal-to-noise ratio (less background than ChIP) | |||
| Captures histone modifications, TF binding, and chromatin accessibility | ||||
| Compatible with frozen or fixed tissues | ||||
| Single-cell combinatorial indexing for transposase-accessible chromatin profiling sequencing (sciTIP-seq)25,83 | The transposase inserts sequencing adapters near the bound protein sites. Nuclei are indexed multiple times. In the first round of indexing fixed nuclei are randomly distributed across a 96- or 384-well plate. Each well contains a unique barcode. Barcodes are introduced by tagmentation or ligation. All nuclei are pooled together and then redistributed randomly into a new plate for the second round of indexing, introducing a second barcode. Each nucleus receives a unique combination of barcodes enabling single-nucleus resolution without physically isolating every cell | SnapATAC105 | High-throughput | Dependent on antibody quality |
| Cost-efficient | Barcoding collisions can occur if not enough barcode diversity | |||
| Scalable to tens of thousands of nuclei |
| Single-cell technology | Major findings |
|---|---|
| Tumor immune microenvironment | |
| scRNA-seq | NQO1+ macrophages may have an immunosuppressive effect on HCC106 |
| scRNA-seq | Four distinct TAMs were identified. Specifically, TAM-c4 was enriched in the advanced-stage patients or those receiving ICT and found to be related to a short survival time and low abundance of CD8+ T cells in primary liver cancers107 |
| scRNA-seq, spatial transcriptomics and transcriptome profiling | A novel FMO2+ CAF subset serves as a critical regulator of microenvironmental immune properties and a predictive biomarker of the immunotherapy response in patients with HCC. CCL19 in combination with anti-PD-1 therapy may constitute a novel therapeutic strategy for HCC108 |
| Spatial transcriptomics and scRNA-seq | High protein kinase, DNA-activated, catalytic subunit expression is associated with shorter survival times and an abnormal tumor microenvironment, highlighting its impact on immune cell infiltration and HCC prognosis109 |
| Spatial transcriptomics and scRNA-seq | LAMA4+ CD90+ extracellular matrix CAFs provide immunosuppressive microenvironment for liver cancer through induction of CD8+ T cell senescence110 |
| scRNA-seq | Glycan-HCCs were associated with multifaceted immune distortion, including exhaustion of T cells and enriched SPP1+ macrophages and leads to worse survival111 |
| Single-cell, bulk, and spatial transcriptome profiling with multiplexed immunofluorescence | The crosstalk between DAB2+ TAMs and FAP+ CAFs seem to be a key determinant in shaping the tumor immune barrier. Therapeutic strategies that disrupt this interaction could potentiate immunotherapeutic responses and improve patient prognosis112 |
| scRNA-seq, whole-exome sequencing, whole-transcriptome sequencing, and NGS-based HBV integration analysis | In this study, patients were categorized into two groups on the basis of effector CD8+ T-cell exhaustion markers high and low exhaustion groups. The high-exhaustion group exhibited higher PDCD1 expression, higher TP53 mutation rates, clonal expansion of CD4+ regulatory T cells and follicular helper T cells, and more pronounced HBV integrations with elevated intrahepatic covalently cccDNA and pgRNA levels. These findings provide insight into the intricate relationship between high exhaustion, proliferation sub-type, increased HBV integrations, and enhanced HBV-induced oncogenic potential in virus-related HCC113 |
| Tumor cell heterogeneity & spatial features | |
| scRNA-seq | The analysis demonstrated regional heterogeneity in cellular composition and malignant potential across the tumor, with the T1 region displaying the greatest degree of malignancy, characterized by the upregulation of HMGB2 and TOP2A114 |
| Spatial transcriptomics and scRNA-seq | Identified three subtypes in tumor cells, including ARG1+ metabolism subtype (metab-subtype), TOP2A+ proliferation phenotype, and S100A6+ pro-metastatic subtype (EMT-subtype)115 |
| scCPA-Tag | Identified a tumour cell subtype (C2) with more aggressive features. This subtype was characterized by high chromatin accessibility and a lower abundance of H3K27me3 on tumour-promoting genes116 |
| Spatial transcriptomics sequencing | The study identified one LPC cluster, three LPC subpopulations, and four distinct cellular modules, indicating the heterogeneity within LPC and the diversity between LPCs and epithelial cells117 |
| Single-cell immune repertoire sequencing, mass cytometry, and multiplex immunofluorescence | The study describes the TME landscape and identified six prognosis-related cell subclusters in this landscape118 |
| Tumor endothelial cells | |
| scRNA-seq | The analysis uncovered significant heterogeneity among TECs in HCC, delineating two subgroups (TEC1 and TEC2) with distinct functional roles and signaling profiles119 |
| scRNA-seq | This analysis delineated the intratumoral cellular heterogeneity underlying MVI, identifying MARCKSL1+ MVI-positive malignant cells as key contributors to MVI progression via activation of the PTN signaling pathway120 |
| Relapse, prognostic risk, and cellular senescence | |
| scRNA-seq and bulk RNA-seq datasets | Related genes from senescence-related pathways were used to identify senescence-related molecular sub-types in HCC with G6PD as a key gene, potentially serving as a senescence-related target in liver cancer121 |
| scRNA-seq | Analysis of primary versus early-relapsed HCC demonstrated elevated infiltration of CD8+ T cells and malignant cells in relapsed tumors, alongside a marked decline in CD4+ T cell populations122 |
| scRNA-seq | Identified a distinct senescent-associated subset of CD34+ CLDN5+ endothelial cells, which mainly enriched in tumor tissue. These cells increased cholangiocellular phenotype in HCC via IGF2-IGF2R signaling123 |
| Bulk RNA sequencing, proteomic analysis, scRNA-seq, spatial transcriptomics sequencing, and genome sequencing | Reveals a novel subtype of HCC with biological and clinical relevance. Patients with tumor purity-TME high-risk subtypes show pronounced hypoxia and activation of Wnt/β-catenin, Notch, and TGF-β pathways. Notably, a novel XPO1+ epithelial subtype shares these high-risk signatures and aggressive behavior124 |
scRNA-seq, single-cell RNA sequencing; NQO1, NAD(P)H quinone dehydrogenase 1; HCC, hepatocellular carcinoma; TAM, tumor-associated macrophage; ICT, immune checkpoint therapy; FMO2, flavin-containing monooxygenase 2; CAF, cancer-associated fibroblast; CCL19, C-C motif chemokine ligand 19; PD-1, programmed death 1; LAMA4, laminin subunit alpha 4; SPP1, secreted phosphoprotein 1; DAB2, disabled homolog 2; FAP, fibroblast activation protein; NGS, next generation sequencing; HBV, hepatitis B virus; cccDNA, closed circular DNA; pgRNA, pregenomic RNA; HMGB2, high mobility group box 2; TOP2A, topoisomerase (DNA) II alpha; scCPA, single-cell clustering-based pathway analysis; EMT, epithelial-to-mesenchymal transition; LPC, liver progenitor cell; TEC, tumor endothelial cell; MVI, microvascular invasion; PTN, pleiotrophin; IGF2, insulin-like growth factor 2; IGF2R, insulin-like growth factor 2 receptor; TGF-β, transforming growth factor-beta; XPO1, exportin 1.
| Clinical trial identifier | Country | Year | Title | Status |
|---|---|---|---|---|
| NCT05677724 | China | 2022 | Single-cell RNA Sequencing Resolves the Regulatory Role of HBV on the Hepatocellular Carcinoma Immune Microenvironment | Unknown status |
| NCT05171439 | China | 2022 | Surufatinib in Advanced Hepatocellular Carcinoma Based on Single-cell Sequencing of Tumor Samples | Unknown status |
| NCT05540925 | China | 2022 | Vascular Invasion Signatures in cfDNA Support Re-staging of Liver Cancer | Completed* |
| Methods | Mechanisms | Analysis tool | Advantages | Disadvantages | Applications |
|---|---|---|---|---|---|
| Single-cell tagged reverse transcription (STRT-seq)20 | In this oligo dT primers containing barcode and primer binding sequence is used for the reverse transcription. Consequently, moloney murine leukemia virus reverse transcriptase (MMLV RT) introduces the barcode sequence at the 5’-end of the synthesized cDNA. MMLV RT also adds common sequence at the 5’-end of c-DNA. Sequencing reads are then generated specifically from 5’-end-tagged region. This allows for accurate quantification of gene expression based on transcription start sites | STRTprep pipeline72 | Early barcoding supports multiplexing | Complex protocol | Especially useful for transcript counting and identifying TSS |
| This is followed for single cell downstream analysis on Seurat | Exact location of the 5’-end of transcripts | Low sensitivity | |||
| Low RNA input needed14 | It only sequences a short region near the 5’-end of each transcript | ||||
| cDNA synthesis starts at the 3’-end of RNA using an oligo dT primer, which can lead to inefficient capture of degraded mRNAs or exclude transcripts without intact poly(A) tails | |||||
| Switching mechanism at the 5’-end of RNA template sequencing (SMART-seq2)21 | In this MMLV RT adds few nucleotides at 3’- end of cDNA due to the terminal transferase activity of MMLV RT. In this TSO containing LNA is used to enhance the stability and efficiency of reverse transcription. Additionally, betaine is used to reduce secondary structure of transcript. This strategy enables capturing of the 5’ cap-proximal region, preserving fulllength information | ScPipe73 | Full-length coverage across transcripts | Barcoding is not done and therefore, high throughput multiplexing is not favoured | Suitable for isoforms, splicing, low expression genes |
| nf-core/scrnaseq74 | Detects low-abundance transcripts | Expensive | |||
| Require as little as 50 pg RNA | Labor intensive | ||||
| Use of LNA-modified TSO, optimized oligo dT, and betaine improves reverse transcription efficiency | Lack of strand specificity | ||||
| Unable to detect nonpolyadenylated (poly[A]-) RNA | |||||
| Cell expression level RNA sequencing (CEL-seq)17 | In this method primers containing oligo dT, barcode, an Illumina 5’ sequencing adaptor, and a T7 promoter are used for RT. Briefly, RNA is isolated after cell lysis, and then converted to cDNA with CEL-seq-primer. Once barcoded, the cDNAs from multiple cells are combined into one tube. After completion of second-strand cDNA synthesis, several samples are combined and subjected to IVT reactions | zUMIs75 | Minimize amplification bias because the technique uses IVT and not PCR | Captures just the 3’-ends of mRNA transcripts | Suitable for balance throughput and sensitivity |
| Early barcoding allows pooling of samples and thus reduces batch effects, and cost | Multiple enzymatic steps are used and thus it is more technically demanding and time-consuming | ||||
| Highly sensitive and capable of detecting a large number of genes per cell, even at low input RNA levels | Usually done in plate formats and thus lower throughput | ||||
| High reproducibility | |||||
| Indexed droplet sequencing (InDrop)9,19 | Briefly, in this method, individual cells are encapsulated into nanoliter-sized droplets containing lysis buffer, RT mix, and primers composed of poly(dT) sequences, UMIs, cell barcodes, sequencing adaptors, T7 RNA polymerase promoters, and photocleavable spacers. Following photocleavage, primers are released, enabling cDNA synthesis and the incorporation of cell-specific barcodes during reverse transcription. After generating the second cDNA strand, IVT is conducted to amplify the material. The droplets are subsequently disrupted, and the amplified RNA is fragmented via zinc-ion-mediated cleavage. These RNA fragments are then reverse-transcribed to produce a cDNA library suitable for next-generation sequencing. Finally, the cDNA libraries are sequenced on Illumina sequencing platforms | zUMIs75 | High throughput, low cost | Detects fewer transcripts per cell | |
| The method can index over 15,000 cells per hour, demonstrates minimal technical variability, and is highly adaptable for integration with other sequencing-based platforms | Captures mainly the 3’-end of mRNAs | ||||
| Requires microfluidic devices and specialized expertise | |||||
| Scalable and automated | Doublet’s formation | ||||
| Barcodes collision or uneven capture | |||||
| Geographical position sequencing (GEO-seq)15,18 | It combines LCM with SMART-seq2-based RNA-seq. In this cell are precisely isolated from tissues on the basis of their location. Following that RNA is extracted and reverse transcribed. Full length cDNA is amplified like SMART-seq2 | Standard scRNA-seq pipelines76,77 | Avoids biases introduced by enzymatic dissociation or cell sorting, preserving native cell states | Requires high-quality tissue sections | Study spatial heterogeneity |
| High-efficiency, high-resolution strategy for spatial transcriptome analysis | Laser capture microdissection is timeconsuming and labour-intensive | ||||
| Wide application potentials such as prospective cell fates, biological functions and gene regulatory networks | Requires specialized equipment (LCM system) and skilled operators, increasing cost and technical barriers | ||||
| Multiple annealing and tailing-based quantitative scRNA-seq (MATQ)18,22 | In this method instead of oligo dT, multiple random primers are used that anneal across RNA. Poly(C) tail is added at the 3’-end of first stand of cDNA by terminal deoxynucleotidyl transferase. Similar to SMART-seq, reverse transcriptase adds an Illumina Truseq adaptor to the 5’-end. Because of the random priming and optimized reverse transcription, MATQseq captures full-length transcripts, including non-coding and non-polyadenylated RNA | Standard full-length scRNA-seq pipelines76,77 | High sensitivity | Random priming can lead to nonspecific amplification | Capture non-coding and mitochondrial RNA |
| Captures both poly(A) and nonpoly(A) RNAs | Complex library preparation | ||||
| No built-in barcoding | |||||
| Lower throughput | |||||
| 10x Chromium Genomics9,16,18 | In this method, microfluidic chip is used to combine single cell, barcoded beads having millions of oligonucleotides, enzymes and reagents for RT. RT is initiated inside each droplet. Following that barcodes and UMIs are incorporated. Following this, the GEMs are disrupted to release the barcoded cDNAs, which are then pooled for subsequent amplification and library construction | CellRanger78,79 | High throughput | Captures only a fraction (to 10-20%) of total mRNA in a cell | Analyse thousands of cells |
| Seurat78,80 | Automated and reproducible | High cost | |||
| Low technical noise | Requires high-quality single-cell suspensions | ||||
| Time saving | Doublets or multiplets | ||||
| Supports immune profiling, chromatin accessibility, spatial transcriptomics | Only captures the 3’- or 5’-end of transcripts |
| Methods | Mechanisms | Analysis tool | Advantages | Disadvantages |
|---|---|---|---|---|
| Reduced representation bisulfite sequencing (RRBS)9,26 | It utilizes MspI restriction enzyme-which cuts DNA at all CCGG sites, regardless of their DNA methylation status. After digestion, selected fragments are size-selected (typically 40-220 bp), enriching for CpG-rich regions such as promoters and CpG islands. Bisulphite convert unmethylated DNA to uracil (read as T in sequencing). In sequencing presence of T confirms unmethylation while presence of C confirms methylated site. It enables the measurement of DNA methylation levels at 5-10% of all CpG sites in the mammalian genome26 | RnBeads81,82 | Relatively low cost | Limited to regions near Msp1 sites, hence low coverage |
| Standardized and well-validated | Not suitable for global methylation patterns | |||
| Whole genome bisulfite sequencing (WGBS)83,84 | It involves fragmentation of DNA and did not use Msp1. Following this entire genome is then sequenced after bisulphite treatment85 | Bismark81,86 | Covers nearly all CpG sites | Low library complexity |
| MethPipe87 | Can detect non-CpG methylation (e.g., CHH, CHG) | More input DNA required than reduced methods | ||
| MethylDackel88 | ||||
| MethylKit89 | ||||
| DSS90 | ||||
| RnBeads81,82 | ||||
| CpG island sequencing (CGI-seq)9,83,91 | In this after fragmentation of DNA, methylated or unmethylated DNA is enriched either using MBD proteins or using specific restriction enzymes respectively. Following that enriched DNA is sequenced using standard next-generation sequencing platforms | MEDIPS81,92 | High efficiency, simplified procedure | Inconsistent and/or low coverage |
| MethylKit89 | Targeted enrichment | Biased toward CpG islands | ||
| Assay for transposase-accessible chromatin using sequencing (ATAC-seq)9,83,93 | In this Tn5 transposase cut and tags accessible DNA. Tn5 inserts adapters preferentially to nucleosome free regions. Following that tagged fragments are PCR amplified and sequenced | nf-core/atacseq94 | High sensitivity for detecting open chromatin | Low recovery of DNA fragments |
| ENCODE-ATAC95 | Useful for mapping transcription factor footprints | Tn5 sequence bias | ||
| Low input can lead to noisy data | ||||
| DNase-seq9,28 | DNase I cuts DNA at accessible site. The cleaved fragments are sequenced after that. Sequences that bound to regulatory proteins are protected from DNase l digestion. Deep sequencing provides identify accurate location of regulatory proteins in the genome | ENCODE DNase-seq96 | Simplicity | Large amount of material needed, high error rate |
| Can detect open chromatin4 | DNase l is sequence-specific and hypersensitive sites might not account for the entire genome6 | |||
| No prior knowledge of the sequence or binding protein is required | DNA loss through the multiple purification steps limits sensitivity | |||
| Technically challenging, including optimization of DNase digestion | ||||
| chromatin immunoprecipitation followed by sequencing (ChIP-seq)27,83 | Cells are treated with formaldehyde to crosslink DNA and proteins. A specific antibody is used to pull down DNA-protein complex. After that cross-linking is reversed and DNA is sequenced | ENCODE ChIP-seq96 | High resolution, high coverage | Highly dependent on the quality of antibody |
| nf-core/chipseq pipeline74 | Reveals regulatory landscapes | Requires high-quality, specific antibodies | ||
| DROMPAplus27 | Works with various cell types and conditions | Crosslinking and immunoprecipitation can be inefficient or biased | ||
| High input requirement | ||||
| High background noise | ||||
| Droplet chromatin immunoprecipitation followed by sequencing (Drop-ChIP)24 | Individual nuclei of cells, barcoded beads, and reagents are co-encapsulated in droplets using a microfluidic device. ChIP is performed using antibodies targeting specific histone modifications DNA fragments are captured on barcoded beads. Following that emulsions are broken, and DNA is purified, amplified, and sequenced | Standard tools for single-cell epigenomic data are used97,98 | High throughput | Low coverage |
| High specificity | Technically challenging, requires microfluidic devices | |||
| Provides greater resolution of chromatin state heterogeneity in complex tissues | Limited to histone marks | |||
| Single-nucleus methylcytosine sequencing (snmC-seq)23,83 | Unlike standard WGBS, snmC-seq uses post-bisulfite adapter tagging to prevent DNA degradation and loss during bisulfite treatment. Following that DNA is sequenced to read methylation patterns | Bismark86 | Ultra-low-input DNA requirement | Suffers from DNA degradation |
| PCR bias can be introduced during amplification | ||||
| single-cell genome and epigenome transfer sequencing (scGET-seq1)99 | The method utilizes a recombinant transposase (TnH), created by fusing Tn5 with the chromodomain of HP1-α, which imparts binding affinity for H3K9me3-enriched heterochromatin. The combined use of Tn5 and TnH allows for comprehensive analysis of accessible and compacted chromatin states and their dynamic alterations | Snakemake100,101 | High resolution | Technically complex protocol |
| Requires advanced bioinformatics tools to integrate ATAC-seq and histone signal properly | ||||
| Dependent on quality of antibody | ||||
| Single nucleus barcoding assay for transposase-accessible chromatin sequencing (SNuBar-ATAC)29 | A single oligonucleotide adaptor containing unique barcodes is used during the tagmentation step, where a transposase enzyme inserts DNA fragments into open chromatin regions. Libraries are prepared for sequencing, and the barcodes are used to identify the origin of each fragment | SNuBar-ATAC pipeline29 | Robust chromatin accessibility profiling even from frozen or difficult tissues, and it is well-suited for tissues where nuclei isolation is more feasible than whole-cell isolation | Requires nuclei isolation optimization |
| Multiplexing capability | Each nucleus provides limited signal, requiring deep sequencing | |||
| Scalable | ||||
| Single-cell chromatin integration labelling sequencing (scChIL-seq)83,102 | Tn5 transposase is fused to protein A (pA-Tn5) or introduced via secondary antibodies, so it binds to the target histone mark. Transposition occurs in situ at the antibody-bound regions, integrating sequencing adapters directly at the chromatin site. The resulting DNA fragments are then amplified and sequenced | nf-core/chipseq pipeline74 | No need for nuclei isolation or bulk chromatin fragmentation | Antibody-dependent |
| Suitable for clinical or frozen samples | Require specialized bioinformatics pipelines | |||
| Single-cell cleavage under targets and tagmentation (scCUT&Tag)103 | In this method cells/nuclei are permeabilized. Primary anti-bodies bind to the target protein/epigenetic mark (e.g., H3K27me3, H3K4me1). A secondary antibody is used to tether a fusion protein of Protein A/G and Tn5 transposase (pA-Tn5) to the chromatin at the mark. Upon activation, Tn5 integrates sequencing adapters into nearby DNA. In scCUT&Tag, this is done on a single-cell platform. DNA fragments are barcoded per cell and sequenced | ChromHMMand104 | Low input requirement | Dependent on antibody quality |
| Segway104 | High signal-to-noise ratio (less background than ChIP) | |||
| Captures histone modifications, TF binding, and chromatin accessibility | ||||
| Compatible with frozen or fixed tissues | ||||
| Single-cell combinatorial indexing for transposase-accessible chromatin profiling sequencing (sciTIP-seq)25,83 | The transposase inserts sequencing adapters near the bound protein sites. Nuclei are indexed multiple times. In the first round of indexing fixed nuclei are randomly distributed across a 96- or 384-well plate. Each well contains a unique barcode. Barcodes are introduced by tagmentation or ligation. All nuclei are pooled together and then redistributed randomly into a new plate for the second round of indexing, introducing a second barcode. Each nucleus receives a unique combination of barcodes enabling single-nucleus resolution without physically isolating every cell | SnapATAC105 | High-throughput | Dependent on antibody quality |
| Cost-efficient | Barcoding collisions can occur if not enough barcode diversity | |||
| Scalable to tens of thousands of nuclei |
| Single-cell technology | Major findings |
|---|---|
| Tumor immune microenvironment | |
| scRNA-seq | NQO1+ macrophages may have an immunosuppressive effect on HCC106 |
| scRNA-seq | Four distinct TAMs were identified. Specifically, TAM-c4 was enriched in the advanced-stage patients or those receiving ICT and found to be related to a short survival time and low abundance of CD8+ T cells in primary liver cancers107 |
| scRNA-seq, spatial transcriptomics and transcriptome profiling | A novel FMO2+ CAF subset serves as a critical regulator of microenvironmental immune properties and a predictive biomarker of the immunotherapy response in patients with HCC. CCL19 in combination with anti-PD-1 therapy may constitute a novel therapeutic strategy for HCC108 |
| Spatial transcriptomics and scRNA-seq | High protein kinase, DNA-activated, catalytic subunit expression is associated with shorter survival times and an abnormal tumor microenvironment, highlighting its impact on immune cell infiltration and HCC prognosis109 |
| Spatial transcriptomics and scRNA-seq | LAMA4+ CD90+ extracellular matrix CAFs provide immunosuppressive microenvironment for liver cancer through induction of CD8+ T cell senescence110 |
| scRNA-seq | Glycan-HCCs were associated with multifaceted immune distortion, including exhaustion of T cells and enriched SPP1+ macrophages and leads to worse survival111 |
| Single-cell, bulk, and spatial transcriptome profiling with multiplexed immunofluorescence | The crosstalk between DAB2+ TAMs and FAP+ CAFs seem to be a key determinant in shaping the tumor immune barrier. Therapeutic strategies that disrupt this interaction could potentiate immunotherapeutic responses and improve patient prognosis112 |
| scRNA-seq, whole-exome sequencing, whole-transcriptome sequencing, and NGS-based HBV integration analysis | In this study, patients were categorized into two groups on the basis of effector CD8+ T-cell exhaustion markers high and low exhaustion groups. The high-exhaustion group exhibited higher PDCD1 expression, higher TP53 mutation rates, clonal expansion of CD4+ regulatory T cells and follicular helper T cells, and more pronounced HBV integrations with elevated intrahepatic covalently cccDNA and pgRNA levels. These findings provide insight into the intricate relationship between high exhaustion, proliferation sub-type, increased HBV integrations, and enhanced HBV-induced oncogenic potential in virus-related HCC113 |
| Tumor cell heterogeneity & spatial features | |
| scRNA-seq | The analysis demonstrated regional heterogeneity in cellular composition and malignant potential across the tumor, with the T1 region displaying the greatest degree of malignancy, characterized by the upregulation of HMGB2 and TOP2A114 |
| Spatial transcriptomics and scRNA-seq | Identified three subtypes in tumor cells, including ARG1+ metabolism subtype (metab-subtype), TOP2A+ proliferation phenotype, and S100A6+ pro-metastatic subtype (EMT-subtype)115 |
| scCPA-Tag | Identified a tumour cell subtype (C2) with more aggressive features. This subtype was characterized by high chromatin accessibility and a lower abundance of H3K27me3 on tumour-promoting genes116 |
| Spatial transcriptomics sequencing | The study identified one LPC cluster, three LPC subpopulations, and four distinct cellular modules, indicating the heterogeneity within LPC and the diversity between LPCs and epithelial cells117 |
| Single-cell immune repertoire sequencing, mass cytometry, and multiplex immunofluorescence | The study describes the TME landscape and identified six prognosis-related cell subclusters in this landscape118 |
| Tumor endothelial cells | |
| scRNA-seq | The analysis uncovered significant heterogeneity among TECs in HCC, delineating two subgroups (TEC1 and TEC2) with distinct functional roles and signaling profiles119 |
| scRNA-seq | This analysis delineated the intratumoral cellular heterogeneity underlying MVI, identifying MARCKSL1+ MVI-positive malignant cells as key contributors to MVI progression via activation of the PTN signaling pathway120 |
| Relapse, prognostic risk, and cellular senescence | |
| scRNA-seq and bulk RNA-seq datasets | Related genes from senescence-related pathways were used to identify senescence-related molecular sub-types in HCC with G6PD as a key gene, potentially serving as a senescence-related target in liver cancer121 |
| scRNA-seq | Analysis of primary versus early-relapsed HCC demonstrated elevated infiltration of CD8+ T cells and malignant cells in relapsed tumors, alongside a marked decline in CD4+ T cell populations122 |
| scRNA-seq | Identified a distinct senescent-associated subset of CD34+ CLDN5+ endothelial cells, which mainly enriched in tumor tissue. These cells increased cholangiocellular phenotype in HCC via IGF2-IGF2R signaling123 |
| Bulk RNA sequencing, proteomic analysis, scRNA-seq, spatial transcriptomics sequencing, and genome sequencing | Reveals a novel subtype of HCC with biological and clinical relevance. Patients with tumor purity-TME high-risk subtypes show pronounced hypoxia and activation of Wnt/β-catenin, Notch, and TGF-β pathways. Notably, a novel XPO1+ epithelial subtype shares these high-risk signatures and aggressive behavior124 |
| Clinical trial identifier | Country | Year | Title | Status |
|---|---|---|---|---|
| NCT05677724 | China | 2022 | Single-cell RNA Sequencing Resolves the Regulatory Role of HBV on the Hepatocellular Carcinoma Immune Microenvironment | Unknown status |
| NCT05171439 | China | 2022 | Surufatinib in Advanced Hepatocellular Carcinoma Based on Single-cell Sequencing of Tumor Samples | Unknown status |
| NCT05540925 | China | 2022 | Vascular Invasion Signatures in cfDNA Support Re-staging of Liver Cancer | Completed |
scRNA-seq, single-cell RNA sequencing; dT, deoxythymidine; cDNA, complementary DNA; STRTprep, Single-cell tagged reverse transcription-preparation; TSS, transcription start sites; TSO, template switching oligonucleotide; LNA, locked nucleic acid; IVT, in vitro transcription; PCR, polymerase chain reaction; RT, reverse transcription; UMIs, unique molecular identifiers; LCM, laser capture microdissection; GEMs, gel bead-in-emulsions.
MBD, methyl-CpG binding domain; PCR, polymerase chain reaction; DNase-seq, DNase I hypersensitive sites sequencing; WGBS, whole genome bisulfite sequencing; TnH, Tn5 transposase hyperactive; HP1-α, heterochromatin protein 1-alpha; TF, transcription factor.
scRNA-seq, single-cell RNA sequencing; NQO1, NAD(P)H quinone dehydrogenase 1; HCC, hepatocellular carcinoma; TAM, tumor-associated macrophage; ICT, immune checkpoint therapy; FMO2, flavin-containing monooxygenase 2; CAF, cancer-associated fibroblast; CCL19, C-C motif chemokine ligand 19; PD-1, programmed death 1; LAMA4, laminin subunit alpha 4; SPP1, secreted phosphoprotein 1; DAB2, disabled homolog 2; FAP, fibroblast activation protein; NGS, next generation sequencing; HBV, hepatitis B virus; cccDNA, closed circular DNA; pgRNA, pregenomic RNA; HMGB2, high mobility group box 2; TOP2A, topoisomerase (DNA) II alpha; scCPA, single-cell clustering-based pathway analysis; EMT, epithelial-to-mesenchymal transition; LPC, liver progenitor cell; TEC, tumor endothelial cell; MVI, microvascular invasion; PTN, pleiotrophin; IGF2, insulin-like growth factor 2; IGF2R, insulin-like growth factor 2 receptor; TGF-β, transforming growth factor-beta; XPO1, exportin 1.
HBV, hepatitis B virus; cfDNA, cell-free DNA. No results posted.