Anat Cell Biol 2023; 56(1): 94-108
Published online March 31, 2023
https://doi.org/10.5115/acb.22.161
Copyright © Korean Association of ANATOMISTS.
Minjeong Kim , Kwang-Woo Jo
, Hyojin Kim
, Myoung-Eun Han
, Sae-Ock Oh
Department of Anatomy, School of Medicine, Pusan National University, Yangsan, Korea
Correspondence to:Sae-Ock Oh
Department of Anatomy, School of Medicine, Pusan National University, 49 Busandaehak-ro, Mulgeum-eup, Yangsan 50612, Korea
E-mail: hedgehog@pusan.ac.kr
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Cancer cell heterogeneity is a serious problem in the control of tumor progression because it can cause chemoresistance and metastasis. Heterogeneity can be generated by various mechanisms, including genetic evolution of cancer cells, cancer stem cells (CSCs), and niche heterogeneity. Because the genetic heterogeneity of CSCs has been poorly characterized, the genetic mutation status of CSCs was examined using Exome-Seq and RNA-Seq data of liver cancer. Here we show that different surface markers for liver cancer stem cells (LCSCs) showed a unique propensity for genetic mutations. Cluster of differentiation 133 (CD133)-positive cells showed frequent mutations in the IRF2, BAP1, and ERBB3 genes. However, leucine-rich repeat-containing G protein-coupled receptor 5-positive cells showed frequent mutations in the CTNNB1, RELN, and ROBO1 genes. In addition, some genetic mutations were frequently observed irrespective of the surface markers for LCSCs. BAP1 mutations was frequently observed in CD133-, CD24-, CD13-, CD90-, epithelial cell adhesion molecule-, or keratin 19-positive LCSCs. ASXL2, ERBB3, IRF2, TLX3, CPS1, and NFATC2 mutations were observed in more than three types of LCSCs, suggesting that common mechanisms for the development of these LCSCs. The present study provides genetic heterogeneity depending on the surface markers for LCSCs. The genetic heterogeneity of LCSCs should be considered in the development of LCSC-targeting therapeutics.
Keywords: Genetic heterogeneity, Cancer stem cell, Liver cancer
Cancer stem cells (CSCs) have been identified in various cancers, including liver cancer [1-3]. Owing to their biological characteristics, including chemoresistance, radio resistance, dormancy, and metastatic potential, they have drawn a lot of attention from the research field. Although chemotherapeutics can successfully reduce tumor volume, chemoresistant CSCs can survive and cause cancer relapse. Since the initial report of liver CSCs (LCSCs), many surface markers (epithelial cell adhesion molecule [EpCAM], leucine-rich repeat-containing G protein-coupled receptor 5 [LGR5], cluster of differentiation 133 [CD133], CD44, CD24, CD90, etc.) have been reported [2, 3]. In addition to identifying LCSC surface markers, the molecular maintenance mechanism of their stemness has also been explored. Signaling pathways, including WNT, hedgehog, and Notch pathways, and transcription factors, including Nanog, Oct4, Sox2, and Myc, have been reported to be important for the maintenance of stemness in CSCs. Based on these advances in CSC research, some of them have been applied to the development of therapeutics.
Liver cancer is the third most common cause of cancer-related death worldwide (GLOBOCAN 2020). It is the sixth most common type of cancer. The mortality rate has not improved over the last three decades. Notably, its incidence is expected to increase by approximately 60% between 2020 and 2040 in Asia. Hepatocellular carcinoma (HCC) is the most common form of primary liver cancer. Therefore, new insights into the mechanisms underlying HCC are required for the development of diagnostics and therapeutics for liver cancer.
Although therapeutics targeting CSC surface markers have been developed, their use frequently encounters new resistance, partly owing to the flexible hierarchy of CSCs [1]. New CSCs are generated from other daughter cells under inflammatory conditions, although CSCs that express specific CSC markers are eliminated by therapeutics. Another cause for the resistance in applying the therapeutics is the heterogeneity of CSCs. Single-cell transcriptomics and reports of many types of surface markers suggest the heterogeneity of CSCs. The CSC hypothesis is based on the assumption that these cells have the same genetic background as their daughter cells, although they are epigenetically different from each other. However, cancer cells within a tumor can have different genetic backgrounds [4]. Therefore the genetic heterogeneity of LCSCs needs to be examined in detail.
In this study, to reveal genetic heterogeneity of LCSCs, we examined the genetic mutation status of LCSCs based on their surface markers (CD133, CD44, CD24, CD47, CD13, CD90, ICAM1, EpCAM, LGR5, keratin 19 [KRT19]). By analyzing Exome-Seq and RNA-Seq data of HCC (n=366) in the Cancer Genome Atlas (TCGA) database, we found that LCSCs are genetically heterogenous depending on their surface markers and some mutations are associated with a specific surface marker of LCSCs.
RNA-Seq and Exome-Seq data from TCGA were downloaded from the c-BioPortal. The TCGA cohort includes genomic information of 366 HCC patients. RNA and DNA were extracted from tumor and adjacent normal tissue specimens using a modification of the DNA/RNA AllPrep kit (QIAGEN, not single cell-Seq). The expression data for mRNA were batch-corrected to adjust for platform differences between the GAII and HiSeq Illumina sequencers. Somatic exome variant analysis was performed to remove potential germline calls as well as non-exonic variants. RNA-Seq data of TCGA were further classified based on the expression level (RNA-seq value) of each CSC marker. The high or low group based on each marker’s expression level was defined as a higher 25% or lower 25% of patients among HCC patients in the TCGA HCC cohort. The intermediate group was defined as 25%–75% of patients. The genes which were harboring mutations were further selected based on their relationship with cancer, whether they were cancer genes (OncoKB), and the frequency of HCC (>1.0%). Somatic exome mutations for each gene were further selected by SIFT (deleterious group) or PolyPhen score (probably damaging group), which predicts significant functional changes.
The correlation between the expression level of each cancer stem marker and the mutation of each cancer gene was evaluated using the chi-square test. Results with a
CD133 is a five-transmembrane single-chain glycoprotein that is usually localized in membrane protrusions [5-8]. It may function as an organizer of cellular protrusion and regulate the number of microvilli and the structure of filopodia. Since its discovery in hematopoietic stem cells [5], its expression in other adult stem cells has been reported [9]. Its biological roles in proliferation, migration, metastasis, spheroid formation, and
When we compared CD133-high HCC tissues with CD133-low tissues, we found that many kinds of cancer gene mutations were enriched in the CD133-high group (Fig. 1A, Table 1). Mutations in
Table 1 . Summary of frequently mutated genes in each LCSC group
Markers | Frequently mutated genes |
---|---|
CD133 | |
CD44 | |
CD24 | |
CD47 | |
CD13 | |
CD90 | |
ICAM1 | |
EpCAM | |
LGR5 | |
KRT19 |
LCSC, liver cancer stem cell; CD, cluster of differentiation; ICAM1, Intercellular adhesion molecule 1; EpCAM, epithelial cell adhesion molecule; LGR5, leucine-rich repeat-containing G protein-coupled receptor 5; KRT19, keratin 19.
The association of some mutations in these cancer genes with CD133-positive CSCs has been reported.
CD44 is a lymphocyte-homing receptor that interacts with the extracellular matrix, cell-cell interaction, adhesion, and migration [41-43]. It also functions as a glycoprotein receptor that interacts with various molecules, including hyaluronic acid, osteopontin, chondroitin, collagen, fibronectin, and metalloproteinases [44-48]. Notably, alternatively spliced variants are critical for cancer progression [49]. Moreover, its intracellular domain interacts with
When we compared CD44-high HCC tissues with CD44-low tissues, we found that many kinds of cancer gene mutations were enriched in the CD44-high group (Fig. 1B, Table 1). Mutations in
The association of some mutations in these cancer genes with CD44 expression has been reported. In addition to multiple myeloma [52, 56], the JAK-STAT signaling pathway regulates the expression of CD44. The interaction of STAT signaling and CD44 has been reported in breast cancer and ovarian CSCs [57, 58]. The inhibition of
CD24 is a mucin-like glycoprotein that regulates the growth and differentiation of B-lymphocytes, neutrophils, and neuroblasts [65, 66]. It has been suggested to act as a brake on the immune system and as an antiphagocytic surface protein [67]. It has been reported to be expressed in ovarian cancer, breast cancer, non-small cell lung cancer, prostate cancer, pancreatic cancer, and HCC [68, 69]. It regulates proliferation, migration, and invasion of cancer cells [66, 68, 70]. It has been suggested as a surface marker for CSC in breast cancer, gastric cancer, cervical cancer, multiple myeloma, cholangiocarcinoma, and HCC and regulates the differentiation and metastasis of liver cancer cells [71-76]. Moreover, its expression was associated with a poor prognosis in patients with liver cancer. Its association with the STAT3-NANOG pathway has been reported [77].
When we compared CD24-high HCC tissues with CD24-low tissues, we found that many types of cancer gene mutations were enriched in the CD24-high group (Fig. 2A, Table 1). Mutations in
The association of some mutations in these cancer genes with CD24 expression has been reported. CDH1 knockdown has been associated with the enrichment of CD24-positive colon CSCs [78]. ERBB2/3 contributes to the conversion of induced pluripotent stem cells into CSCs. Hypoxia activates HIF, resulting in the upregulation of the “do not-eat-me” signal surface markers (CD24 and CD47) and vascular endothelial growth factor, which bind to FLT1 in liver cancer [79]. IDH mutation induced phenotypic reprogramming in glioma, resulting in the generation of CSCs and an increase in CD24-positive cells [80, 81].
CD47 is a transmembrane protein that belongs to the immunoglobulin superfamily and forms supramolecular complexes with integrins, G proteins, and cholesterol [82, 83]. It interacts with signal-regulatory protein α (SIRPα), thrombospondin 1, and integrins, and is involved in proliferation, migration, phagocytosis, apoptosis, and immune homeostasis. Its overexpression has been frequently observed in various cancer cells, including myeloma, leiomyosarcoma, acute lymphocytic leukemia, non-Hodgkin’s lymphoma, breast cancer, osteosarcoma, head and neck squamous cell carcinoma, and liver cancer [82]. It inhibits macrophage-mediated phagocytosis by interacting with SIRPα [84, 85]. It has also been found to be expressed in CSCs in leukemia, glioma, pancreatic cancer, and liver cancer [86-89]. It regulates self-renewal, metastasis, and chemoresistance in LCSCs [87]. It is a poor prognostic factor in liver cancer and is associated with the cathepsin S (CTSS)-protease-activated receptor 2 signaling pathway by preferentially secreting CTSS [87]. Therapeutics targeting CD47 have been actively investigated [90-93].
When we compared CD47-high HCC tissues with CD47-low tissues, we found that many kinds of cancer gene mutations were enriched in the CD47-high group (Fig. 2B, Table 1). Mutations in
CD13 is a zinc-dependent type II exopeptidase located in the plasma membrane and engaged in the post-secretory processing of secreted signaling peptides and their binding to their receptors [95]. It is expressed in the kidney, intestine, liver, and central nervous system. It regulates proliferation, invasion, angiogenesis, chemoresistance, and radio resistance of cancer cells [96-98]. Its expression has been associated with poor prognosis in various cancers, including pancreatic and colon cancers, non-small cell lung cancer, malignant pleural mesothelioma, hepatoblastoma, and soft tissue sarcoma [97, 99-103]. It has been suggested to be a surface marker for CSC in liver cancer and is associated with the TGF-β signaling pathway [104-106].
When we compared CD13-high HCC tissues with CD13-low tissues, we found that many kinds of cancer gene mutations were enriched in the CD13-high group (Fig. 3A, Table 1). Mutations in
The association of some mutations in these cancer genes with CD13 expression has been reported. Overexpression of KDM4C was observed in CD13-positive LCSCs [107]. Its depletion decreased tumor initiation, as examined by sphere formation and xenograft assays. Angiopoietin increased the expression of CD13 via the TEK (TIE2) receptor in pericytes [108].
CD90 is a glycosylphosphatidylinositol-anchored glycoprotein expressed in thymocytes, neurons, mesenchymal stem cells, hepatic stem cells, natural killer cells, T cells, and endothelial cells [109, 110]. It is involved in cell-cell and cell-matrix interactions, apoptosis, and migration. It can promote tumorigenesis, metastasis, and chemoresistance and is a prognostic factor [111]. It has been suggested as a surface marker for CSC in HCC that does not express EpCAM [112, 113]. IL6, hedgehog, and AKT signaling pathways have been associated with CD90 [114, 115].
When we compared CD90-high HCC tissues with CD90-low tissues, we found that many types of cancer gene mutations were enriched in the CD90-high group (Fig. 3B, Table 1). Mutations in
ICAM1 is a cell surface glycoprotein and an adhesion receptor in various cell types, including immune, endothelial, and epithelial cells [117]. It is a ligand for the leukocyte adhesion protein LFA1. It regulates the metastasis and tumorigenic potential of liver cancer cells and is a marker for CSC [118, 119]. It is a poor prognostic factor for liver cancer and is associated with Nanog expression [119].
When we compared ICAM1-high HCC tissues with ICAM1-low tissues, we found that many kinds of cancer gene mutations were enriched in the ICAM1-high group (Fig. 4A, Table 1). Mutations in
The association of some mutations in these cancer genes with ICAM1 expression has been reported. NFAT belongs to the Rel homology domain-containing family of transcription factors and can recognize DNA sequences that can be recognized by NF-κB [120, 121]. NFATC2 siRNA decreased TNFα-induced ICAM1 expression and cell adhesion in human retinal microvascular endothelial cells [122]. TGF-β1 increased Notch3 and ICAM1 expression levels in hepatic stellate cells, and the Notch pathway was important in TGF-β1 induced activation of hepatic stellate cells [123].
EpCAM is a type I membrane protein containing two EGF-like domains [124, 125]. Its expression has been reported in various types of carcinomas. It regulates the self-renewal and tumorigenesis of cancer cells and is a surface marker for CSC [126]. WNT, CHD4, and OSM signaling pathways have been linked to EpCAM-positive liver cancer cells [127-129].
When we compared EpCAM-high HCC tissues with EpCAM-low tissues, we found that many kinds of cancer gene mutations were enriched in the EpCAM-high group (Fig. 4B, Table 1). Mutations in
The association of some mutations in these cancer genes with EpCAM expression has been reported.
LGR5 is a G-protein-coupled receptor that contains seven transmembrane domains [135]. It can bind to R-spondin 1–4 proteins and associate with phosphorylated LRP6 and frizzled receptors. It is expressed in stem cells of the intestine, ovary, hair follicle, mammary gland, and stomach [136-139]. It regulates regeneration of the intestine, colon, liver, pancreas, and stomach [140-142]. In liver cancer cells, it regulates tumorigenic potential, chemoresistance, and migration [143]. LGR5-positive liver cancer cells have been associated with the HGF, LSD1, Prickle, and WNT signaling pathways [144, 145].
When we compared LGR5-high HCC tissues with LGR5-low tissues, we found that many types of cancer gene mutations were enriched in the LGR5-high group (Fig. 5A, Table 1). Mutations in
The association of some mutations in these cancer genes with LGR5 expression has been reported. LGR5 is a target gene of the WNT signaling pathway [146]. Therefore, the
KRT19 is an intermediate filament protein involved in myofiber organization [152]. It is expressed in the bipotential cells of the liver and regulates fluorine-18 deoxyglucose accumulation [153]. It regulates proliferation, chemoresistance, tumorigenicity, invasion, and metastasis of liver cancer cells [154]. It has been reported to be a poor prognostic factor [155, 156]. TGF-β, PDGFR, and HGF signaling pathways have been associated with KRT19-positive liver cancer cells [153, 157, 158].
When we compared KRT19-high HCC tissues with KRT19-low tissues, we found that many types of cancer gene mutations were enriched in the KRT19-high group (Fig. 5B, Table 1). Mutations in
The association of some mutations in these cancer genes with KRT19 expression has been reported. Inhibition of Notch signaling reduced KRT19-positive cells during murine lacrimal gland formation [159]. Moreover, the expression of KRT19 in HCC cells was associated with the expression of
The heterogeneous composition of cancer cells in the tumor is a great challenge for oncologists because it is a critical cause of tumor relapse, chemoresistance, and metastasis. Genetic and epigenetic changes are the underlying mechanisms of cancer cell heterogeneity in tumors. In the present study, we examined the genetic mutation status of LCSCs and found genetic heterogeneity depending on markers for LCSCs (Table 1).
Different surface markers for LCSCs showed a unique pattern of propensity for genetic mutations (Table 1). CD133-positive cells showed frequent mutations in
Notably, some genetic mutations were frequently observed irrespective of the surface markers of LCSCs (Table 1).
In conclusion, based on above results we conclude that LCSCs are genetically heterogenous depending on their surface markers. In addition some mutations are frequently found in a LCSC group which expresses a specific surface marker. These results suggest that when therapeutics targeting LCSCs which express a specific marker, are considered for the treatment, mutational profiling of patients needs to be examined for the possibility of combination therapy. In addition, the existence or the new generation of other kinds of LCSCs needs to be considered for the treatment because the mutation status of patients is complex and always changing, which can lead to the new generation of other kinds of LCSCs.
This work was supported by a 2-year research grant from Pusan National University.
Conceptualization: MK, SO. Data acquisition: MK, KWJ, HK, MEH, SOO. Data analysis or interpretation: MK, KWJ, HK, MEH, SOO. Drafting of the manuscript: MK, SOO. Critical revision of the manuscript: SOO. Approval of the final version of the manuscript: all authors.
No potential conflict of interest relevant to this article was reported.