Identification of differentially expressed genomic repeats in primary hepatocellular carcinoma and their potential links to biological processes and survival

Identification of differentially expressed genomic repeats in primary hepatocellular carcinoma and their potential links to biological processes and survival

Hepatocellular carcinoma (HCC) is one of the deadliest cancers. Research on HCC so far primarily focused on genes and provided limited information on genomic repeats, which constitute more than half of the human genome and contribute to genomic stability. In line with this, repeat dysregulation was significantly shown to be pathological in various cancers and other diseases. In this study, we aimed to determine the full repeat expression profile of HCC for the first time. We utilised two independent RNA-seq datasets obtained from primary HCC tumours with matched normal tissues of 20 and 17 HCC patients, respectively. We quantified repeat expressions and analysed their differential expression. We also identified repeats that are cooperatively expressed with genes by constructing a gene coexpression network. Our results indicated that HCC tumours in both datasets harbour 24 differentially expressed repeats and even more elements were coexpressed with genes involved in various metabolic pathways. We discovered that two L1 elements (L1M3b, L1M3de) were downregulated and a handful of HERV subfamily repeats (HERV-Fc1-int, HERV3-int, HERVE_a-int, HERVK11D-int, HERVK14C-int, HERVL18-int) were upregulated with the exception of HERV1_LTRc, which was downregulated. Various LTR elements (LTR32, LTR9, LTR4, LTR52-int, LTR70) and MER elements (MER11C, MER11D, MER57C1, MER9a1, MER74C) were implicated along with few other subtypes including Charlie12, MLT2A2, Tigger15a, Tigger 17b. The only satellite repeat differentially expressed in both datasets was GSATII, whose expression was upregulated in 33 (>90%) out of 37 patients. Notably, GSATII expression correlated with HCC survival genes. Elements discovered here promise future studies to be considered for biomarker and HCC therapy research. The coexpression pattern of the GSATII satellite with HCC survival genes and the fact that it has been upregulated in the vast majority of patients make this repeat particularly stand out for HCC.

___

  • Anwar SL, Hasemeier B, Schipper E, Vogel A, Kreipe H et al. (2019). LINE-1 hypomethylation in human hepatocellular carcinomas correlates with shorter overall survival and CIMP phenotype. PLoS One 14: e0216374.
  • Arroyo M, Bautista R, Larrosa R, Cobo M, Claros MG (2019). Biomarker potential of repetitive-element transcriptome in lung cancer. PeerJ 7: e8277.
  • Bao W, Kojima KK, Kohany O (2015). Repbase update, a database of repetitive elements in eukaryotic genomes. Mob DNA 6: 11.
  • Bard-Chapeau EA, Nguyen AT, Rust AG, Sayadi A, Lee P et al. (2014). Transposon mutagenesis identifies genes driving hepatocellular carcinoma in a chronic hepatitis B mouse model. Nature Genetics 46: 24-32.
  • Bersani F, Lee E, Kharchenko PV, Xu AW, Liu M et al. (2015). Pericentromeric satellite repeat expansions through RNAderived DNA intermediates in cancer. Proceedings of the National Academy of Sciences of the United States of America 112: 15148-15153.
  • Biscotti MA, Olmo E, Heslop-Harrison JS (2015). Repetitive DNA in eukaryotic genomes. Chromosome Research 23: 415-420.
  • Branco MR, Chuong EB (2020). Crossroads between transposons and gene regulation. Philosophical Transactions of the Royal Society of London Series B, Biological Sciences 375: 20190330.
  • Burns KH (2017). Transposable elements in cancer. Nature Reviews Cancer 17: 415-424.
  • Chen C, Wang G (2015). Mechanisms of hepatocellular carcinoma and challenges and opportunities for molecular targeted therapy. World Journal of Hepatology 7: 1964-1970.
  • Conesa A, Madrigal P, Tarazona S, Gomez-Cabrero D, Cervera A et al. (2016). A survey of best practices for RNA-seq data analysis. Genome Biology 17: 13.
  • De Cecco M, Ito T, Petrashen AP, Elias AE, Skvir NJ et al. (2019). L1 drives IFN in senescent cells and promotes age-associated inflammation. Nature 566: 73-78.
  • Fernández-Barrena MG, Arechederra M, Colyn L, Berasain C, Avila MA (2020). Epigenetics in hepatocellular carcinoma development and therapy: the tip of the iceberg. JHEP Reports: Innovation in Hepatology 2: 100167.
  • Hashimoto K, Suzuki AM, Dos Santos A, Desterke C, Collino A et al. (2015). CAGE profiling of ncRNAs in hepatocellular carcinoma reveals widespread activation of retroviral LTR promoters in virus-induced tumors. Genome Research 25: 1812-1824.
  • Hernandez-Segura A, De Jong TV, Melov S, Guryev V, Campisi J et al. (2017). Unmasking transcriptional heterogeneity in senescent cells. Current Biology 27 (17): 2652-2660.e4.
  • Honda T (2016). Links between human LINE-1 retrotransposons and hepatitis virus-related hepatocellular carcinoma. Frontiers in Chemistry 4: 21.
  • Hu Y, Pan J, Xin Y, Mi X, Wang J et al. (2018). Gene expression analysis reveals novel gene signatures between young and old adults in human prefrontal cortex. Frontiers in Aging Neuroscience 10: 259.
  • Hubley R, Finn RD, Clements J, Eddy SR, Jones TA et al. (2016). The Dfam database of repetitive DNA families. Nucleic Acids Research 44: D81-89. Iglesias N, Moazed D (2017). Silencing repetitive DNA. eLife 6: e29503.
  • Ishak CA, Classon M, De Carvalho DD (2018). Deregulation of retroelements as an emerging therapeutic opportunity in cancer. Trends in Cancer 4: 583-597.
  • Kim JH, Ebersole T, Kouprina N, Noskov VN, Ohzeki J et al. (2009). Human gamma-satellite DNA maintains open chromatin structure and protects a transgene from epigenetic silencing. Genome Research 19: 533-544.
  • Kishikawa T, Otsuka M, Yoshikawa T, Ohno M, Ijichi H et al. (2016). Satellite RNAs promote pancreatic oncogenic processes via the dysfunction of YBX1. Nature Communications 7: 13006.
  • Komissarov AS, Gavrilova EV, Demin SJ, Ishov AM, Podgornaya OI (2011). Tandemly repeated DNA families in the mouse genome. BMC Genomics 12: 531.
  • Kondratova VN, Botezatu IV, Shelepov VP, Likhtenshtein AV (2014). [Transcripts of satellite DNA in blood plasma: probable markers of tumor growth]. Molekuliarnaia Biologiia 48: 999- 1007.
  • Langfelder P, Horvath S (2008). WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9: 559.
  • Langfelder P, Zhang B, Horvath S (2008). Defining clusters from a hierarchical cluster tree: the dynamic tree cut package for R. Bioinformatics 24: 719-720.
  • Laska MJ, Brudek T, Nissen KK, Christensen T, Møller-Larsen A et al. (2012). Expression of HERV-Fc1, a human endogenous retrovirus, is increased in patients with active multiple sclerosis. Journal of Virology 86: 3713-3722.
  • Leinonen R, Sugawara H, Shumway M (2011). The sequence read archive. Nucleic Acids Research 39: D19-21.
  • Li H, Handsaker B, Wysoker A, Fennell T, Ruan J et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25: 2078-2079.
  • Li S, Hu Z, Zhao Y, Huang S, He X (2019). Transcriptome-wide analysis reveals the landscape of aberrant alternative splicing events in liver cancer. Hepatology 69: 359-375.
  • Liao Y, Smyth GK, Shi W (2014). featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30: 923-930.
  • Liao Y, Smyth GK, Shi W (2019). The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Research 47: e47.
  • Llovet JM, Zucman-Rossi J, Pikarsky E, Sangro B, Schwartz M et al. (2016). Hepatocellular carcinoma. Nature Reviews Disease Primers 2: 16018.
  • McDermaid A, Monier B, Zhao J, Liu B, Ma Q (2019). Interpretation of differential gene expression results of RNA-seq data: review and integration. Briefings in Bioinformatics 20: 2044-2054.
  • Niu ZS, Niu XJ, Wang WH (2016). Genetic alterations in hepatocellular carcinoma: an update. World Journal of Gastroenterology 22: 9069-9095.
  • Probst AV, Okamoto I, Casanova M, El Marjou F, Le Baccon P et al. (2010). A strand-specific burst in transcription of pericentric satellites is required for chromocenter formation and early mouse development. Developmental Cell 19: 625-638.
  • Rau A, Marot G, Jaffrézic F (2014). Differential meta-analysis of RNA-seq data from multiple studies. BMC Bioinformatics 15: 91.
  • Richard GF, Kerrest A, Dujon B (2008). Comparative genomics and molecular dynamics of DNA repeats in eukaryotes. Microbiology and Molecular Biology Reviews 72 (4): 686-727.
  • Robinson MD, McCarthy DJ, Smyth GK (2010). edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26: 139-140.
  • Robinson MD, Oshlack A (2010). A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biology 11: R25.
  • Saito Y, Kanai Y, Sakamoto M, Saito H, Ishii H et al. (2001). Expression of mRNA for DNA methyltransferases and methylCpG-binding proteins and DNA methylation status on CpG islands and pericentromeric satellite regions during human hepatocarcinogenesis. Hepatology 33: 561-568.
  • Schauer SN, Carreira PE, Shukla R, Gerhardt DJ, Gerdes P et al. (2018). L1 retrotransposition is a common feature of mammalian hepatocarcinogenesis. Genome Research 28: 639- 653.
  • Shapiro JA, Von Sternberg R (2005). Why repetitive DNA is essential to genome function. Biological Reviews of the Cambridge Philosophical Society 80: 227-250.
  • Solovyov A, Vabret N, Arora KS, Snyder A, Funt SA et al. (2018). Global cancer transcriptome quantifies repeat element polarization between immunotherapy responsive and T cell suppressive classes. Cell Reports 23: 512-521.
  • Tang Z, Li C, Kang B, Gao G, Li C et al. (2017). GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Research 45: W98-W102.
  • The Cancer Genome Atlas Research Network (2017). Comprehensive and integrative genomic characterization of hepatocellular carcinoma. Cell 169: 1327-1341.e1323.
  • Ting DT, Lipson D, Paul S, Brannigan BW, Akhavanfard S et al. (2011). Aberrant overexpression of satellite repeats in pancreatic and other epithelial cancers. Science 331: 593-596.
  • Togni R, Bagla N, Muiesan P, Miquel R, O’Grady J et al. (2009). Microsatellite instability in hepatocellular carcinoma in noncirrhotic liver in patients older than 60 years. Hepatology Research 39: 266-273.
  • Toh TB, Lim JJ, Chow EK (2019). Epigenetics of hepatocellular carcinoma. Clinical and Translational Medicine 8: 13.
  • Treangen TJ, Salzberg SL (2011). Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nature Reviews Genetics 13: 36-46.
  • Tummala KS, Brandt M, Teijeiro A, Grana O, Schwabe RF et al. (2017). Hepatocellular carcinomas originate predominantly from hepatocytes and benign lesions from hepatic progenitor cells. Cell Reports 19: 584-600.
  • Velazquez Camacho O, Galan C, Swist-Rosowska K, Ching R, Gamalinda M et al. (2017). Major satellite repeat RNA stabilize heterochromatin retention of Suv39h enzymes by RNAnucleosome association and RNA:DNA hybrid formation. eLife 6: e25293.
  • Wickham H (2016). ggplot2: elegant graphics for data analysis. 2nd ed. Cham, Switzerland: Springer International.
  • Wong MC, Jiang JY, Goggins WB, Liang M, Fang Y et al. (2017). International incidence and mortality trends of liver cancer: a global profile. Scientific Reports 7: 45846.
  • Wu Y, Zhao Y, Huan L, Zhao J, Zhou Y et al. (2020). An LTR retrotransposon-derived long noncoding RNA lncMER52A promotes hepatocellular carcinoma progression by binding p120-Catenin. Cancer Research 80: 976-987.
  • Yandim C, Karakulah G (2019). Expression dynamics of repetitive DNA in early human embryonic development. BMC Genomics 20: 439.
  • Yandım C, Karakülah G (2019). Dysregulated expression of repetitive DNA in ER+/HER2- breast cancer. Cancer Genetics 239: 36- 45.
  • Yang Y, Chen L, Gu J, Zhang H, Yuan J et al. (2017). Recurrently deregulated lncRNAs in hepatocellular carcinoma. Nature Communications 8: 14421.
  • Yu G, Wang LG, Han Y, He QY (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: A Journal of Integrative Biology 16: 284-287.
  • Zhang B, Horvath S (2005). A general framework for weighted gene co-expression network analysis. Statistical Applications in Genetics and Molecular Biology 4 (1). doi: 10.2202/1544- 6115.1128
  • Zhang B, Zhang Y, Zou X, Chan AW, Zhang R et al. (2017). The CCCTC-binding factor (CTCF)-forkhead box protein M1 axis regulates tumour growth and metastasis in hepatocellular carcinoma. The Journal of Pathology 243: 418-430.
  • Zhang L, Li H, Ge C, Li M, Zhao FY et al. (2014). Inhibitory effects of transcription factor Ikaros on the expression of liver cancer stem cell marker CD133 in hepatocellular carcinoma. Oncotarget 5: 10621-10635.
  • Zheng Y, Hlady RA, Joyce BT, Robertson KD, He C et al. (2019). DNA methylation of individual repetitive elements in hepatitis C virus infection-induced hepatocellular carcinoma. Clinical Epigenetics 11: 145.
  • Zhu Q, Pao GM, Huynh AM, Suh H, Tonnu N et al. (2011). BRCA1 tumour suppression occurs via heterochromatin-mediated silencing. Nature 477: 179-184.