G-Quadruplex enrichment analysis reveals their role as intronic regulatory elements in plants

G-quadruplexes, a class of noncanonical but highly stable nucleic acid structures, have the potential to be part of the regulatory mechanism of cells. They can form in the genome wherever the double-stranded helix is unwound. The biological significance of these structures is not yet fully understood. This work presents a novel approach and investigates common characteristics in the distribution of G-quadruplexes relative to genes in plants through analysis of genomes and gene expression. The results indicate that G-quadruplex distribution has changed significantly with the evolution of higher plants and, for the first time, that G-quadruplexes enriched at the beginning of introns may have a regulatory role during transcription.
