G-Quadruplex enrichment analysis reveals their role as intronic regulatory elements in plants
G-quadruplexes, a class of noncanonical but highly stable nucleic acid structures, have the potential to be part of the regulatory mechanism of cells. They can form in the genome where the double-stranded helix is unwound, which facilitates formation of a G-quadruplex. The biological significance of these structures is not yet entirely understood. This work presents a novel approach and investigates common characteristics in the distribution of G-quadruplexes relative to genes in plants through analysis of genomes and gene expression. The results indicate that G-quadruplex distribution has changed significantly with the evolution of higher plants and, for the first time, that G-quadruplexes enriched at the beginning of introns may have a regulatory role during transcription.
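As an illustration of how putative G-quadruplex-forming sequences can be located in a genome, the sketch below scans a DNA string on both strands. It assumes the widely used motif of four runs of at least three guanines separated by loops of 1 to 7 nucleotides; the abstract does not state which prediction rule this work applies, so the pattern and the helper names `find_putative_g4` and `reverse_complement` are hypothetical and shown only as an example.

```python
import re

# Assumed motif (not taken from this work): four G-runs of length >= 3
# separated by loops of 1-7 nucleotides.
G4_PATTERN = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_putative_g4(seq):
    """Find non-overlapping motif matches on both strands.

    Returns a sorted list of (start, end, strand) tuples, where start and
    end are 0-based, half-open coordinates on the forward strand.
    """
    seq = seq.upper()
    n = len(seq)
    hits = [(m.start(), m.end(), "+") for m in G4_PATTERN.finditer(seq)]
    # A G-rich motif on the reverse strand appears as a C-rich stretch on
    # the forward strand, so search the reverse complement and map back.
    for m in G4_PATTERN.finditer(reverse_complement(seq)):
        hits.append((n - m.end(), n - m.start(), "-"))
    return sorted(hits)


if __name__ == "__main__":
    print(find_putative_g4("TTGGGAGGGTGGGAGGGTT"))
```

Hits reported this way could then be compared with gene annotations, for example by measuring their distance from intron starts, although how that comparison is done in this work is not described in the abstract.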