Single nucleotide polymorphism discovery through Illumina-based transcriptome sequencing and mapping in lentil

Lentil, which belongs to the family Leguminosae (Fabaceae), is a diploid (2n = 2x = 14 chromosomes) self-pollinating crop with a genome size of 4063 Mbp. Because of its nutritional importance and role in the fixation of nitrogen from the atmosphere, lentil is a widely used crop species in molecular genetic studies. By using DNA markers, to date, a limited number of polymorphic bands have been generated. Therefore, it is necessary to develop additional markers to saturate the genome at high density. Single nucleotide polymorphism (SNP) markers are promising for this purpose because of their abundance, stability, and heredity; they can be used to generate a large number of markers over a short distance that are distributed in both intragenic and intergenic regions. Transcriptome sequencing technology was applied to 2 lentil genotypes, and cDNAs were sequenced using the Illumina platform. A total of 111,105,153 sequence reads were generated after trimming. The high-quality reads were assembled, producing 97,528 contigs with an N50 of 1996 bp. The Genome Analysis Tool Kit Unified Genotyper algorithm detected 50,960 putative SNP primers. A genetic linkage map was constructed by using JoinMap4.0 and the map consists of 7 major linkage groups that could be represented as 7 chromosomes of lentil. The extensive sequence information and large number of SNPs obtained in this study could potentially be used for future high-density linkage map construction and association mapping. The large number of contigs obtained in this study could be used for the identification of orthologous transcripts from cDNA data on other organisms.

Single nucleotide polymorphism discovery through Illumina-based transcriptome sequencing and mapping in lentil

Lentil, which belongs to the family Leguminosae (Fabaceae), is a diploid (2n = 2x = 14 chromosomes) self-pollinating crop with a genome size of 4063 Mbp. Because of its nutritional importance and role in the fixation of nitrogen from the atmosphere, lentil is a widely used crop species in molecular genetic studies. By using DNA markers, to date, a limited number of polymorphic bands have been generated. Therefore, it is necessary to develop additional markers to saturate the genome at high density. Single nucleotide polymorphism (SNP) markers are promising for this purpose because of their abundance, stability, and heredity; they can be used to generate a large number of markers over a short distance that are distributed in both intragenic and intergenic regions. Transcriptome sequencing technology was applied to 2 lentil genotypes, and cDNAs were sequenced using the Illumina platform. A total of 111,105,153 sequence reads were generated after trimming. The high-quality reads were assembled, producing 97,528 contigs with an N50 of 1996 bp. The Genome Analysis Tool Kit Unified Genotyper algorithm detected 50,960 putative SNP primers. A genetic linkage map was constructed by using JoinMap4.0 and the map consists of 7 major linkage groups that could be represented as 7 chromosomes of lentil. The extensive sequence information and large number of SNPs obtained in this study could potentially be used for future high-density linkage map construction and association mapping. The large number of contigs obtained in this study could be used for the identification of orthologous transcripts from cDNA data on other organisms.

___

  • Anithakumari AM, Tang J, van Eck HJ, Visser RG, Leunissen JA, Vosman B, van der Linden CG (2010). A pipeline for high throughput detection and mapping of SNPs from EST databases. Mol Breeding 26: 65–75.
  • Arumuganathan K, Earle ED (1991). Nuclear DNA content of some important plant species. Plant Mol Biol Rep 9: 208–218.
  • Birol I, Jackman SD, Nielsen CB, Qian JQ, Varhol R (2009). De novo transcriptome assembly with ABySS. Bioinformatics 25: 2872– 2877.
  • Blanca JM, Cañizares J, Ziarsolo P, Esteras C, Mir G, Nuez F, Garcia- Mas J, Pico MB (2011). Melon transcriptome characterization. SSRs and SNPs discovery for high throughput genotyping across the species. Plant Genome 4: 118–131.
  • Blanca J, Esteras C, Ziarsolo P, Perez D, Fernandez-Pedrosa V, Collado C, de Pablos RR, Ballester A, Roig C, Cnizares J et al. (2012). Transcriptome sequencing for SNP discovery across Cucumis melo. BMC Genomics 13: 280.
  • Brookes AJ (1999). The essence of SNPs. Gene 234: 177–186.
  • Callaway JC (2004). Hempseed as a nutritional resource: an overview. Euphytica 140: 65–72.
  • Choi IY, Hyten DL, Matukumalli LK, Song Q, Chaky JM, Quigley CV, Chase K, Lark KG, Reiter RS, Yoon MS (2007). A soybean transcript map: gene distribution, haplotype and single nucleotide polymorphism analysis. Genetics 176: 685–696.
  • Close TJ, Bhat PR, Lonardi S, Wu Y, Rostoks N, Ramsay L, Druka A, Stein N, Svensson JT, Wanamaker S et al. (2009). Development and implementation of high-throughput SNP genotyping in barley. BMC Genomics 10: 582.
  • Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, Robles M (2005). Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21: 3674–3676.
  • Cortés AJ, Chavarro MC, Blair MW (2011). SNP marker diversity in common bean (Phaseolus vulgaris L.). Theor Appl Genet 123: 827–845.
  • Croucher NJ, Fookes MC, Perkins TT, Turner DJ, Marguerat SB, Keane T, Quail MA, He M, Assefa S, Bahler J et al. (2009). A simple method for directional transcriptome sequencing using Illumina technology. Nucleic Acids Res 37: e148.
  • David JP, Coissac E, Melodelima C, Poupardin R, Riaz MA, Proust AC, Reynaud S (2010). Transcriptome response to pollutants and insecticides in the dengue vector Aedes aegypti using next- generation sequencing technology. BMC Genomics 11: 216.
  • Der JP, Barker MS, Wickett NJ, dePamphilis CW, Wolf PG (2011). De novo characterization of the gametophyte transcriptome in bracken fern, Pteridium aquilinum. BMC Genomics 12: 99.
  • Doyle JJ, Doyle JL (1990). Isolation of plant DNA from fresh tissue. Focus 12: 13–15.
  • Duran Y, Fratini R, Garcia P, Perez de la Vega M (2003). An intersubspecific genetic map of Lens. Theor Appl Genet 108: 1265–1273.
  • Erskine W (1997). Lessons for breeders from land races of lentil. Euphytica 93: 107–112.
  • Erskine W, Chandra S, Chaudhry M, Malik IA, Sarker A, Sharma B, Tufail M, Tyagi MC (1998). A bottleneck in lentil: widening its genetic base in South Asia. Euphytica 101: 207–211.
  • Eujayl I, Baum M, Powell W, Erskine W, Pehu E (1998). A genetic linkage map of lentil (Lens sp.) based on RAPD and AFLP markers using recombinant inbred lines sp.) based on RAPD and AFLP markers. Theor Appl Genet 97: 83–89.
  • Fan H, Xiao Y, Yang Y, Xia W, Mason AS, Xia Z, Qiao F, Zhao S, Tang H (2013). RNA-Seq analysis of Cocos nucifera: transcriptome sequencing and de novo assembly for subsequent functional genomics approaches. PLoS One 8: e59997.
  • Feldmeyer B, Wheat CW, Krezdorn N, Rotter B, Pfenninger M (2011). Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance. BMC Genomics 12: 317.
  • Fikiru E, Tesfaye K, Bekele E (2007). Genetic diversity and population structure of Ethiopian lentil (Lens culinaris Medikus) landraces as revealed by ISSR marker. Afr J Biotechnol 6: 1460–1468.
  • Folta KM, Gardiner SE (2009). Genetics and Genomics of Rosaceae. Vol. 6. New York, NY, USA: Springer Science and Business Media.
  • Gupta M, Verma B, Kumar N, Chahota RK, Rathour R, Sharma SK, Bhatia S, Sharma TR (2012). Construction of intersubspeciŞc molecular genetic map of lentil based on ISSR, RAPD and SSR markers. Journal of Genetics 91: 279–287.
  • Hale MC, McCormick CR, Jackson JR, DeWoody JA (2009). Next- generation pyrosequencing of gonad transcriptomes in the polyploid lake sturgeon (Acipenser fulvescens): the relative merits of normalization and rarefaction in gene discovery. BMC Genomics 10: 203.
  • Hamwieh A, Udupa SM, Choumane W, Sarker A, Dreyer F, Jung C, Baum M (2005). A genetic linkage map of Lens sp. based on microsatellite and AFLP markers and the localization of fusarium vascular wilt resistance. Theor Appl Genet 110: 669– 677.
  • Havey MH, Muehlbauer FJ (1989). Linkages between restriction fragment length, isozyme, and morphological markers in lentil. Theor Appl Genet 77: 395–401.
  • Helyar SJ, Limborg MT, Bekkevold D, Babbucci M, Van-Houdt JKJ, Maes GE, Bargelloni L, Nielsen RO, Bendixen C, Taylor MI et al. (2012). SNP discovery using next generation transcriptomic sequencing in Atlantic herring (Clupea harengus). PLoS One 7: e42089.
  • Hubert S, Higgins B, Borza T, Bowman S (2010). Development of a SNP resource and a genetic linkage map for Atlantic cod (Gadus morhua). BMC Genomics 11: 191.
  • Hyten DL, Choi IY, Song Q, Specht JE, Carter TE, Shoemaker RC, Hwang EY, Matukumalli LK, Cregan PB (2010a). A high density integrated genetic linkage map of soybean and the development of a 1536 universal soy linkage panel for quantitative trait locus mapping. Crop Sci 50: 960–968.
  • Hyten DL, Song Q, Fickus EW, Quigley CV, Lim JS, Choi IY, Hwang EY, Pastor-Corrales M, Cregan PB (2010b). High-throughput SNP discovery and assay development in common bean. BMC Genomics 11: 475.
  • Hyten DL, Song Q, Zhu Y, Choi IY, Nelson RL, Costa JM, Specht JE, Shoemaker RC, Cregan PB (2006). Impacts of genetic bottlenecks on soybean genome diversity. P Natl Acad Sci USA 103: 16666–16671.
  • Joy N, Asha S, Mallika V, Soniya EV (2013). De novo transcriptome sequencing reveals a considerable bias in the incidence of simple sequence repeats towards the downstream of ‘Pre- miRNAs’ of black pepper. PLoS One 8: e56694.
  • Kanehisa M, Goto S (2000). KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28: 27–30.
  • Karim Mojein H, Alizadeh HM, Majnoon Hoseini N, Payghmbari SA (2003). Effect of herbicides and hand weeding in control of weed in winter and spring sown lentil (Lens culinaris L.). Iranian Journal of Crop Sciences 6: 68–79.
  • Kaur S, Cogan NO, Pembleton LW, Shinozuka M, Savin KW, Materne M, Forster JW (2011). Transcriptome sequencing of lentil based on second-generation technology permits large-scale unigene assembly and SSR marker discovery. BMC Genomics 12: 265.
  • Kaya HB, Cetin O, Kaya H, Sahin M, Sefer F, Kahraman A, Tanyolac B (2013). SNP Discovery by Illumina-based transcriptome sequencing of the olive and the genetic characterization of Turkish olive genotypes revealed by AFLP, SSR and SNP markers. PLoS One 8: e73674.
  • Kirst M, Myburg AA, De Leon JP, Kirst ME, Scott J, Sederoff R (2004). Coordinated genetic regulation of growth and lignin revealed by quantitative trait locus analysis of cDNA microarray data in an interspecific backcross of eucalyptus. Plant Physiol 135: 2368–2378.
  • Kosambi DD (1944). The estimation of map distance from recombination values. Ann Eugen 12: 172–175.
  • Lai K, Duran C, Berkman PJ, Lorenc MT, Stiller J, Manoli S, Hayden MJ, Forrest KL, Fleury D, Baumann U et al. (2012). Single nucleotide polymorphism discovery from wheat next- generation sequence data. Plant Biotechnol J 10: 743–749.
  • Li H, Durbin R (2009). Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25: 1754–1760.
  • Li H, Lu L, Manly KF, Chesler EJ, Bao L, Wang J, Zhou M, Williams RW, Cui Y (2005). Inferring gene transcriptional modulatory relations: a genetical genomics approach. Hum Mol Genet 14: 1119–1125.
  • Liu T, Zhu S, Tang Q, Chen P, Yu Y, Tang S (2013). De novo assembly and characterization of transcriptome using Illumina paired- end sequencing and identification of CesA gene in ramie (Boehmeria nivea L. Gaud). BMC Genomics 14: 125.
  • Loridon K, Burgarella C, Chantret N, Martins F, Gouzy J, Prosperi JM, Ronfort J (2013). Single-nucleotide polymorphism discovery and diversity in the model legume Medicago truncatula. Mol Ecol Resourc 13: 84–95.
  • Maccaferri M, Sanguineti MC, Corneti S, Ortega JLA, Salem MB, Bort J, DeAmbrogio E, del Moral LFG, Demontis A, El-Ahmed A et al. (2008). Quantitative trait loci for grain yield and adaptation of durum wheat (Triticum durum Desf.) across a wide range of water availability. Genetics 178: 489–511.
  • Martin M (2011). Cutadapt removes adapter sequences from high- throughput sequencing reads. EMBnet.journal 17: 10–12.
  • McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M et al. (2010). The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20: 1297–1303.
  • Mignon GL, Désert C, Pitel F, Leroux S, Demeure O, Guernec G, Abasht B, Douaire M, Roy PL, Lagarrigue S (2009). Using transcriptome profiling to characterize QTL regions on chicken chromosome 5. BMC Genomics 10: 575.
  • Mootha VK, Lepage P, Miller K, Bunkenborg J, Reich M, Hjerrild M, Delmonte T, Villeneuve A, Sladek R, Xu F et al. (2003). Identification of a gene causing human cytochrome c oxidase deficiency by integrative genomics. P Natl Acad Sci USA 100: 605–610.
  • Ooijen V (2006). JoinMap4, Software for the Calculation of Genetic Linkage Maps in Experimental Populations. Wageningen, the Netherlands: Kyazma B.V.
  • Parchman T, Geist K, Grahnen J, Benkman C, Buerkle C (2010). Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics 11: 180.
  • Phan HTT, Ellwood SR, Hane JK, Ford R, Materne M, Oliver RP (2007). Extensive macrosynteny between Medicago truncatula and Lens culinaris ssp. culinaris. Theor Appl Genet 114: 549– 558.
  • Rajesh PN, White D, Saha G, Chen W, Muehlbauer F (2008). Development and genetic analysis of SSR markers in lentil. In: Plant & Animal Genomes XVI Conference, 12-16 January 2008, San Diego, CA, USA.
  • Robertson G, Schein J, Chiu R, Corbett R, Field M, Jackman SD, Mungall K, Lee S, Okada HM, Qian JQ et al. (2010). De novo assembly and analysis of RNA-seq data. Nat Methods 7: 909– 912.
  • Rozen S, Skaletsky H (1999). Primer3 on the WWW for general users and for biologist programmers. Methods in Molecular Biology 132: 365–386.
  • Rubeena FR, Taylor PWJ (2003). Construction of an intraspecific linkage map of lentil (Lens culinaris ssp. culinaris). Theor Appl Genet 107: 910–916.
  • Schadt EE, Lamb J, Yang X, Zhu J, Edwards S, Guhathakurta D, Sieberts SK, Monks S, Reitman M, Zhang C et al. (2005). An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet 37: 710–717.
  • Schulz MH, Zerbino DR, Vingron M, Birney E (2012). Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels. Bioinformatics 28: 1086–1092.
  • Seeb JE, Carvalho G, Hauser L, Naish K, Roberts S, Seeb LW (2011). Single-nucleotide polymorphism (SNP) discovery and applications of SNP genotyping in nonmodel organisms. Mol Ecol Resourc 11 (Suppl. 1): 1-8.
  • Sharpe AG, Ramsay L, Sanderson LA, Fedoruk MJ, Clarke WE, Li R, Kagale S, Vijayan P, Vandenberg A, Bett KE (2013). Ancient orphan crop joins modern era: gene-based SNP discovery and mapping in lentil. BMC Genomics 14: 192.
  • Shirasawa K, Isobe S, Hirakawa H, Asamizu E, Fukuoka H, Just D, Rothan C, Sasamoto S, Fujishiro T, Kishida Y et al. (2010). SNP discovery and linkage map construction in cultivated tomato. DNA Res 17: 381–391.
  • Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJ, Birol I (2009). ABySS: A parallel assembler for short read sequence data. Genome Res 19: 1117–1123.
  • Slate J, Gratten J, Beraldi D (2009). Gene mapping in the wild with SNPs: guidelines and future directions. Genetica 136: 97–107.
  • Sun L, Yang W, Zhang Q, Cheng T, Pan H, Xu Z, Zhang J, Chen C (2013). Genome-wide characterization and linkage mapping of simple sequence repeats in mei (Prunus mume Sieb. et Zucc.). PLoS One 8: e59562.
  • Tadmor Y, Zamir D, Ladizinsky G (1987). Genetic mapping of an ancient translocation in the genus Lens. Theor Appl Genet 73: 883–892.
  • Tahir M, Simon CJ, Muehlbauer FJ (1993). Gene map of lentil: a review. LENS Newsletter 2: 3–10.
  • Tanyolac B, Ozatay S, Kahraman A, Muehlbauer F (2010). Linkage mapping of lentil (Lens culinaris L.) genome using recombinant inbred lines revealed by AFLP, ISSR, RAPD and some morphologic markers. Agricultural Biotechnology and Sustainable Development 2: 1–6.
  • Trick M, Long Y, Meng J, Bancroft I (2009). Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. Plant Biotechnol J 7: 334–346.
  • Tullu A, Tar’an B, Warkentin T, Vandenberg A (2008). Construction of an intraspecific linkage map and QTL analysis for earliness and plant height in lentil. Crop Sci 48: 2254–2264.
  • Varlı İ (2009). Use of SSR markers in construction of genetic linkage map of lentil. MSc, Harran University, Şanlıurfa, Turkey (in Turkish with English abstract).
  • Varshney RK, Nayak SN, May GD, Jackson SA (2009). Next- generation sequencing technologies and their implications for crop genetics and breeding. Trends Biotechnol 27: 522–530.
  • Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, Hanski I, Marden JH (2008). Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecol 17: 1636–1647.
  • Verma P, Shah N, Bhatia S (2013). Development of an expressed gene catalogue and molecular markers from the de novo assembly of short sequence reads of the lentil (Lens culinaris Medik.) transcriptome. Plant Biotechnol J 11: 894–905.
  • Vorrips RE (2002). MapChart: software for the graphical presentation of linkage maps and QTLs. Journal Hered 93: 77–78.
  • Wang SL, Sha ZX, Sonstegard TS, Liu H, Xu P (2008). Quality assessment parameters for EST-derived SNPs from catfish. BMC Genomics 9: 45.
  • Wang Z, Fang B, Chen J, Zhang X, Luo Z, Huang L, Chen X, Li Y (2010). De novo assembly and characterization of root transcriptome using Illumina paired-end sequencing and development of cSSR markers in sweet potato (Ipomoea batatas). BMC Genomics 11: 726.
  • Wheat CW (2010). Rapidly developing functional genomics in ecological model systems via 454 transcriptome sequencing. Genetica 138: 433–451.
  • Wong PP (1980). Nitrate and carbohydrate effects on nodulation and nitrogen fixation (acetylene reduction) activity of lentil (Lens esculenta Moench). Plant Physiol 66: 78–81.
  • Wu N, Matand K, Wu H, Li B, Li Y, Zhang X, He Z, Qian J, Liu X, Conley S et al. (2013). De novo next-generation sequencing, assembling and annotation of Arachis hypogaea L. Spanish botanical type whole plant transcriptome. Theor Appl Genet 126: 1145–1149.
  • Xu J, Ji P, Zhao Z, Zhang Y, Feng J, Wang J, Li J, Zhang X, Zhao L, Liu G (2012). Genome-wide SNP discovery from transcriptome of four common carp strains. PLoS One 7: e48140.
  • Xu JY, Xu GB, Chen SL (2009). A new method for SNP discovery. Biotechniques 46: 201–208.
  • Yadav SS, McNeil DL, Stevenson PC (2007). Lentil: An Ancient Crop for Modern Times. Dordrecht, the Netherlands: Springer.
  • Zamir D, Ladizinsky G (1984). Genetics of allozyme variants and linkage groups in lentil. Euphytica 33: 329–336.
  • Zerbino DR, Birney E (2008). Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18: 821– 829.
  • Zhu YL, Song QJ, Hyten DL, Van Tassell CP, Matukumalli LK, Grimm DR, Hyatt SM, Fickus EW, Young ND, Cregan PB (2003). Single-nucleotide polymorphisms in soybean. Genetics 163: 1123–1134.
Turkish Journal of Agriculture and Forestry-Cover
  • ISSN: 1300-011X
  • Yayın Aralığı: Yılda 6 Sayı
  • Yayıncı: TÜBİTAK
Sayıdaki Diğer Makaleler

Genetic variability, heritability, and genetic advance in strawberry(Fragaria × ananassa Duch.)

Pranav Kumar MISHRA, Ram Badan RAM, Neeraj KUMAR

Sources of genetic resistance in maize to Fusarium stalk rot andtheir variations at molecular level

Sajjad Hussain QURESHI, Abdul QAYYUM, Will FIERS

Single nucleotide polymorphism discovery through Illumina-based transcriptome sequencing and mapping in lentil

HÜLYA YILMAZ TEMEL, DENİZ GÖL, HİLAL BETÜL KAYA AKKALE, ABDULLAH KAHRİMAN, Muhammed Bahattin TANYOLAÇ

Preface - Special Issue on: 'Genomics and transcriptomics for utilization of plant genetic resources and crop improvement'

Turgay ÜNVER, Gabriel DORADO, Baohong ZHANG

Physiological characteristics, antioxidant enzyme activities, and gene expression in 2 spring canola (Brassica napus L.) cultivars under drought stress conditions

SEYED MOHAMMAD HOSSEINI, TAHEREH HASANLOO, SAEED MOHAMMADI

Sequencing of plant genomes ? a review

Mine TÜRKTAŞ, Kuaybe Yücebilgili KURTOĞLU, Gabriel DORADO, Baohong ZHANG, Pilar HERNANDEZ, Turgay ÜNVER

Partial characterization and development of sensitive and reliablediagnostic for the detection of cucumber mosaic virus

SHAHANAVAJ KHAN

Identification of expressed resistance gene analog sequences in coconut leaf transcriptome and their evolutionary analysis

Muliyar Krishna RAJESH, Kaitheri Edathil RACHANA, Sudalaimuthu Asari NAGANEESWARAN, Rahman SHAFEEQ, Regi Jacob THOMAS, Mohammed SHAREEFA, Babu MERIN, Karun ANITHA

Transcriptome characterization and large-scale identification of SSR/SNPmarkers in symbiotic nitrogen fixation crop faba bean (Vicia faba L.)

SUNDAN SURESH, TAE-SUNG KIM, SEBASTIN RAVEENDAR, JOON-HYEONG CHO, JUNG YOON YI, MYUNG CHUL LEE, SOK-YOUNG LEE, HYUNG-JIN BAEK, GYU-TAEK CHO, JONG WOOK CHUNG

Cytoplasmic-nuclear variation in a diversity-fixed foundation set ofBrassica juncea (L.) Czern & Coss.

JAVED AKHATAR, NITIN KUMAR, CHHAYA ATRI, SURINDER SINGH BANGA