9.9
CiteScore
7.1
Impact Factor
Turn off MathJax
Article Contents

Whole-genome sequencing of 2032 diverse tobacco accessions reveals genetic variation and population differentiation

doi: 10.1016/j.jgg.2026.02.025
Funds:

We are grateful for the grants from the China National Tobacco Company (110202101038 [JY-15]), and Yunnan Tobacco Company (2024530000241003, 2022530000241009, 2025530000241005).

  • Received Date: 2025-09-01
  • Accepted Date: 2026-02-26
  • Rev Recd Date: 2026-02-24
  • Available Online: 2026-03-04
  • Nicotiana tabacum is an allotetraploid hybrid and a widely used model organism in plant genetics. Despite its agricultural and biological significance, large-scale genomic studies and comprehensive analysis of population differentiation in tobacco remain limited. Here, we perform whole-genome resequencing of 2032 diverse tobacco accessions, identifying 59 million single-nucleotide polymorphisms (SNPs) and 8.3 million small insertions and deletions (InDels). These variants contribute to substantial genetic diversity both within and between populations. Population genetic structure analysis reveals two major genetic subpopulations, with the differentiation primarily driven by breeding practices. Notably, the genetic differentiation between varieties is greater than that observed between tobacco types. Shared divergent regions across six types are enriched in defense, epidermal development, and lipid metabolism pathways, reflecting selection for stress adaptation and plant growth. We further identify 302 accessions exhibiting strong signatures of selection in immune-related genes, highlighting their potential as parental lines in resistance breeding programs. Selective sweeps in major cultivars overlap with 19 previously identified QTLs, primarily associated with agronomic traits such as leaf morphology, disease resistance, and chemical traits. This study provides a high-resolution genomic resource for in-depth tobacco genomics research and offers valuable insights into precision breeding.
  • loading
  • Alexander, D.H., Novembre, J., Lange, K., 2009. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655-1664.
    Arumuganathan, K., Martin, G.B., Telenius, H., Tanksley, S.D., Earle, E.D., 1994. Chromosome 2-specific DNA clones from flow-sorted chromosomes of tomato. Mol. Gen. Genet. 242, 551-558.
    Bally, J., Jung, H., Mortimer, C., Naim, F., Philips, J.G., Hellens, R., Bombarely, A., Goodin, M.M., Waterhouse, P.M., 2018. The rise and rise of Nicotiana benthamiana: A plant for all reasons. Annu. Rev. Phytopathol. 56, 405-426.
    Barton, K.A., Binns, A.N., Matzke, A.J., Chilton, M.D., 1983. Regeneration of intact tobacco plants containing full length copies of genetically engineered T-DNA, and transmission of T-DNA to R1 progeny. Cell 32, 1033-1043.
    Capella-Gutierrez, S., Silla-Martinez, J.M., Gabaldon, T., 2009. Trimal: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972-1973.
    Chen, H., Patterson, N., Reich, D., 2010. Population differentiation as a test for selective sweeps. Genome Res. 20, 393-402.
    Chen, S., 2023. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. Imeta 2, e107.
    Cingolani, P., Platts, A., Wang le, L., Coon, M., Nguyen, T., Wang, L., Land, S.J., Lu, X., Ruden, D.M., 2012. A program for annotating and predicting the effects of single nucleotide polymorphisms, Snpeff: Snps in the genome of drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80-92.
    Clarkson, J.J., Lim, K.Y., Kovarik, A., Chase, M.W., Knapp, S., Leitch, A.R., 2005. Long-term genome diploidization in allopolyploid Nicotiana section repandae (Solanaceae). New Phytol. 168, 241-252.
    Creager, A.N., Scholthof, K.B., Citovsky, V., Scholthof, H.B., 1999. Tobacco mosaic virus. Pioneering research for a century. Plant Cell 11, 301-308.
    Danecek, P., Auton, A., Abecasis, G., Albers, C.A., Banks, E., DePristo, M.A., Handsaker, R.E., Lunter, G., Marth, G.T., Sherry, S.T., et al., 2011. The variant call format and vcftools. Bioinformatics 27, 2156-2158.
    Drake-Stowe, K., Bakaher, N., Goepfert, S., Philippon, B., Mark, R., Peterson, P., Lewis, R.S., 2017. Multiple disease resistance loci affect soilborne disease resistance in tobacco (Nicotiana tabacum). Phytopathology 107, 1055-1061.
    Edwards, K.D., Fernandez-Pozo, N., Drake-Stowe, K., Humphry, M., Evans, A.D., Bombarely, A., Allen, F., Hurst, R., White, B., Kernodle, S.P., et al., 2017. A reference genome for Nicotiana tabacum enables map-based cloning of homeologous loci implicated in nitrogen utilization efficiency. BMC Genom. 18, 448.
    Herrera-Estrella, L., Block, M.D., Messens, E., Hernalsteens, J.P., Montagu, M.V., Schell, J., 1983. Chimeric genes as dominant selectable markers in plant cells. EMBO J. 2, 987-995.
    Holmes, F.O., 1938. Inheritance of resistance to tobacco-mosaic disease in tobacco. Phytopathology 28, 553-561.
    Jakobsson, M., Rosenberg, N.A., 2007. Clumpp: A cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801-1806.
    Jia, L., Sun, M., He, M., Yang, M., Zhang, M., Yu, H., 2024. Study on the change of global ecological distribution of Nicotiana tabacum l. Based on maxent model. Front. Plant Sci. 15, 1371998.
    Jiang, X., Wang, J., Qin, Y., Li, Y., Ji, Y., Yang, A., Chen, L., Dai, P., Zhang, X., Liu, G., 2024. Whole-genome resequencing reveals genetic differentiation in cigar tobacco population. Ind. Crops Prod. 210.
    Kourelis, J., van der Hoorn, R.A.L., 2018. Defended to the nines: 25 years of resistance gene cloning identifies nine mechanisms for r protein function. Plant Cell 30, 285-299.
    Lan, T., Zheng, S., Yang, L., Wu, S., Wang, B., Zhang, S., Tong, Z., Chen, Y., Chen, S., Duan, Y., et al., 2014. Mapping of quantitative trait loci conferring resistance to bacterial wilt in tobacco (Nicotiana tabacum l.). Plant Breed. 133, 672-677.
    Leitch, I.J., Hanson, L., Lim, K.Y., Kovarik, A., Chase, M.W., Clarkson, J.J., Leitch, A.R., 2008. The ups and downs of genome size evolution in polyploid species of Nicotiana (Solanaceae). Ann. Bot. 101, 805-814.
    Lewis, R.S., Nicholson, J.S., 2006. Aspects of the evolution of Nicotiana tabacum l. And the status of the United States Nicotiana germplasm collection. Genet. Resour. Crop Evol. 54, 727-740.
    Li, H., Durbin, R., 2010. Fast and accurate long-read alignment with burrows-wheeler transform. Bioinformatics 26, 589-595.
    Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., Durbin, R., Genome Project Data Processing, S., 2009. The sequence alignment/map format and samtools. Bioinformatics 25, 2078-2079.
    Li, H., Peng, Z., Yang, X., Wang, W., Fu, J., Wang, J., Han, Y., Chai, Y., Guo, T., Yang, N., et al., 2013. Genome-wide association study dissects the genetic architecture of oil biosynthesis in maize kernels. Nat. Genet. 45, 43-50.
    Lim, K.Y., Skalicka, K., Koukalova, B., Volkov, R.A., Matyasek, R., Hemleben, V., Leitch, A.R., Kovarik, A., 2004. Dynamic changes in the distribution of a satellite homologous to intergenic 26-18s rDNA spacer in the evolution of Nicotiana. Genetics 166, 1935-1946.
    Liu, Y., Yuan, G., Si, H., Sun, Y., Jiang, Z., Liu, D., Jiang, C., Pan, X., Yang, J., Luo, Z., et al., 2022. Identification of QTLs associated with agronomic traits in tobacco via a biparental population and an eight-way magic population. Front. Plant Sci. 13, 878267.
    Ma, J.M., Heim, C., Humphry, M., Nifong, J.M., Lewis, R.S., 2019. Genetic analysis of phn7.1, a major qtl conferring partial resistance to Phytophthora nicotianae in Nicotiana tabacum. Mol. Breed. 39.
    Martinez-Vazquez, J.P., Loera-Muro, A., Gomez-Aguirre, Y.A., Morales-Dominguez, J.F., 2021. Identification and characterization of the EXPA7, EXPA18 and EXT10 genes in Turbinicarpus lophophoroides (Werderm.) Buxb. & Backeb; and their expression analysis in the root under abiotic stress. Mol. Biol. Rep. 48, 1633-1644.
    McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., et al., 2010. The genome analysis toolkit: A mapreduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297-1303.
    Members, C.-N., Partners, 2024. Database resources of the National Genomics Data Center, China National Center for Bioinformation in 2024. Nucleic Acids Res. 52, D18-D32.
    Minh, B.Q., Schmidt, H.A., Chernomor, O., Schrempf, D., Woodhams, M.D., von Haeseler, A., Lanfear, R., 2020. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530-1534.
    Morales-Navarro, S., Perez-Diaz, R., Ortega, A., de Marcos, A., Mena, M., Fenoll, C., Gonzalez-Villanueva, E., Ruiz-Lara, S., 2018. Overexpression of a SDD1-like gene from wild tomato decreases stomatal density and enhances dehydration avoidance in Arabidopsis and cultivated tomato. Front. Plant Sci. 9, 940.
    Novak, J., Blüthner, W.-D. (Eds.), 2020. Medicinal, Aromatic and Stimulant Plants, Handbook of Plant Breeding. Springer International Publishing, Cham, pp. 346-375.
    Otten, L., De Greve, H., Hernalsteens, J.P., Van Montagu, M., Schieder, O., Straub, J., Schell, J., 1981. Mendelian transmission of genes introduced into plants by the Ti plasmids of Agrobacterium tumefaciens. Mol. Gen. Genet. 183, 209-213.
    Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M.A., Bender, D., Maller, J., Sklar, P., de Bakker, P.I., Daly, M.J., et al., 2007. Plink: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559-575.
    Renny-Byfield, S., Chester, M., Kovarik, A., Le Comber, S.C., Grandbastien, M.A., Deloger, M., Nichols, R.A., Macas, J., Novak, P., Chase, M.W., et al., 2011. Next generation sequencing reveals genome downsizing in allotetraploid Nicotiana tabacum, predominantly through the elimination of paternally derived repetitive DNAs. Mol. Biol. Evol. 28, 2843-2854.
    Rowland, O., Zheng, H., Hepworth, S.R., Lam, P., Jetter, R., Kunst, L., 2006. CER4 encodes an alcohol-forming fatty acyl-coenzyme A reductase involved in cuticular wax production in Arabidopsis. Plant Physiol. 142, 866-877.
    Sallaud, C., Giacalone, C., Topfer, R., Goepfert, S., Bakaher, N., Rosti, S., Tissier, A., 2012. Characterization of two genes for the biosynthesis of the labdane diterpene Z-abienol in tobacco (Nicotiana tabacum) glandular trichomes. Plant J. 72, 1-17.
    Shi, R., Jin, J., Nifong, J.M., Shew, D., Lewis, R.S., 2022. Homoeologous chromosome exchange explains the creation of a QTL affecting soil-borne pathogen resistance in tobacco. Plant Biotechnol. J. 20, 47-58.
    Sierro, N., Auberson, M., Dulize, R., Ivanov, N.V., 2024. Chromosome-level genome assemblies of Nicotiana tabacum, Nicotiana sylvestris, and Nicotiana tomentosiformis. Sci. Data. 11, 135.
    Sierro, N., Battey, J.N., Ouadi, S., Bakaher, N., Bovet, L., Willig, A., Goepfert, S., Peitsch, M.C., Ivanov, N.V., 2014. The tobacco genome sequence and its comparison with those of tomato and potato. Nat. Commun. 5, 3833.
    Song, Y., Wang, Y., Liu, Y., Li, H., Ding, J., Wu, X., Li, Y., Jiao, F., Yang, L., 2025. Whole genome re-sequencing in 437 tobacco germplasms identifies plant height candidate genes. Sci. Rep. 15, 4734.
    Takatsuka, H., Umeda-Hara, C., Umeda, M., 2015. Cyclin-dependent kinase-activating kinases CDKD;1 and CDKD;3 are essential for preserving mitotic activity in Arabidopsis thaliana. Plant J. 82, 1004-1017.
    Tanaka, H., Watanabe, M., Sasabe, M., Hiroe, T., Tanaka, T., Tsukaya, H., Ikezaki, M., Machida, C., Machida, Y., 2007. Novel receptor-like kinase ALE2 controls shoot development by specifying epidermis in Arabidopsis. Development 134, 1643-1652.
    Thimmegowda, G.C., Ramadoss, S.K., Kaikala, V., Rathinavelu, R., Thamalampudi, V.R., Dhavala, V.N.C., Saiprasad, G.V.S., 2018. Whole genome resequencing of tobacco (Nicotiana tabacum l.) genotypes and high-throughput snp discovery. Mol. Breed. 38.
    Tong, Z., Ullah, A., Kamran, M., Chen, X., Lin, F., Zhu, T., Xu, H., Xiao, B., 2025. QTL mapping and association analysis revealed complicate genetic architecture of 17 smoke and chemical traits in tobacco (Nicotiana tabacum l.). Ind. Crops Prod. 225.
    Tong, Z., Zhou, J., Xiu, Z., Jiao, F., Hu, Y., Zheng, F., Chen, X., Li, Y., Fang, D., Li, S., et al., 2020. Construction of a high-density genetic map with whole genome sequencing in Nicotiana tabacum l. Genomics 112, 2028-2033.
    Wang, J., Zhang, Q., Tung, J., Zhang, X., Liu, D., Deng, Y., Tian, Z., Chen, H., Wang, T., Yin, W., et al., 2024. High-quality assembled and annotated genomes of Nicotiana tabacum and Nicotiana benthamiana reveal chromosome evolution and changes in defense arsenals. Mol. Plant 17, 423-437.
    Wang, Y., Zhou, J., 1995. Parentage analysis of major tobacco varieties and tobacco breeding in America and China. Acta Tabacaria Sinica 2, 11-22.
    Yang, H., Geng, X., Zhao, S., Shi, H., 2020. Genomic diversity analysis and identification of novel ssr markers in four tobacco varieties by high-throughput resequencing. Plant Physiol. Biochem. 150, 80-89.
    Yuan, G., Sun, K., Yu, W., Jiang, Z., Jiang, C., Liu, D., Wen, L., Si, H., Wu, F., Meng, H., et al., 2022. Development of a magic population and high-resolution quantitative trait mapping for nicotine content in tobacco. Front. Plant Sci. 13, 1086950.
    Zan, Y., Chen, S., Ren, M., Liu, G., Liu, Y., Han, Y., Dong, Y., Zhang, Y., Si, H., Liu, Z., et al., 2025. The genome and genebank genomics of allotetraploid Nicotiana tabacum provide insights into genome evolution and complex trait regulation. Nat. Genet. 57, 986-996.
    Zhang, C., Dong, S.S., Xu, J.Y., He, W.M., Yang, T.L., 2019. Poplddecay: A fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 35, 1786-1788.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (26) PDF downloads (0) Cited by ()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return