Pan-genome analysis and functional characterisation of the terpene synthase (TPS) gene family in five varieties of yellowhorn (Xanthoceras sorbifolium)
Tao Lu A # , Shuaiyu Jiang A # , Xinyu Liu A # , Zhen Lu A , Muhammad Sadaqat B , Chen Chen A * , Xiangyu Zuo A * and Muhammad Tahir ul Qamar
A
B
C
# These authors contributed equally to this paper
Handling Editor: Muhammad Nadeem
Abstract
The terpene synthase (TPS) gene family is integral to the biosynthesis of terpenoids, which are vital for plant defence, development, and interaction with the environment. Yellowhorn (Xanthoceras sorbifolium) has gained attention for its bioactive compounds, particularly terpenoids, which have applications in pharmaceuticals, biofuels, and cosmetics. This study provides a comprehensive pan-genome-wide analysis of the TPS gene family across five yellowhorn varieties (Xg11, Xzs4, Xwf8, Xjg, and Xzg2). A total of 257 TPS genes were identified and characterised, showing diversity in their evolutionary patterns. Phylogenetic analysis revealed distinct clades corresponding to functional classes of TPS genes. Conserved domains and motifs of these genes were analysed to highlight their structural characteristics. Furthermore, expression profiling under abiotic stresses, including cold and drought, was conducted, revealing the roles of specific TPS genes in stress tolerance. Tissue-specific expression analysis demonstrated the involvement of TPS genes in key physiological processes across different plant organs. This research advances our understanding of the TPS gene family in yellowhorn, with implications for improving crop resilience and biotechnological applications.
Keywords: abiotic stress, enrichment analysis, gene expression, genome-wide analysis, pan-genes, phylogenetic analysis, terpene synthase, Yellowhorn.
References
Aubourg S, Lecharny A, Bohlmann J (2002) Genomic analysis of the terpenoid synthase (AtTPS) gene family of Arabidopsis thaliana. Molecular Genetics and Genomics 267, 730-745.
| Crossref | Google Scholar | PubMed |
Bailey TL, Johnson J, Grant CE, et al. (2015) The MEME suite. Nucleic Acids Research 43(W1), W39-W49.
| Crossref | Google Scholar | PubMed |
Camacho C, Coulouris G, Avagyan V, et al. (2009) BLAST+: architecture and applications. BMC Bioinformatics 10, 421.
| Crossref | Google Scholar |
Cao Z, Ma Q, Weng Y, et al. (2023) Genome-wide identification and expression analysis of TPS gene family in Liriodendron chinense. Genes 14(3), 770.
| Crossref | Google Scholar | PubMed |
Chen F, Tholl D, Bohlmann J, et al. (2011) The family of terpene synthases in plants: a mid-size family of genes for specialized metabolism that is highly diversified throughout the kingdom. The Plant Journal 66(1), 212-229.
| Crossref | Google Scholar | PubMed |
Chen Z, Vining K, Qi X, et al. (2021) Genome-wide analysis of terpene synthase gene family in Mentha longifolia and catalytic activity analysis of a single terpene synthase. Genes 12(04), 518.
| Crossref | Google Scholar |
Fàbregas N, Fernie AR (2022) The reliance of phytohormone biosynthesis on primary metabolite precursors. Journal of Plant Physiology 268, 153589.
| Crossref | Google Scholar | PubMed |
Fatima K, Sadaqat M, Azeem F, et al. (2023) Integrated omics and machine learning-assisted profiling of cysteine-rich-receptor-like kinases from three peanut spp. revealed their role in multiple stresses. Frontiers in Genetics 14, 1252020.
| Crossref | Google Scholar |
Han X, Zhang J, Han S, et al. (2022) The chromosome-scale genome of Phoebe bournei reveals contrasting fates of terpene synthase (TPS)-a and TPS-b subfamilies. Plant Communications 3(6), 100410.
| Crossref | Google Scholar |
Horton P, Park K-J, Obayashi T, et al. (2007) WoLF PSORT: protein localization predictor. Nucleic Acids Research 35(suppl_2), W585-W587.
| Crossref | Google Scholar |
Hu B, Jin J, Guo A-Y, et al. (2015) GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics 31(8), 1296-1297.
| Crossref | Google Scholar | PubMed |
Huala E, Dickerman AW, Garcia-Hernandez M, et al. (2001) The Arabidopsis information resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant. Nucleic Acids Research 29(1), 102-105.
| Crossref | Google Scholar | PubMed |
Huang H, Kuo Y-W, Chuang Y-C, et al. (2021) Terpene synthase-b and terpene synthase-e/f genes produce monoterpenes for Phalaenopsis bellina floral scent. Frontiers in Plant Science 12, 700958.
| Crossref | Google Scholar | PubMed |
Hunter S, Apweiler R, Attwood TK, et al. (2009) InterPro: the integrative protein signature database. Nucleic Acids Research 37(suppl_1), D211-D215.
| Crossref | Google Scholar |
Irmisch S, Jiang Y, Chen F, et al. (2014) Terpene synthases and their contribution to herbivore-induced volatile emission in western balsam poplar (Populus trichocarpa). BMC Plant Biology 14, 270.
| Crossref | Google Scholar |
Jia Q, Brown R, Köllner TG, et al. (2022) Origin and early evolution of the plant terpene synthase family. Proceedings of the National Academy of Sciences 119(15), e2100361119.
| Crossref | Google Scholar |
Jiang L, Chen S, Wang X, et al. (2024) An improved genome assembly of Chrysanthemum nankingense reveals expansion and functional diversification of terpene synthase gene family. BMC Genomics 25(1), 593.
| Crossref | Google Scholar | PubMed |
Keeling CI, Weisshaar S, Ralph SG, et al. (2011) Transcriptome mining, functional characterization, and phylogeny of a large terpene synthase gene family in spruce (Picea spp.). BMC Plant Biology 11, 43.
| Crossref | Google Scholar |
Kim D, Paggi JM, Park C, et al. (2019) Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nature Biotechnology 37(8), 907-915.
| Crossref | Google Scholar | PubMed |
Kovaka S, Zimin AV, Pertea GM, et al. (2019) Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biology 20(1), 278.
| Crossref | Google Scholar |
Kumar Y, Khan F, Rastogi S, et al. (2018) Genome-wide detection of terpene synthase genes in holy basil (Ocimum sanctum L.). PLoS ONE 13(11), e0207097.
| Crossref | Google Scholar | PubMed |
Lescot M, Déhais P, Thijs G, et al. (2002) PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Research 30(1), 325-327.
| Crossref | Google Scholar | PubMed |
Letunic I, Bork P (2021) Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Research 49(W1), W293-W296.
| Crossref | Google Scholar | PubMed |
Letunic I, Doerks T, Bork P (2015) SMART: recent updates, new developments and status in 2015. Nucleic Acids Research 43(D1), D257-D260.
| Crossref | Google Scholar |
Li R-S, Zhu J-H, Guo D, et al. (2021) Genome-wide identification and expression analysis of terpene synthase gene family in Aquilaria sinensis. Plant Physiology and Biochemistry 164, 185-194.
| Crossref | Google Scholar | PubMed |
Li M, Li X, Zhou J, et al. (2022) Genome-wide identification and analysis of terpene synthase (TPS) genes in celery reveals their regulatory roles in terpenoid biosynthesis. Frontiers in Plant Science 13, 1010780.
| Crossref | Google Scholar | PubMed |
Liu G, Liu F, Pan L, et al. (2024) Agronomic, physiological and transcriptional characteristics provide insights into fatty acid biosynthesis in yellowhorn (Xanthoceras sorbifolium Bunge) during fruit ripening. Frontiers in Genetics 15, 1325484.
| Crossref | Google Scholar | PubMed |
Marchler-Bauer A, Lu S, Anderson JB, et al. (2011) CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Research 39(Suppl. 1), D225-D229.
| Crossref | Google Scholar |
Martin DM, Aubourg S, Schouwey MB, et al. (2010) Functional annotation, genome organization and phylogeny of the grapevine (Vitis vinifera) terpene synthase gene family based on genome assembly, FLcDNA cloning, and enzyme assays. BMC Plant Biology 10, 226.
| Crossref | Google Scholar |
Mehari TG, Fang H, Feng W, et al. (2023) Genome-wide identification and expression analysis of terpene synthases in Gossypium species in response to gossypol biosynthesis. Functional & Integrative Genomics 23(2), 197.
| Crossref | Google Scholar | PubMed |
Mistry J, Chuguransky S, Williams L, et al. (2021) Pfam: the protein families database in 2021. Nucleic Acids Research 49(D1), D412-D419.
| Crossref | Google Scholar | PubMed |
Nieuwenhuizen NJ, Green SA, Chen X, et al. (2013) Functional genomics reveals that a compact terpene synthase gene family can account for terpene volatile production in apple. Plant Physiology 161(2), 787-804.
| Crossref | Google Scholar | PubMed |
Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, et al. (2017) DnaSP 6: DNA sequence polymorphism analysis of large data sets. Molecular Biology and Evolution 34(12), 3299-3302.
| Crossref | Google Scholar | PubMed |
Sadaqat M, Umer B, Attia KA, et al. (2023) Genome-wide identification and expression profiling of two-component system (TCS) genes in Brassica oleracea in response to shade stress. Frontiers in Genetics 14, 1142544.
| Crossref | Google Scholar |
Sadaqat M, Fatima K, Azeem F, et al. (2024) Computational analysis and expression profiling of Two-component system (TCS) gene family members in Mango (Mangifera indica) indicated their roles in stress response. Functional Plant Biology 51, FP24055.
| Crossref | Google Scholar |
Tahir ul Qamar M, Fatima K, Rao MJ, et al. (2025) Comparative genomics profiling of Citrus species reveals the diversity and disease responsiveness of the GLP pangenes family. BMC Plant Biology 25(1), 388.
| Crossref | Google Scholar |
Tetali SD (2019) Terpenes and isoprenoids: a wealth of compounds for global use. Planta 249, 1-8.
| Crossref | Google Scholar | PubMed |
Thompson JD, Gibson TJ, Higgins DG (2003) Multiple sequence alignment using ClustalW and ClustalX. Current Protocols in Bioinformatics 1-22.
| Crossref | Google Scholar |
Törönen P, Medlar A, Holm L (2018) PANNZER2: a rapid functional annotation web server. Nucleic Acids Research 46(W1), W84-W88.
| Crossref | Google Scholar | PubMed |
Trifinopoulos J, Nguyen L-T, von Haeseler A, et al. (2016) W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Research 44(W1), W232-W235.
| Crossref | Google Scholar | PubMed |
ul Qamar MT, Sadaqat M, Zhu X-T, et al. (2023) Comparative genomics profiling revealed multi-stress responsive roles of the CC-NBS-LRR genes in three mango cultivars. Frontiers in Plant Science 14, 1285547.
| Crossref | Google Scholar |
Wang X, Wang Q, Jia Q, et al. (2023) Optimal tree architecture for high-yield yellowhorn (Xanthoceras sorbifolium) management. Food and Energy Security 12(5), e500.
| Crossref | Google Scholar |
Xu Y, Wang Y, Mattson N, et al. (2017) Genome-wide analysis of the Solanum tuberosum (potato) trehalose-6-phosphate synthase (TPS) gene family: evolution and differential expression during development and stress. BMC Genomics 18(1), 926.
| Crossref | Google Scholar |
Yang Y, Liu X, Xu H, et al. (2024) Physiological and transcriptome analysis provided insights for the response of yellowhorn to drought stress. Trees 38(3), 725-742.
| Crossref | Google Scholar |
Zameer R, Sadaqat M, Fatima K, et al. (2021) Two-component system genes in Sorghum bicolor: genome-wide identification and expression profiling in response to environmental stresses. Frontiers in Genetics 12, 794305.
| Crossref | Google Scholar |
Zameer R, Fatima K, Azeem F, et al. (2022) Genome-wide characterization of superoxide dismutase (SOD) Genes in daucus carota: novel insights into structure, expression, and binding interaction with hydrogen peroxide (H2O2) under abiotic stress condition. Frontiers in Plant Science 13, 870241.
| Crossref | Google Scholar |
Zhang C-P, Zhang J-L, Sun Z-R, et al. (2022) Genome-wide identification and characterization of terpene synthase genes in Gossypium hirsutum. Gene 828, 146462.
| Crossref | Google Scholar | PubMed |
Zhang Y, Ma L, Su P, et al. (2023) Cytochrome P450s in plant terpenoid biosynthesis: discovery, characterization and metabolic engineering. Critical Reviews in Biotechnology 43(1), 1-21.
| Crossref | Google Scholar | PubMed |
Zhang W, Xie X, Le L, et al. (2024) Transcriptional profiling reveals key regulatory roles of the WUSCHEL-related homeobox gene family in Yellowhorn (Xanthoceras sorbifolia Bunge). Forests 15(2), 376.
| Crossref | Google Scholar |
Zhao L, Zhao X, Francis F, et al. (2021) Genome-wide identification and characterization of the TPS gene family in wheat (Triticum aestivum L.) and expression analysis in response to aphid damage. Acta Physiologiae Plantarum 43(4), 64.
| Crossref | Google Scholar |
Zhu R-B, Wang Q, Guan W-B, et al. (2019) Conservation of genetic diversity hotspots of the high-valued relic yellowhorn (Xanthoceras sorbifolium) considering climate change predictions. Ecology and Evolution 9(6), 3251-3263.
| Crossref | Google Scholar | PubMed |
Zia K, Rao MJ, Sadaqat M, et al. (2022) Pangenome-wide analysis of cyclic nucleotide-gated channel (CNGC) gene family in citrus Spp. Revealed their intraspecies diversity and potential roles in abiotic stress tolerance. Frontiers in Genetics 13, 1034921.
| Crossref | Google Scholar | PubMed |
Zia K, Sadaqat M, Ding B, et al. (2024) Comparative genomics and bioinformatics approaches revealed the role of CC-NBS-LRR genes under multiple stresses in passion fruit. Frontiers in Genetics 15, 1358134.
| Crossref | Google Scholar |