Glycogenomics
Glycogenomics
Definition
Glycogenomics is a field of biochemistry that focuses on the comprehensive study of genes and enzymes involved in carbohydrate metabolism and glycosylation within an organism.:[1] It encompasses the analysis of carbohydrate-active enzymes (CAZymes), glycan-related genes, and the genome-wide organization of carbohydrate biosynthesis, modification, and degradation. By integrating concepts from genomics, bioinformatics, and glycobiology, glycogenomics seeks to understand how genetic information determines the structure, diversity, and biological roles of complex carbohydrates.[2]
Background
Interplay between Glycobiology and Genomics
Glycans (monosaccharides, oligosaccharides, and polysaccharides) and glycoconjugates, including glycoproteins, glycolipids, and glycosylated natural products, perform a wide range of essential biological functions. These include roles in cell–cell communication, molecular recognition, immune response, energy storage, structural integrity, and pathogenesis.[3]
Unlike nucleic acids and proteins, glycan biosynthesis is not template-driven (DNA →RNA →Protein). Instead, it is governed by complex networks of glycosyltransferases, glycosidases, lectins, and other carbohydrate-active enzymes (CAZymes) that function in a coordinated and often cell–type–specific manner. [4]Due to this complexity, understanding the structure and function of glycans necessitates an integrative approach that combines genomics, proteomics, metabolomics, and structural biology.
Glycogenomics lies at the interface of genomics, which provides information on gene sequences and families; enzymology, which studies the catalytic functions of CAZymes; and glycomics, which investigates glycan composition and biological roles. It encompasses genome mapping of glycan-related genes, prediction of enzyme functions, and linking genomic context to glycan phenotypes across diverse organisms.[5]
The emergence of large-scale bioinformatics resources, such as the CAZy database and UniProt, has enabled the genome-wide annotation of carbohydrate-related genes in bacteria, fungi, plants, and animals, thereby accelerating discoveries in glycobiology and biotechnology.[1]
Glycogenomics and the CAZy System
A cornerstone of glycogenomics research is the Carbohydrate-Active Enzymes (CAZy) classification system, which organizes carbohydrate-related enzymes into families based on sequence similarity and catalytic mechanisms. Main CAZy categories include[1]:
- Glycoside hydrolases (GHs) – enzymes that hydrolyze glycosidic bonds.
- Glycosyltransferases (GTs) – enzymes that assemble glycans by forming glycosidic linkages.
- Carbohydrate esterases (CEs) – enzymes that modify carbohydrate esters.
- Polysaccharide lyases (PLs) – enzymes that cleave polysaccharides through eliminative mechanisms.
- Auxiliary activities (AAs) – redox enzymes involved in lignocellulose degradation.
- Carbohydrate-binding modules (CBMs) – non-catalytic domains that bind carbohydrates
Through genome-wide glycogenomic profiling, researchers can identify and characterize the “CAZome” the complete set of CAZy-related genes encoded by an organism. Comparative analysis of CAZomes across species reveals insights into their metabolic capacities, ecological adaptations, and the evolutionary diversification of carbohydrate-processing systems.
The term “glycogenes” refers to genes that encode enzymes, transporters, and accessory proteins involved in the biosynthesis, modification, or conjugation of glycans, often functioning within the secretory pathway.In humans, approximately 200–300 glycogenes have been identified, representing roughly 1–2% of the genome.[6]
Mapping expression patterns of glycogenes (tissue-specific, developmental stage-specific) is part of glycogenomics. The “glycome” of a given cell or tissue represents the full set of glycans present, which reflects the combined activity of expressed glycogens.
Genome Mining for Glycosylated Natural Products
In microbial genomics, glycogenomic methods are used to identify gene clusters involved in the biosynthesis of glycosylated natural products, such as antibiotics and pigments. By screening sequenced microbial genomes for clusters encoding glycosyltransferases, deoxysugar biosynthetic enzymes, and tailoring enzymes, researchers can predict and discover novel glycosylated metabolites. Mass spectrometry (MS), particularly tandem MS of glycosyl units, is often used to connect genomic data with the chemical structures of natural products, enabling genome-guided discovery.[7]
Bioinformatics and Evolutionary Analysis
Glycogenomics involves a combination of computational and evolutionary approaches to analyze carbohydrate-active genes[8]:
- Annotating glycoenzyme families in newly sequenced genomes (predicting GH, GT, CE, and PL families)
- Studying evolutionary relationships among CAZymes, including modularity, multidomain architecture, gene duplication, and convergent evolution.
- Integrating data from glycomics, genomics, and proteomics to link genetic variation in glycogenes to observed glycan phenotypes.
Methods
Bioinformatics Approaches
- CAZy database: A comprehensive catalog of carbohydrate-active enzyme families[1].
- dbCAN2 server: A web-based tool for automated CAZyme identification[9].
- HMMER and BLAST: Identify conserved domains in protein sequences.[10]
- Comparative genomics and phylogenetic analysis: Reveal the diversity and evolution of carbohydrate-processing systems.[11]
Experimental Approaches
- Gene knockout and heterologous expression to confirm enzyme activity.
- Mass spectrometry (MS) and High-performance liquid chromatography (HPLC) are used to analyze glycan structures.[6]
- Reporter assays, such as the blue pigment synthetase A (BpsA) system, are used to study gene expression in Streptomyces and related bacteria.
Applications
Glycogenomics has broad applications in biotechnology, microbiology, Agriculture, and medicine, including[12]
- Natural product discovery: Identification of genes producing glycosylated bioactive compounds.
- Industrial biotechnology: Engineering microorganisms for biofuel production and bioconversion of plant biomass.
- Agricultural science: Understanding plant cell wall synthesis to improve crop yield and quality.
- Human health: Understanding host-microbe interactions through microbial glycan recognition and degradation pathways.
- Drug development: Discovery of carbohydrate-active enzymes for antibiotic and antiviral compound biosynthesis.
Related Fields
- Glycomics: Large-scale study of glycan structures within a cell or organism
- Glycobiology: Study of glycans and their biological role
- Glycoproteomics/Glycolipidomics: Large-scale profiling of glycosylated proteins and lipids
- Carbohydrate active enzymes (CAZymes) : Enzymes acting on Carbohydrates, and central to glycogenomics.
- Glycogenes: Genes encoding glycosylation machinery
Relation to other Genomic Fields
Glycogenomics shares many principles with other -omics disciplines:
- Genomics: Large-scale gene mapping and annotation.
- Proteomics: Study of enzyme expression, regulation and activity.
- Metabolomics: Linking gene activity to biochemical pathways and metabolites.
Together with glycomics, glycogenomics forms part of the broader field of systems glycobiology, which seeks to understand how genes, enzymes, and carbohydrates work together to shape biological function.
Databases and Resources
- CAZy database: Central reference for carbohydrate-active enzyme families.
- dbCAN2: Web-based tool for automated CAZyme annotation.
- UniProt[13]: Comprehensive protein sequence and functional database
- KEGG Glycan: Database linking glycans to metabolism and pathways.
See also
- Glycomics[14]
- Genomics[15]
- Proteomics[16]
- Carbohydrate-active enzyme[17]
- Bioinformatics[18]
- Metagenomics[19]
References
- ↑ 1.0 1.1 1.2 1.3 "CAZy", Wikipedia, 2025-08-13, retrieved 2025-11-16
- ↑ Pooresmaeil, Soheyla (2024-01-31). "The Role of Glycogenomics in Advancing Glycobiology". Journal of Glycobiology. 13 (1): 1–2. doi:10.35841/2168-958X.24.13.268 (inactive 16 November 2025). ISSN 2168-958X.
- ↑ Gagneux, Pascal; Hennet, Thierry; Varki, Ajit (2022), Varki, Ajit; Cummings, Richard D.; Esko, Jeffrey D.; Stanley, Pamela, eds., "Biological Functions of Glycans", Essentials of Glycobiology (4th ed.), Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press, doi:10.1101/glycobiology.4e.7 (inactive 16 November 2025), ISBN 978-1-62182-421-3, PMID 35536978 Check
|pmid=value (help), retrieved 2025-11-16 - ↑ Rini, James; Esko, Jeffrey; Varki, Ajit (2009), Varki, Ajit; Cummings, Richard D.; Esko, Jeffrey D.; Freeze, Hudson H., eds., "Glycosyltransferases and Glycan-processing Enzymes", Essentials of Glycobiology (2nd ed.), Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press, ISBN 978-0-87969-770-9, PMID 20301247, retrieved 2025-11-16
- ↑ West, Christopher M.; Malzl, Daniel; Hykollari, Alba; Wilson, Iain B. H. (2021). "Glycomics, Glycoproteomics, and Glycogenomics: An Inter-Taxa Evolutionary Perspective". Molecular & Cellular Proteomics: MCP. 20. doi:10.1074/mcp.R120.002263. ISSN 1535-9484. PMC 8724618 Check
|pmc=value (help). PMID 32994314 Check|pmid=value (help). Unknown parameter|article-number=ignored (help) - ↑ 6.0 6.1 Varki, A.; Cummings, R. D.; Esko, J. D.; Stanley, P.; Hart, G. W.; Aebi, M.; Mohnen, D.; Kinoshita, T.; Packer, N. H.; Prestegard, J. H.; Schnaar, R. L.; Seeberger, P. H. (2022). Varki, Ajit; Cummings, Richard D.; Esko, Jeffrey D.; Stanley, Pamela; Hart, Gerald W.; Aebi, Markus; Mohnen, Debra; Kinoshita, Taroh; Packer, Nicolle H., eds. Essentials of Glycobiology (4th ed.). Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press. doi:10.1101/9781621824213 (inactive 16 November 2025). ISBN 978-1-62182-421-3. PMID 35536922 Check
|pmid=value (help). Search this book on
- ↑ Kersten, Roland D.; Ziemert, Nadine; Gonzalez, David J.; Duggan, Brendan M.; Nizet, Victor; Dorrestein, Pieter C.; Moore, Bradley S. (2013-11-19). "Glycogenomics as a mass spectrometry-guided genome-mining method for microbial glycosylated molecules". Proceedings of the National Academy of Sciences of the United States of America. 110 (47): E4407–4416. Bibcode:2013PNAS..110E4407K. doi:10.1073/pnas.1315492110. ISSN 1091-6490. PMC 3839717. PMID 24191063.
- ↑ Windels, Alex; Franceus, Jorick; Pleiss, Jürgen; Desmet, Tom (2024). "CANDy: Automated analysis of domain architectures in carbohydrate-active enzymes". PLOS ONE. 19 (7): e0306410. Bibcode:2024PLoSO..1906410W. doi:10.1371/journal.pone.0306410. ISSN 1932-6203. PMC 11238990 Check
|pmc=value (help). PMID 38990885 Check|pmid=value (help). - ↑ Zhang, Han; Yohe, Tanner; Huang, Le; Entwistle, Sarah; Wu, Peizhi; Yang, Zhenglu; Busk, Peter K.; Xu, Ying; Yin, Yanbin (2018-07-02). "dbCAN2: a meta server for automated carbohydrate-active enzyme annotation". Nucleic Acids Research. 46 (W1): W95–W101. doi:10.1093/nar/gky418. ISSN 1362-4962. PMC 6031026. PMID 29771380.
- ↑ Genomics Aotearoa & NeSI. "Gene annotation I: BLAST-like and HMM - Metagenomics Summer School". genomicsaotearoa.github.io. Retrieved 2025-11-16.
- ↑ Pinard, Desre; Mizrachi, Eshchar; Hefer, Charles A.; Kersting, Anna R.; Joubert, Fourie; Douglas, Carl J.; Mansfield, Shawn D.; Myburg, Alexander A. (2015-05-22). "Comparative analysis of plant carbohydrate active enZymes and their role in xylogenesis". BMC Genomics. 16 (1): 402. doi:10.1186/s12864-015-1571-8. ISSN 1471-2164. PMC 4440533. PMID 25994181.
- ↑ Bains, Rajneesh K.; Nasseri, Seyed Amirhossein; Wardman, Jacob F.; Withers, Stephen G. (2024-06-01). "Advances in the understanding and exploitation of carbohydrate-active enzymes". Current Opinion in Chemical Biology. 80. doi:10.1016/j.cbpa.2024.102457. ISSN 1367-5931. PMID 38657391 Check
|pmid=value (help). Unknown parameter|article-number=ignored (help) - ↑ "UniProt", Wikipedia, 2025-07-30, retrieved 2025-11-16
- ↑ "Glycomics", Wikipedia, 2025-10-27, retrieved 2025-11-16
- ↑ "Genomics", Wikipedia, 2025-10-09, retrieved 2025-11-16
- ↑ "Proteomics", Wikipedia, 2025-10-02, retrieved 2025-11-16
- ↑ "CAZy", Wikipedia, 2025-08-13, retrieved 2025-11-16
- ↑ "Bioinformatics", Wikipedia, 2025-10-18, retrieved 2025-11-16
- ↑ "Metagenomics", Wikipedia, 2025-11-14, retrieved 2025-11-16
This article "Glycogenomics" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:Glycogenomics. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.
