The UDP glycosyltransferase gene superfamily: Rcommended nomenclature update based on evolutionary divergence

Peter I. Mackenzie, Ida S. Owens, Brian Burchell, K. W. Bock, Amos Bairoch, Alain Bélanger, Sylvie Fournel-Gigleux, Mitchell Green, Dean W. Hum, Takashi Iyanagi, Doron Lancet, Pierre Louisot, Jacques Magdalou, Jayanta Roy-Chowdhury, Joseph K. Ritter, Harry Schachter, Thomas R. Tephly, Keith F. Tipton, Daniel W. Nebert

Research output: Contribution to journalArticle

904 Citations (Scopus)

Abstract

This review represents an update of the nomenclature system for the UDP glucuronosyltransferase gene superfamily, which is based on divergent evolution. Since the previous review in 1991, sequences of many related UDP glycosyltransferases from lower organisms have appeared in the database, which expand our database considerably. At latest count, in animals, yeast, plants and bacteria there are 110 distinct cDNAs/genes whose protein products all contain a characteristic 'signature sequence' and, thus, are regarded as members of the same superfamily. Comparison of a relatedness tree of proteins leads to the definition of 33 families, it should be emphasized that at least six cloned UDP-GlcNAc N-acetylglucosaminyltransferases are not sufficiently homologous to be included as members of this superfamily and may represent an example of convergent evolution. For naming each gene, it is recommended that the root symbol UGT for human (Ugt for mouse and Drosophila), denoting 'UDP glycosyltransferase,' be followed by an Arabic number representing the family a letter designating the subfamily, and an Arabic numeral denoting the individual gene within the family or subfamily, e.g, 'human UGT2B4' and mouse Ugt2b5'. We recommend the name 'UDP glycosyltransferase' because many of the proteins do not preferentially tially use UDP glucuronic acid, or their nucleotide sugar preference is unknown. Whereas the gene is italicized, the corresponding cDNA, transcript, protein and enzyme activity should be written with upper-case letters and without italics, e.g. 'human or mouse UGT1A1. 'The UGT1 gene (spanning > 500 kb) contains at least 12 promoters/first exons, which can be spliced and joined with common exons 2 through 5, leading to different N-terminal halves but identical C-terminal halves of the gene products; in this scheme each first exon is regarded as a distinct gene (e.g. UGT1A1, UGT1A2,... UGT1A12). When an orthologous gene between species cannot be identified with certainty, as occurs in the UGT2B subfamily, sequential naming of the genes is being carried out chronologically as they become characterized. We suggest that the Human Gene Nomenclature Guidelines (http://www.gene.acl.ac.uk/nomenclature/guidelines.html) be used for all species other than the mouse and Drosophila. Thirty published human UGT1A1 mutant alleles responsible for clinical hyperbilirubinemias are listed herein, and given numbers following an asterisk (e.g. UGT1A1*30) consistent with the Human Gene Nomenclature Guidelines. It is anticipated that this UGT gene nomenclature system will require updating on a regular basis.

Original languageEnglish (US)
Pages (from-to)255-269
Number of pages15
JournalPharmacogenetics
Volume7
Issue number4
DOIs
StatePublished - 1997

Fingerprint

Glycosyltransferases
Uridine Diphosphate
Terminology
Genes
Exons
Guidelines
Drosophila
N-Acetylglucosaminyltransferases
Proteins
Complementary DNA
Uridine Diphosphate Glucuronic Acid
Databases
Glucuronosyltransferase
Names

Keywords

  • Evolution
  • Gene nomenclature
  • Glucuronidation
  • Glycosylation
  • Human genetics
  • Hyperbilirubinemia

ASJC Scopus subject areas

  • Genetics
  • Pharmacology, Toxicology and Pharmaceutics(all)

Cite this

Mackenzie, P. I., Owens, I. S., Burchell, B., Bock, K. W., Bairoch, A., Bélanger, A., ... Nebert, D. W. (1997). The UDP glycosyltransferase gene superfamily: Rcommended nomenclature update based on evolutionary divergence. Pharmacogenetics, 7(4), 255-269. https://doi.org/10.1097/00008571-199708000-00001

The UDP glycosyltransferase gene superfamily : Rcommended nomenclature update based on evolutionary divergence. / Mackenzie, Peter I.; Owens, Ida S.; Burchell, Brian; Bock, K. W.; Bairoch, Amos; Bélanger, Alain; Fournel-Gigleux, Sylvie; Green, Mitchell; Hum, Dean W.; Iyanagi, Takashi; Lancet, Doron; Louisot, Pierre; Magdalou, Jacques; Roy-Chowdhury, Jayanta; Ritter, Joseph K.; Schachter, Harry; Tephly, Thomas R.; Tipton, Keith F.; Nebert, Daniel W.

In: Pharmacogenetics, Vol. 7, No. 4, 1997, p. 255-269.

Research output: Contribution to journalArticle

Mackenzie, PI, Owens, IS, Burchell, B, Bock, KW, Bairoch, A, Bélanger, A, Fournel-Gigleux, S, Green, M, Hum, DW, Iyanagi, T, Lancet, D, Louisot, P, Magdalou, J, Roy-Chowdhury, J, Ritter, JK, Schachter, H, Tephly, TR, Tipton, KF & Nebert, DW 1997, 'The UDP glycosyltransferase gene superfamily: Rcommended nomenclature update based on evolutionary divergence', Pharmacogenetics, vol. 7, no. 4, pp. 255-269. https://doi.org/10.1097/00008571-199708000-00001
Mackenzie, Peter I. ; Owens, Ida S. ; Burchell, Brian ; Bock, K. W. ; Bairoch, Amos ; Bélanger, Alain ; Fournel-Gigleux, Sylvie ; Green, Mitchell ; Hum, Dean W. ; Iyanagi, Takashi ; Lancet, Doron ; Louisot, Pierre ; Magdalou, Jacques ; Roy-Chowdhury, Jayanta ; Ritter, Joseph K. ; Schachter, Harry ; Tephly, Thomas R. ; Tipton, Keith F. ; Nebert, Daniel W. / The UDP glycosyltransferase gene superfamily : Rcommended nomenclature update based on evolutionary divergence. In: Pharmacogenetics. 1997 ; Vol. 7, No. 4. pp. 255-269.
@article{1cd8d98a4a824bd1b69bf6c61c4d9930,
title = "The UDP glycosyltransferase gene superfamily: Rcommended nomenclature update based on evolutionary divergence",
abstract = "This review represents an update of the nomenclature system for the UDP glucuronosyltransferase gene superfamily, which is based on divergent evolution. Since the previous review in 1991, sequences of many related UDP glycosyltransferases from lower organisms have appeared in the database, which expand our database considerably. At latest count, in animals, yeast, plants and bacteria there are 110 distinct cDNAs/genes whose protein products all contain a characteristic 'signature sequence' and, thus, are regarded as members of the same superfamily. Comparison of a relatedness tree of proteins leads to the definition of 33 families, it should be emphasized that at least six cloned UDP-GlcNAc N-acetylglucosaminyltransferases are not sufficiently homologous to be included as members of this superfamily and may represent an example of convergent evolution. For naming each gene, it is recommended that the root symbol UGT for human (Ugt for mouse and Drosophila), denoting 'UDP glycosyltransferase,' be followed by an Arabic number representing the family a letter designating the subfamily, and an Arabic numeral denoting the individual gene within the family or subfamily, e.g, 'human UGT2B4' and mouse Ugt2b5'. We recommend the name 'UDP glycosyltransferase' because many of the proteins do not preferentially tially use UDP glucuronic acid, or their nucleotide sugar preference is unknown. Whereas the gene is italicized, the corresponding cDNA, transcript, protein and enzyme activity should be written with upper-case letters and without italics, e.g. 'human or mouse UGT1A1. 'The UGT1 gene (spanning > 500 kb) contains at least 12 promoters/first exons, which can be spliced and joined with common exons 2 through 5, leading to different N-terminal halves but identical C-terminal halves of the gene products; in this scheme each first exon is regarded as a distinct gene (e.g. UGT1A1, UGT1A2,... UGT1A12). When an orthologous gene between species cannot be identified with certainty, as occurs in the UGT2B subfamily, sequential naming of the genes is being carried out chronologically as they become characterized. We suggest that the Human Gene Nomenclature Guidelines (http://www.gene.acl.ac.uk/nomenclature/guidelines.html) be used for all species other than the mouse and Drosophila. Thirty published human UGT1A1 mutant alleles responsible for clinical hyperbilirubinemias are listed herein, and given numbers following an asterisk (e.g. UGT1A1*30) consistent with the Human Gene Nomenclature Guidelines. It is anticipated that this UGT gene nomenclature system will require updating on a regular basis.",
keywords = "Evolution, Gene nomenclature, Glucuronidation, Glycosylation, Human genetics, Hyperbilirubinemia",
author = "Mackenzie, {Peter I.} and Owens, {Ida S.} and Brian Burchell and Bock, {K. W.} and Amos Bairoch and Alain B{\'e}langer and Sylvie Fournel-Gigleux and Mitchell Green and Hum, {Dean W.} and Takashi Iyanagi and Doron Lancet and Pierre Louisot and Jacques Magdalou and Jayanta Roy-Chowdhury and Ritter, {Joseph K.} and Harry Schachter and Tephly, {Thomas R.} and Tipton, {Keith F.} and Nebert, {Daniel W.}",
year = "1997",
doi = "10.1097/00008571-199708000-00001",
language = "English (US)",
volume = "7",
pages = "255--269",
journal = "Pharmacogenetics and Genomics",
issn = "1744-6872",
publisher = "Lippincott Williams and Wilkins",
number = "4",

}

TY - JOUR

T1 - The UDP glycosyltransferase gene superfamily

T2 - Rcommended nomenclature update based on evolutionary divergence

AU - Mackenzie, Peter I.

AU - Owens, Ida S.

AU - Burchell, Brian

AU - Bock, K. W.

AU - Bairoch, Amos

AU - Bélanger, Alain

AU - Fournel-Gigleux, Sylvie

AU - Green, Mitchell

AU - Hum, Dean W.

AU - Iyanagi, Takashi

AU - Lancet, Doron

AU - Louisot, Pierre

AU - Magdalou, Jacques

AU - Roy-Chowdhury, Jayanta

AU - Ritter, Joseph K.

AU - Schachter, Harry

AU - Tephly, Thomas R.

AU - Tipton, Keith F.

AU - Nebert, Daniel W.

PY - 1997

Y1 - 1997

N2 - This review represents an update of the nomenclature system for the UDP glucuronosyltransferase gene superfamily, which is based on divergent evolution. Since the previous review in 1991, sequences of many related UDP glycosyltransferases from lower organisms have appeared in the database, which expand our database considerably. At latest count, in animals, yeast, plants and bacteria there are 110 distinct cDNAs/genes whose protein products all contain a characteristic 'signature sequence' and, thus, are regarded as members of the same superfamily. Comparison of a relatedness tree of proteins leads to the definition of 33 families, it should be emphasized that at least six cloned UDP-GlcNAc N-acetylglucosaminyltransferases are not sufficiently homologous to be included as members of this superfamily and may represent an example of convergent evolution. For naming each gene, it is recommended that the root symbol UGT for human (Ugt for mouse and Drosophila), denoting 'UDP glycosyltransferase,' be followed by an Arabic number representing the family a letter designating the subfamily, and an Arabic numeral denoting the individual gene within the family or subfamily, e.g, 'human UGT2B4' and mouse Ugt2b5'. We recommend the name 'UDP glycosyltransferase' because many of the proteins do not preferentially tially use UDP glucuronic acid, or their nucleotide sugar preference is unknown. Whereas the gene is italicized, the corresponding cDNA, transcript, protein and enzyme activity should be written with upper-case letters and without italics, e.g. 'human or mouse UGT1A1. 'The UGT1 gene (spanning > 500 kb) contains at least 12 promoters/first exons, which can be spliced and joined with common exons 2 through 5, leading to different N-terminal halves but identical C-terminal halves of the gene products; in this scheme each first exon is regarded as a distinct gene (e.g. UGT1A1, UGT1A2,... UGT1A12). When an orthologous gene between species cannot be identified with certainty, as occurs in the UGT2B subfamily, sequential naming of the genes is being carried out chronologically as they become characterized. We suggest that the Human Gene Nomenclature Guidelines (http://www.gene.acl.ac.uk/nomenclature/guidelines.html) be used for all species other than the mouse and Drosophila. Thirty published human UGT1A1 mutant alleles responsible for clinical hyperbilirubinemias are listed herein, and given numbers following an asterisk (e.g. UGT1A1*30) consistent with the Human Gene Nomenclature Guidelines. It is anticipated that this UGT gene nomenclature system will require updating on a regular basis.

AB - This review represents an update of the nomenclature system for the UDP glucuronosyltransferase gene superfamily, which is based on divergent evolution. Since the previous review in 1991, sequences of many related UDP glycosyltransferases from lower organisms have appeared in the database, which expand our database considerably. At latest count, in animals, yeast, plants and bacteria there are 110 distinct cDNAs/genes whose protein products all contain a characteristic 'signature sequence' and, thus, are regarded as members of the same superfamily. Comparison of a relatedness tree of proteins leads to the definition of 33 families, it should be emphasized that at least six cloned UDP-GlcNAc N-acetylglucosaminyltransferases are not sufficiently homologous to be included as members of this superfamily and may represent an example of convergent evolution. For naming each gene, it is recommended that the root symbol UGT for human (Ugt for mouse and Drosophila), denoting 'UDP glycosyltransferase,' be followed by an Arabic number representing the family a letter designating the subfamily, and an Arabic numeral denoting the individual gene within the family or subfamily, e.g, 'human UGT2B4' and mouse Ugt2b5'. We recommend the name 'UDP glycosyltransferase' because many of the proteins do not preferentially tially use UDP glucuronic acid, or their nucleotide sugar preference is unknown. Whereas the gene is italicized, the corresponding cDNA, transcript, protein and enzyme activity should be written with upper-case letters and without italics, e.g. 'human or mouse UGT1A1. 'The UGT1 gene (spanning > 500 kb) contains at least 12 promoters/first exons, which can be spliced and joined with common exons 2 through 5, leading to different N-terminal halves but identical C-terminal halves of the gene products; in this scheme each first exon is regarded as a distinct gene (e.g. UGT1A1, UGT1A2,... UGT1A12). When an orthologous gene between species cannot be identified with certainty, as occurs in the UGT2B subfamily, sequential naming of the genes is being carried out chronologically as they become characterized. We suggest that the Human Gene Nomenclature Guidelines (http://www.gene.acl.ac.uk/nomenclature/guidelines.html) be used for all species other than the mouse and Drosophila. Thirty published human UGT1A1 mutant alleles responsible for clinical hyperbilirubinemias are listed herein, and given numbers following an asterisk (e.g. UGT1A1*30) consistent with the Human Gene Nomenclature Guidelines. It is anticipated that this UGT gene nomenclature system will require updating on a regular basis.

KW - Evolution

KW - Gene nomenclature

KW - Glucuronidation

KW - Glycosylation

KW - Human genetics

KW - Hyperbilirubinemia

UR - http://www.scopus.com/inward/record.url?scp=8544224973&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=8544224973&partnerID=8YFLogxK

U2 - 10.1097/00008571-199708000-00001

DO - 10.1097/00008571-199708000-00001

M3 - Article

C2 - 9295054

AN - SCOPUS:8544224973

VL - 7

SP - 255

EP - 269

JO - Pharmacogenetics and Genomics

JF - Pharmacogenetics and Genomics

SN - 1744-6872

IS - 4

ER -