“…Simplified phylogenetic tree of the amino acid sequences of eukaryotic GH1 proteins with known structures and those of rice and Arabidopsis GH1 gene products. The protein sequences of the eukaryotic proteins with known structures are marked with four-character PDB codes for one of their structures, including Trifolium repens cyanogenic β-glucosidase (1CBG; Barrett et al, 1995), Sinapis alba myrosinase (1MYR; Burmeister et al, 1997), Zea mays ZmGlu1 β-glucosidase (1E1F; Czjzek et al, 2000), Sorghum bicolor Dhr1 dhurrinase (1V02; Verdoucq et al, 2004), Triticum aestivum β-glucosidase (2DGA; Sue et al, 2006), Rauvolfia serpentina strictosidine β-glucosidase (2JF6; Barleben et al, 2007), and Oryza sativa Os3BGlu7 (BGlu1) β-glucosidase (2RGL; Chuenchor et al, 2008) from plants, along with Brevicoryne brassicae myrosinase (1WCG; Husebye et al, 2005), Homo sapiens cytoplasmic (Klotho) β-glucosidase (2E9M; Hayashi et al, 2007), and Phanerochaete chrysosporium (2E3Z; Nijikken et al, 2007), while those encoded in the Arabidopsis and rice genomes are labeled with the systematic names given by Xu et al (2004) and Opassiri et al (2006), respectively. One or two example proteins from each plant are given for each of the eight clusters of genes shared by Arabidopsis (At) and rice (Os) and the Arabidopsis-specific clusters At I (β-glucosidases) and At II (myrosinases), with the number of Arabidopsis or rice enzymes in each cluster given in parentheses.…”