“…J is then evaluated at the nodes at a given level of the tree; if J < B for a given node, then J need not be evaluated at nodes under that node (J decreases as we proceed down the tree, since fewer features are then used, i.e., more features are omitted). To further reduce the searching space, some modifications include [11]: efficient evaluation of paths through the tree with no branches, starting searches at levels 1/4 and 1/2 of the way down the tree, ordering the features (nodes) by which are best using forward selection, and calculation of the initial bound B using the m best features selected by forward selection. At the upper levels of the search tree, we do not expect J < B, since only one or two features (the best ones by forward selection) are omitted.…”