This chapter takes advantage of using fuzzy classifier rules to capture the correlations between genes. The main motivation to conduct this study is that a fuzzy classifier rule is essentially an “if-then” rule that contains linguistic terms to represent the feature values. This representation of a rule that demonstrates the correlations among the genes is very simple to understand and interpret for domain experts. In this proposed gene selection procedure, instead of measuring the effectiveness of every single gene for building the classifier model, the authors incorporate the impotence of a gene correlation with other existing genes in the process of gene selection. That is, a gene is rejected if it is not in a significant correlation with other genes in the dataset. Furthermore, in order to improve the reliability of this approach, the process is repeated several times in these experiments, and the genes reported as the result are the genes selected in most experiments. This chapter reports test results on ?ve datasets and analyzes the achieved results from biological perspective.
A heavy path in a weighted graph represents a notion of connectivity and ordering that goes beyond two nodes. The heaviest path of length in the graph, simply means a sequence of nodes with edges between them, such that the sum of edge weights is maximum among all paths of length . It is trivial to state the heaviest edge in the graph is the heaviest path of length 1, that represents a heavy connection between (any) two existing nodes. This can be generalized in many different ways for more than two nodes, one of which is finding the heavy weight paths in the graph. In an influence network, this represents a highway for spreading information from a node to one of its indirect neighbors at distance . Moreover, a heavy path implies an ordering of nodes. For instance, we can discover which ordering of songs (tourist spots) on a playlist (travel itinerary) is more pleasant to a user or a group of users who enjoy all songs (tourist spots) on the playlist (itinerary). This can also serve as a hard optimization problem, maximizing different types of quantities of a path such as score, flow, probability or surprise, defined as edge weight. Therefore, if one can solve the Heavy Path Problem (HPP) efficiently, they can as well use HPP for modeling and reduce other complex problems to it. More precisely, we aim at finding k heaviest (top-k) paths of a given length . The weight of a path is defined as a monotone aggregation of individual edge weights using functions such as sum or product. We argue finding simple paths is way more practical than finding paths with cycles in applications such as: routing, playlist recommendation, itinerary planning and influence maximization. Avoiding cycles results in lists without repeated items and with better diversity. Simple paths are also expected to have a higher utility in practice. This makes HPP NP-hard and inapproximable. We propose an efficient algorithm that despite its exponential (theoretical) worst case running time, achieves the exact answer of the NP-hard problem in many useful cases and study the problem from different perspectives.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.