After Bonferroni correction, the results provide preliminary evidence that CTLA4 genetic variation at -1661 locus may render Iranian individuals to be more susceptible to MS, whereas harboring TACA haplotype might be protective.
Functional DNA sub-sequences and genome elements are spatially clustered through the genome just as keywords in literary texts. Therefore, some of the methods for ranking words in texts can also be used to compare different DNA sub-sequences. In analogy with the literary texts, here we claim that the distribution of distances between the successive sub-sequences (words) is q-exponential which is the distribution function in non-extensive statistical mechanics. Thus the q-parameter can be used as a measure of words clustering levels. Here, we analyzed the distribution of distances between consecutive occurrences of 16 possible dinucleotides in human chromosomes to obtain their corresponding q-parameters. We found that CG as a biologically important two-letter word concerning its methylation, has the highest clustering level. This finding shows the predicting ability of the method in biology. We also proposed that chromosome 18 with the largest value of q-parameter for promoters of genes is more sensitive to dietary and lifestyle. We extended our study to compare the genome of some selected organisms and concluded that the clustering level of CGs increases in higher evolutionary organisms compared to lower ones.
Here, we introduce PageRank (PR) in a seismic network as an appropriate alarming clue before the occurrence of the event to be worthwhile in hazard probabilistic evaluation of earthquakes. Studying PR changes of two main shocks in Iran and Italy by means of temporal and spatial windows reveals that their PR values increase drastically before the event, while there is no trend for other considered locations and/or other time intervals. Therefore, the PR value seems to be an appropriate index of a place induction by previous events and its susceptibility for having a new earthquake. Moreover, summing over the PRs of areas close to the Italy event location and tracking this newly defined PR behavior show an increasing trend before the main shock implying that the close regions are influenced and become highly connected before the event as well as the earthquake location itself. It is also indicated that PR behavior is not necessarily correlated to the number of occurring earthquakes and is inherently the result of points connectivity and interactions.
Background: Angiography, as the gold standard for the diagnosis of coronary artery disease, has made an attempt to predict coronary artery disease by comparing the efficiency of gene expression programming, as a new data mining technique, and artificial neural network, as a conventional technique. Besides, the study went further to present the results of feature selection based on stepwise backward elimination, classification and regression tree. Methods:The subjects were assessed for nine coronary artery disease risk factors to develop a prediction model for the disease. They included 13,288 patients who were chosen to undergo angiography for the diagnosis of coronary artery disease; from this sample, 4059 subjects were free from the disease while 9169 were suffering from it. Modeling was carried out based on gene expression programming and artificial neural network techniques. The Delong's test was then used to choose the final model based on the area under the Receiver Operating Characteristic (ROC) curve. Results:The model, developed based on artificial neural network, had AUC of 0.719, accuracy of 73.39%, sensitivity of 93.44% and specificity of 28.34%. On the other hand, the model, formulated based on gene expression programming, had AUC of 0.720, accuracy of 73.94%, sensitivity of 93.29% and specificity of 31.43%. Delong's test showed no significant difference between the two models (p value=0/789). Then, feature selection method was used to choose a model with four risk factors and an accuracy rate of 73.26%. Conclusion:Comparison of the results showed no significant difference between the two modeling techniques. The gene expression programming model was very easy to present and interpret; it could also be easily converted to other programming languages; so, with these features in mind, the researchers preferred to choose this technique. modeling, some setting initials are necessary, as can be seen in Table 2. Modeling based on ANN was done using a Multilayer Perceptron (MLP) neural network. Also, the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm, developed based on a quasi-Newton algorithm, was used for learning the network. This learning algorithm has a faster convergence rate than the gradient descend and the conjugate gradient algorithms and is one of the appropriate learning algorithms [26]. Since there is no equation for estimating parameters such as the number of neurons in the hidden layer, the layer activation function and error function of a neural network model could be adopted. So, with this point in mind, we created 100 neural network models by randomly selecting the parameter value, as can be seen in [29,30]. In line with this procedure, the stepwise backward elimination method was adopted to compare the results of ANN and GEP and to select the best possible model and technique. As such, the least important risk factors were also removed and the modeling process was carried out with the remaining risk factors. This process continued until there was no significant change in th...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.