Existing methods for predicting protein crystallization obtain high accuracy using various types of complemented features and complex ensemble classifiers, such as support vector machine (SVM) and Random Forest classifiers. It is desirable to develop a simple and easily interpretable prediction method with informative sequence features to provide insights into protein crystallization. This study proposes an ensemble method, SCMCRYS, to predict protein crystallization, for which each classifier is built by using a scoring card method (SCM) with estimating propensity scores of p-collocated amino acid (AA) pairs (p = 0 for a dipeptide). The SCM classifier determines the crystallization of a sequence according to a weighted-sum score. The weights are the composition of the p-collocated AA pairs, and the propensity scores of these AA pairs are estimated using a statistic with optimization approach. SCMCRYS predicts the crystallization using a simple voting method from a number of SCM classifiers. The experimental results show that the single SCM classifier utilizing dipeptide composition with accuracy of 73.90% is comparable to the best previously-developed SVM-based classifier, SVM_POLY (74.6%), and our proposed SVM-based classifier utilizing the same dipeptide composition (77.55%). The SCMCRYS method with accuracy of 76.1% is comparable to the state-of-the-art ensemble methods PPCpred (76.8%) and RFCRYS (80.0%), which used the SVM and Random Forest classifiers, respectively. This study also investigates mutagenesis analysis based on SCM and the result reveals the hypothesis that the mutagenesis of surface residues Ala and Cys has large and small probabilities of enhancing protein crystallizability considering the estimated scores of crystallizability and solubility, melting point, molecular weight and conformational entropy of amino acids in a generalized condition. The propensity scores of amino acids and dipeptides for estimating the protein crystallizability can aid biologists in designing mutation of surface residues to enhance protein crystallizability. The source code of SCMCRYS is available at http://iclab.life.nctu.edu.tw/SCMCRYS/.
Background Existing methods for predicting protein solubility on overexpression in Escherichia coli advance performance by using ensemble classifiers such as two-stage support vector machine (SVM) based classifiers and a number of feature types such as physicochemical properties, amino acid and dipeptide composition, accompanied with feature selection. It is desirable to develop a simple and easily interpretable method for predicting protein solubility, compared to existing complex SVM-based methods. Results This study proposes a novel scoring card method (SCM) by using dipeptide composition only to estimate solubility scores of sequences for predicting protein solubility. SCM calculates the propensities of 400 individual dipeptides to be soluble using statistic discrimination between soluble and insoluble proteins of a training data set. Consequently, the propensity scores of all dipeptides are further optimized using an intelligent genetic algorithm. The solubility score of a sequence is determined by the weighted sum of all propensity scores and dipeptide composition. To evaluate SCM by performance comparisons, four data sets with different sizes and variation degrees of experimental conditions were used. The results show that the simple method SCM with interpretable propensities of dipeptides has promising performance, compared with existing SVM-based ensemble methods with a number of feature types. Furthermore, the propensities of dipeptides and solubility scores of sequences can provide insights to protein solubility. For example, the analysis of dipeptide scores shows high propensity of α-helix structure and thermophilic proteins to be soluble. Conclusions The propensities of individual dipeptides to be soluble are varied for proteins under altered experimental conditions. For accurately predicting protein solubility using SCM, it is better to customize the score card of dipeptide propensities by using a training data set under the same specified experimental conditions. The proposed method SCM with solubility scores and dipeptide propensities can be easily applied to the protein function prediction problems that dipeptide composition features play an important role. Availability The used datasets, source codes of SCM, and supplementary files are available at http://iclab.life.nctu.edu.tw/SCM/.
Femoral neck fracture is common in the elderly, and its impact has increased in aging societies. Comorbidities, poor levels of activity and pain may contribute to the development of depression, but these factors have not been well addressed. This study aims to investigate the frequency and risk of major depression after a femoral neck fracture using a nationwide population-based study. The Taiwan Longitudinal Health Insurance Database was used in this study. A total of 4,547 patients who were hospitalized for femoral neck fracture within 2003 to 2007 were recruited as a study group; 13,641 matched non-fracture participants were enrolled as a comparison group. Each patient was prospectively followed for 3 years to monitor the occurrence of major depression. Cox proportional-hazards models were used to compute the risk of major depression between members of the study and comparison group after adjusting for residence and socio-demographic characteristics. The most common physical comorbidities that were present after the fracture were also analyzed. The incidences of major depression were 1.2% (n = 55) and 0.7% (n = 95) in the study and comparison groups, respectively. The stratified Cox proportional analysis showed a covariate-adjusted hazard ratio of major depression among patients with femoral neck fracture that was 1.82 times greater (95% CI, 1.30–2.53) than that of the comparison group. Most major depressive episodes (34.5%) presented within the first 200 days following the fracture. In conclusion, patients with a femoral neck fracture are at an increased risk of subsequent major depression. Most importantly, major depressive episodes mainly occurred within the first 200 days following the fracture.
BackgroundBronchial asthma influences some chronic diseases such as coronary heart disease, diabetes mellitus, and hypertension, but the impact of asthma on vital diseases such as chronic kidney disease is not yet verified. This study aims to clarify the association between bronchial asthma and the risk of developing chronic kidney disease.MethodsThe National Health Research Institute provided a database of one million random subjects for the study. A random sample of 141 064 patients aged ≥18 years without a history of kidney disease was obtained from the database. Among them, there were 35 086 with bronchial asthma and 105 258 without asthma matched for sex and age for a ration of 1:3. After adjusting for confounding risk factors, a Cox proportional hazards model was used to compare the risk of developing chronic kidney disease during a three-year follow-up period.ResultsOf the subjects with asthma, 2 196 (6.26%) developed chronic kidney disease compared to 4 120 (3.91%) of the control subjects. Cox proportional hazards regression analysis revealed that subjects with asthma were more likely to develop chronic kidney disease (hazard ratio [HR]: 1.56; 95% CI: 1.48-1.64; p < 0.001). After adjusting for sex, age, monthly income, urbanization level, geographic region, diabetes mellitus, hypertension, hyperlipidemia, and steroid use, the HR for asthma patients was 1.40 (95% CI: 1.33-1.48; p = 0.040). There was decreased HRs in steroid use (HR: 0.56; 95% CI: 0.62-0.61; p < 0.001) in the development of chronic kidney disease. Expectorants, bronchodilators, anti-muscarinic agents, airway smooth muscle relaxants, and leukotriene receptor antagonists may also be beneficial in attenuating the risk of chronic kidney disease.ConclusionsPatients with bronchial asthma may have increased risk of developing chronic kidney disease. The use of steroids or non-steroidal drugs in the treatment of asthma may attenuate this risk.
ObjectiveThe impact of schizophrenia on vital diseases, such as chronic kidney disease (CKD), has not as yet been verified. This study aims to establish whether there is an association between schizophrenia and CKD.DesignA nationwide matched cohort study.SettingTaiwan's National Health Insurance Research Database.ParticipantsA total of 2338 patients with schizophrenia, and 7014 controls without schizophrenia (1:3), matched cohort for sex, age group, geography, urbanisation and monthly income, between 1 January 2003 and 31 December 2007, based on the International Classifications of Disease Ninth Edition (ICD-9), Clinical Modification codes.Primary and secondary outcome measuresAfter making adjustments for confounding risk factors, a Cox proportional hazards model was used to compare the risk of developing CKD during a 3-year follow-up period from the index date.ResultsOf the 2338-subject case cohort, 163 (6.97%) developed a CKD, as did 365 (5.20%) of the 7014 control participants. Cox proportional hazards regression analysis revealed that patients with schizophrenia were more likely to develop CKD (HR=1.36, 95% CI 1.13 to 1.63; p<0.001). After adjusting for gender, age group, hypertension, diabetes mellitus, hyperlipidaemia, heart disease and non-steroid anti-inflammatory drugs (NSAIDs) usage, the HR for patients with schizophrenia was 1.25 (95% CI 1.04 to 1.50; p<0.05). Neither typical nor atypical antipsychotics was associated an increased risk of CKD in patients with schizophrenia.ConclusionsThe findings from this population-based retrospective cohort study suggest that schizophrenia is associated with a 25% increase in the risk of developing CKD within only a 3-year follow-up period.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.