The problem of discovering previously unknown frequent patterns in time series, also called motifs, has been recently introduced. A motif is a subseries pattern that appears a significant number of times. Results demonstrate that motifs may provide valuable insights about the data and have a wide range of applications in data mining tasks. The main motivation for this study was the need to mine time series data from protein folding/unfolding simulations. We propose an algorithm that extracts approximate motifs, i.e. motifs that capture portions of time series with a similar and eventually symmetric behavior. Preliminary results on the analysis of protein unfolding data support this proposal as a valuable tool. Additional experiments demonstrate that the application of utility of our algorithm is not limited to this particular problem. Rather it can be an interesting tool to be applied in many real world problems.
We present a systematic test of the performance of three popular united-atom force fields-OPLS-UA, GROMOS and TraPPE-at predicting hydrophobic solvation, more precisely at describing the solvation of alkanes in alkanes. Gibbs free energies of solvation were calculated for 52 solute/solvent pairs from Molecular Dynamics simulations and thermodynamic integration making use of the IBERCIVIS volunteer computing platform. Our results show that all force fields yield good predictions when both solute and solvent are small linear or branched alkanes (up to pentane). However, as the size of the alkanes increases, all models tend to increasingly deviate from experimental data in a systematic fashion. Furthermore, our results confirm that specific interaction parameters for cyclic alkanes in the united-atom representation are required to account for the additional excluded volume within the ring. Overall, the TraPPE model performs best for all alkanes, but systematically underpredicts the magnitude of solvation free energies by about 6% (RMSD of 1.2 kJ/mol). Conversely, both GROMOS and OPLS-UA systematically overpredict solvation free energies (by ∼13% and 15%, respectively). The systematic trends suggest that all models can be improved by a slight adjustment of their Lennard-Jones parameters. © 2016 Wiley Periodicals, Inc.
Systemic lupus erythematosus (SLE) affects predominantly women at reproductive age but may present at any age. Age at disease onset has a modulating effect on presentation and course of disease, but controversies persist regarding its impact on long-term outcome. Our aims were to characterize clinical features, co-morbidities and cumulative damage in childhood-onset, adult-onset and late-onset SLE. Patients with childhood-onset SLE fulfilling ACR 1997 criteria were identified in a nationwide register-Reuma.pt/SLE (N = 89) and compared with adult-onset and late-onset counterparts matched 1:1:1 for disease duration. 267 SLE patients with mean disease duration of 11.9 ± 9.3 years were analyzed. Skin (62 %), kidney (58 %), neurological (11 %) and hematologic involvement (76 %) were significantly more common in childhood-onset SLE and disease activity was higher in this subset than in adult- and late-onset disease (SLEDAI-2K 3.4 ± 3.8 vs. 2.2 ± 2.7 vs. 1.6 ± 2.8, respectively; p = 0.004). Also, more childhood-onset patients received cyclophosphamide (10 %) and mycophenolate mofetil (34 %). A greater proportion of women (96 %), prevalence of arthritis (89 %) and anti-SSA antibodies (34 %) were noted in the adult-onset group. There was a significant delay in the diagnosis of SLE in older ages. Co-morbidities such as hypertension, diabetes and thyroid disease were significantly more frequent in late-onset SLE, as well as the presence of irreversible damage evaluated by the SLICC/ACR damage index (20 vs. 26 vs. 40 %; p < 0.001). Greater organ involvement as well as the frequent need for immunosuppressants supports the concept of childhood-onset being a more severe disease. In contrast, disease onset is more indolent but co-morbidity burden and irreversible damage are greater in late-onset SLE, which may have implications for patients' management.
Protein aggregation into insoluble fibrillar structures known as amyloid characterizes several neurodegenerative diseases, including Alzheimer's, Huntington's and Creutzfeldt-Jakob. Transthyretin (TTR), a homotetrameric plasma protein, is known to be the causative agent of amyloid pathologies such as FAP (familial amyloid polyneuropathy), FAC (familial amyloid cardiomiopathy) and SSA (senile systemic amyloidosis). It is generally accepted that TTR tetramer dissociation and monomer partial unfolding precedes amyloid fibril formation. To explore the TTR unfolding landscape and to identify potential intermediate conformations with high tendency for amyloid formation, we have performed molecular dynamics unfolding simulations of WT-TTR and L55P-TTR, a highly amyloidogenic TTR variant. Our simulations in explicit water allow the identification of events that clearly discriminate the unfolding behavior of WT and L55P-TTR. Analysis of the simulation trajectories show that (i) the L55P monomers unfold earlier and to a larger extent than the WT; (ii) the single a-helix in the TTR monomer completely unfolds in most of the L55P simulations while remain folded in WT simulations; (iii) L55P forms, early in the simulations, aggregation-prone conformations characterized by full displacement of strands C and D from the main b-sandwich core of the monomer; (iv) L55P shows, late in the simulations, severe loss of the H-bond network and consequent destabilization of the CBEF b-sheet of the b-sandwich; (v) WT forms aggregationcompatible conformations only late in the simulations and upon extensive unfolding of the monomer. These results clearly show that, in comparison with WT, L55P-TTR does present a much higher probability of forming transient conformations compatible with aggregation and amyloid formation.
One of the central challenges in structural molecular biology today is the protein folding problem, i.e. the acquisition of the 3D structure of a protein from its linear sequence of amino-acids. Different computational approaches to study protein folding and protein unfolding have recently become common tools available to the researcher. However, due to the lack of appropriate infrastructures, it is very difficult to directly compare simulations performed by different groups, with different methods, in different experimental conditions or for different proteins. Thus, we set out to create a public data repository with the goal of addressing the problem of comparison, analysis and sharing of information and data on protein folding and protein unfolding simulations. The P-found system for protein folding and protein unfolding simulations is presented. At the moment, the data repository allows uploading of molecular dynamics (MD) protein folding and unfolding simulations, calculates and stores several time series with the variation over time of pre-defined molecular properties, and allows searching and downloading of these data. In the near future, simulations performed by other than MD methods may be uploaded, and data mining techniques for analysis and comparison of multiple simulations will be implemented. The home page for the P-found system is accessible at http://www.p-found.org.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.