The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B-cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in-depth studies are possible. However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence-based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. The new database is accessible at http://opig.stats.ox.ac.uk/webapps/oas/, and all data are freely available for download.
Motivation General protein language models have been shown to summarise the semantics of protein sequences into representations that are useful for state-of-the-art predictive methods. However, for antibody specific problems, such as restoring residues lost due to sequencing errors, a model trained solely on antibodies may be more powerful. Antibodies are one of the few protein types where the volume of sequence data needed for such language models is available, for example in the Observed Antibody Space (OAS) database. Results Here, we introduce AbLang, a language model trained on the antibody sequences in the OAS database. We demonstrate the power of AbLang by using it to restore missing residues in antibody sequence data, a key issue with B-cell receptor repertoire sequencing, for example over 40% of OAS sequences are missing the first 15 amino acids. AbLang restores the missing residues of antibody sequences better than using IMGT germlines or the general protein language model ESM-1b. Further, AbLang does not require knowledge of the germline of the antibody and is seven times faster than ESM-1b. Availability AbLang is a python package available at https://github.com/oxpig/AbLang. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Dietary antioxidants are an important preservative in food and have been suggested to help in disease prevention. With consumer demands for less synthetic and safer additives in food products, the food industry is searching for antioxidants that can be marketed as natural. Peptides derived from natural proteins show promise, as they are generally regarded as safe and potentially contain other beneficial bioactivities. Antioxidative peptides are usually obtained by testing various peptides derived from hydrolysis of proteins by a selection of proteases. This slow and cumbersome trial-and-error approach to identify antioxidative peptides has increased interest in developing computational approaches for prediction of antioxidant activity and thereby reduce laboratory work. A few antioxidant predictors exist, however, no tool predicting the antioxidative properties of peptides is, to the best of our knowledge, currently available as a web-server. We here present the AnOxPePred tool and web-server (http://services.bioinformatics.dtu.dk/service.php?AnOxPePred-1.0) that uses deep learning to predict the antioxidant properties of peptides. Our model was trained on a curated dataset consisting of experimentally-tested antioxidant and non-antioxidant peptides. For a variety of metrics our method displays a prediction performance better than a k-NN sequence identity-based approach. Furthermore, the developed tool will be a good benchmark for future predictors of antioxidant peptides.
In this work, we developed a novel approach combining bioinformatics, testing of functionality and bottom-up proteomics to obtain peptide emulsifiers from potato side-streams. This is a significant advancement in the process to obtain emulsifier peptides and it is applicable to any type of protein.Our results indicated that structure at the interface is the major determining factor of the emulsifying activity of peptide emulsifiers. Fish oil-in-water emulsions with high physical stability were stabilized with peptides to be predicted to have facial amphiphilicity: (i) peptides with predominantly α-helix conformation at the interface and having 18-29 amino acids, and (ii) peptides with predominantly β-strand conformation at the interface and having 13-15 amino acids. In addition, high physically stable emulsions were obtained with peptides that were predicted to have axial hydrophobic/hydrophilic regions. Peptides containing the sequence FCLKVGV showed high in vitro antioxidant activity and led to emulsions with high oxidative stability. Peptide-level proteomics data and sequence analysis revealed the feasibility to obtain the potent emulsifier peptides found in this study (e.g. γ-1) by trypsin-based hydrolysis of different side streams in the potato industry.A considerable number of commercial products are oil-in-water emulsions (e.g. food, pharmaceutical, cosmetics) 1 . In addition, aqueous-based food products are enriched with hydrophobic bioactives (i.e., omega-3, vitamins A, D, E, carotenoids, flavonoids or curcumin) by using oil-in-water emulsions as delivery systems 2 . Nevertheless, oil-in-water emulsions are thermodynamically unstable systems. They tend to separate over time into their components (oil and water) due to several physical destabilization mechanisms such as creaming, flocculation, coalescence, and Ostwald ripening 3 . Emulsifiers are the most common stabilizers used in emulsions production since: (i) they facilitate emulsion formation (e.g., by reducing interfacial tension at the oil-water interface), and (ii) they provide physical stability to the emulsion (i.e., by strong steric and/or electrostatic repulsive forces) 4 . Moreover, emulsifiers also have an influence on the chemical stability of emulsions (e.g. oxidative stability) by determining the properties of the oil-water interface (i.e., thickness, porosity, charge, antioxidant activity). Indeed, these interfacial properties play a critical role on the interaction between oil and prooxidants such as radicals, oxygen and trace metals 5 .Milk proteins such as casein and whey protein are common emulsifiers used for food oil-in-water emulsions due to their excellent functional and antioxidant properties, which lead to physically and oxidatively stable emulsions 6 . Nonetheless, there is an increasing trend to replace animal proteins by plant or microbial proteins in vegetarian or vegan products, as well as to enhance food sustainability 7 . Different approaches have been suggested
The interaction between the class I major histocompatibility complex (MHC), the peptide presented by the MHC and the T-cell receptor (TCR) is a key determinant of the cellular immune response. Here, we present TCRpMHCmodels, a method for accurate structural modelling of the TCR-peptide-MHC (TCR-pMHC) complex. This TCR-pMHC modelling pipeline takes as input the amino acid sequence and generates models of the TCR-pMHC complex, with a median Cα RMSD of 2.31 Å. TCRpMHCmodels significantly outperforms TCRFlexDock, a specialised method for docking pMHC and TCR structures. TCRpMHCmodels is simple to use and the modelling pipeline takes, on average, only two minutes. Thanks to its ease of use and high modelling accuracy, we expect TCRpMHCmodels to provide insights into the underlying mechanisms of TCR and pMHC interactions and aid in the development of advanced T-cell-based immunotherapies and rational design of vaccines. The TCRpMHCmodels tool is available at http://www.cbs.dtu.dk/services/TCRpMHCmodels/.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.