2018
DOI: 10.1016/j.cels.2017.11.003
|View full text |Cite
|
Sign up to set email alerts
|

Quantitative Missense Variant Effect Prediction Using Large-Scale Mutagenesis Data

Abstract: Large datasets describing the quantitative effects of mutations on protein function are becoming increasingly available. Here, we leverage these datasets to develop Envision, which predicts the magnitude of a missense variant's molecular effect. Envision combines 21,026 variant effect measurements from nine large-scale experimental mutagenesis datasets, a hitherto untapped training resource, with a supervised, stochastic gradient boosting learning algorithm. Envision outperforms other missense variant effect p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

6
255
1
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 204 publications
(263 citation statements)
references
References 39 publications
6
255
1
1
Order By: Relevance
“…As additional data become available, mutfunc will be updated to improve coverage and future work could expand the set of mechanisms studied such as drug or small‐molecule binding sites, RNA‐binding interfaces, among others. The effects of variants on molecular and cellular phenotypes are increasingly being probed directly by large‐scale mutagenesis experiments (Fowler & Fields, ; Weile et al , ), which will likely result in improved variant effect prediction algorithms (Gray et al , ). The curation of such experimentally determined effects and the improved algorithms can be integrated in future iterations of mutfunc.…”
Section: Discussionmentioning
confidence: 99%
“…As additional data become available, mutfunc will be updated to improve coverage and future work could expand the set of mechanisms studied such as drug or small‐molecule binding sites, RNA‐binding interfaces, among others. The effects of variants on molecular and cellular phenotypes are increasingly being probed directly by large‐scale mutagenesis experiments (Fowler & Fields, ; Weile et al , ), which will likely result in improved variant effect prediction algorithms (Gray et al , ). The curation of such experimentally determined effects and the improved algorithms can be integrated in future iterations of mutfunc.…”
Section: Discussionmentioning
confidence: 99%
“…In addition, Envision, a machine learning model trained on large‐scale mutagenesis data sets (similar to this study) to predict the impact of missense variants on protein function, was also included as a newer baseline method (Gray, Hause, Luebeck, Shendure, & Fowler, ). No score transformation was necessary for Envision.…”
Section: Methodsmentioning
confidence: 99%
“…While these were not developed for the specific task of predicting effects of variants on protein stability, the VSP assay itself was originally developed as a general-purpose technique for the high-throughput identification of deleterious and benign variants and may thus be somewhat comparable with the effects these methods were developed to predict. For the baseline methods, score transformation was carried out (heuristically) to match the expected score range as follows In addition, Envision, a machine learning model trained on large-scale mutagenesis data sets (similar to this study) to predict the impact of missense variants on protein function, was also included as a newer baseline method (Gray, Hause, Luebeck, Shendure, & Fowler, 2018). No score transformation was necessary for Envision.…”
mentioning
confidence: 99%
“…Thus, it is infeasible to experimentally assess, for example, the effects of all non-synonymous Single Nucleotide Polymorphisms (nsSNPs) of a given individual, much less a population. However, the large-scale mutational fitness landscapes resulting from DMS analyses are an exciting resource for the development of new accurate variant effect prediction approaches (Gray, et al, 2018).…”
Section: Introductionmentioning
confidence: 99%