John Lazarsfeld scite author profile

A method to improve protein function prediction for sparsely annotated PPI networks is introduced. The method extends the DSD majority vote algorithm introduced by Cao et al. to give confidence scores on predicted labels and to use predictions of high confidence to predict the labels of other nodes in subsequent rounds. We call this a majority vote cascade. Several cascade variants are tested in a stringent cross-validation experiment on PPI networks from S. cerevisiae and D. melanogaster, and we show that for many different settings with several alternative confidence functions, cascading improves the accuracy of the predictions. A list of the most confident new label predictions in the two networks is also reported. Code, networks for the cross-validation experiments, and supplementary figures and tables appear at http://bcb.cs.tufts.edu/cascade.
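The cascading idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a precomputed pairwise distance table `dist` (standing in for DSD distances), a hypothetical confidence function equal to the winning label's share of the k nearest votes, and a hypothetical `threshold` above which a prediction is reused in later rounds. The abstract mentions several alternative confidence functions; only this one simple choice is shown.

```python
from collections import Counter


def majority_vote(node, known, dist, k):
    """Vote among the k labeled nodes closest to `node`.

    Returns (label, confidence), where confidence is the winning
    label's share of the votes (an assumed confidence function).
    """
    labeled = [v for v in known if v != node]
    nearest = sorted(labeled, key=lambda v: dist[node][v])[:k]
    if not nearest:
        return None, 0.0
    votes = Counter(known[v] for v in nearest)
    label, count = votes.most_common(1)[0]
    return label, count / len(nearest)


def cascade(labels, dist, nodes, k=3, threshold=0.8):
    """Label all nodes in rounds, reusing confident predictions as votes."""
    known = dict(labels)
    while True:
        unlabeled = [v for v in nodes if v not in known]
        if not unlabeled:
            break
        # Every prediction in a round sees the same set of known labels.
        preds = {v: majority_vote(v, known, dist, k) for v in unlabeled}
        confident = {
            v: lab
            for v, (lab, conf) in preds.items()
            if lab is not None and conf >= threshold
        }
        if not confident:
            # No prediction clears the threshold: accept the plain votes and stop.
            known.update(
                {v: lab for v, (lab, _) in preds.items() if lab is not None}
            )
            break
        known.update(confident)
    return known
```

On a toy path of six nodes with `dist[i][j] = abs(i - j)`, initial labels `{0: "A", 1: "A", 5: "B"}`, `k=2`, and `threshold=1.0`, node 2 is labeled confidently in the first round and then helps label node 3 in the second, which a single round of voting would leave tied.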


Approximate Majority With Catalytic Inputs

Amir¹, Aspnes², Lazarsfeld³

2020

Preprint


Third-state dynamics [AAE08, PVV09] is a well-known process for quickly and robustly computing approximate majority through interactions between randomly-chosen pairs of agents. In this paper, we consider this process in a new model with persistent-state catalytic inputs, as well as in the presence of transient leak faults. Based on models considered in recent protocols for populations with persistent-state agents [DK18, ADK+17, ATU20], we formalize a Catalytic Input (CI) model comprising n input agents and m worker agents. For m = Θ(n), we show that computing the parity of the input population with high probability requires at least Ω(n²) total interactions, demonstrating a strong separation between the CI and standard population protocol models. On the other hand, we show that the third-state dynamics can be naturally adapted to this new model to solve approximate majority in O(n log n) total steps with high probability when the input margin is Ω(√(n log n)), which preserves the time and space efficiency of the corresponding protocol in the original model. We then show the robustness of third-state dynamics protocols to the transient leak faults considered by [ADK+17, ATU20]. In both the original and CI models, these protocols successfully compute approximate majority with high probability in the presence of leaks occurring at each time step with probability β ≤ O(√(n log n)/n). The resilience of these dynamics to adversarial leaks exhibits a subtle connection to previous results involving Byzantine agents.
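For readers unfamiliar with the underlying process, the following is a minimal simulation sketch of third-state dynamics in the original population protocol model, not the CI model or the leak-fault setting studied in the paper. The state names `"X"`, `"Y"`, and `"B"` (blank), the function name, and the `max_steps` cap are illustrative assumptions. At each step a random ordered pair (initiator, responder) interacts: an opinionated initiator converts a blank responder to its opinion, and blanks a responder holding the opposite opinion.

```python
import random


def third_state_majority(num_x, num_y, max_steps=None, seed=0):
    """Simulate three-state approximate majority on num_x + num_y agents.

    Returns (winner, steps), where winner is "X" or "Y", or None if
    consensus is not reached within max_steps interactions.
    """
    rng = random.Random(seed)
    agents = ["X"] * num_x + ["Y"] * num_y
    n = len(agents)
    counts = {"X": num_x, "Y": num_y, "B": 0}
    if max_steps is None:
        # Generous cap, well above the O(n log n) steps expected in theory.
        max_steps = 200 * n * max(1, n.bit_length())
    for step in range(1, max_steps + 1):
        i, j = rng.sample(range(n), 2)  # i initiates, j responds
        a, b = agents[i], agents[j]
        if a == "B" or a == b:
            continue  # blank initiators and agreeing pairs change nothing
        # Blank responder adopts the initiator's opinion;
        # an opposing responder becomes blank.
        new_state = a if b == "B" else "B"
        counts[b] -= 1
        counts[new_state] += 1
        agents[j] = new_state
        if counts["X"] == n:
            return "X", step
        if counts["Y"] == n:
            return "Y", step
    return None, max_steps
```

Calling `third_state_majority(90, 10)` starts from a margin far larger than √(n log n) for n = 100, so the initial majority opinion is overwhelmingly likely to win. Varying `seed` and shrinking the margin gives an informal feel for where the guarantee stops applying.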


Majority Vote Cascading: A Semi-Supervised Framework for Improving Protein Function Prediction
