Shuqing Li scite author profile

In deep neural nets, lower level embedding layers account for a large portion of the total number of parameters. Tikhonov regularization, graph-based regularization, and hard parameter sharing are approaches that introduce explicit biases into training in a hope to reduce statistical complexity. Alternatively, we propose stochastically shared embeddings (SSE), a data-driven approach to regularizing embedding layers, which stochastically transitions between embeddings during stochastic gradient descent (SGD). Because SSE integrates seamlessly with existing SGD algorithms, it can be used with only minor modifications when training large scale neural networks. We develop two versions of SSE: SSE-Graph using knowledge graphs of embeddings; SSE-SE using no prior information. We provide theoretical guarantees for our method and show its empirical effectiveness on 6 distinct tasks, from simple neural networks with one hidden layer in recommender systems, to the transformer and BERT in natural languages. We find that when used along with widely-used regularization methods such as weight decay and dropout, our proposed SSE can further reduce overfitting, which often leads to more favorable generalization results.Preprint. Under review.

show abstract

Hemodynamic effect of apelin in a canine model of acute pulmonary thromboembolism

Feng

et al. 2010

Peptides

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shuqing Li

Lentivirus mediated IL-17R blockade improves diastolic cardiac function in spontaneously hypertensive rats

Lanostane triterpenoids from the fungus Ceriporia lacerate associated with Acanthaster planci

Toll-like receptor 4 is involved in ischemic tolerance of postconditioning in hippocampus of tree shrews to thrombotic cerebral ischemia

Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers

Hemodynamic effect of apelin in a canine model of acute pulmonary thromboembolism

Contact Info

Product

Resources

About