Social media (e.g., Twitter, Facebook, Flickr, YouTube) and other services with user-generated content have made a staggering amount of information (and misinformation) available. Government officials seek to leverage these resources to improve services and communication with citizens. Yet the sheer volume of social data streams generates substantial noise that must be filtered out. Nonetheless, these streams offer the potential to identify issues in real time, so that emergency managers can monitor and respond to issues concerning public safety. By detecting meaningful patterns and trends in the stream of messages and information flow, events can be identified as spikes in activity, while their meaning can be deciphered through changes in content. This paper presents findings from a pilot study we conducted between June and December 2010 with government officials in Arlington, Virginia (and the greater National Capital Region around Washington, DC), with a view to understanding the use of social media by government officials as well as community organizations, businesses, and the public. We are especially interested in understanding social media use in crisis situations (whether severe or fairly common, such as traffic or weather crises).
In distributed training of deep neural networks, parallel mini-batch SGD is widely used to speed up training with multiple workers. The workers sample local stochastic gradients in parallel, a single server aggregates all gradients to obtain their average, and each worker's local model is updated by an SGD step with the averaged gradient.¹ Ideally, parallel mini-batch SGD achieves a linear speed-up of the training time (with respect to the number of workers) compared with SGD on a single worker. In practice, however, this linear scalability is significantly limited by the growing demand for gradient communication as more workers are involved. Model averaging, which periodically averages the individual models trained on parallel workers, is another common practice for distributed training of deep neural networks since (Zinkevich et al. 2010; McDonald, Hall, and Mann 2010). Compared with parallel mini-batch SGD, its communication overhead is significantly reduced. A large body of experimental work has verified that model averaging can still achieve a good training speed-up as long as the averaging interval is carefully controlled. However, it remains a mystery in theory why such a simple heuristic works so well. This paper provides a thorough and rigorous theoretical study of why model averaging can work as well as parallel mini-batch SGD with significantly less communication overhead.
¹ Equivalently, the server can update its solution using the averaged gradient and broadcast this solution to all local workers. Another equivalent implementation is to let each worker take a single SGD step using its own gradient and send the updated local solution to the server; the server then averages all workers' updated solutions and refreshes each worker's local solution with the averaged version.
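As a concrete illustration, the following is a minimal sketch, not the paper's implementation, of model averaging (local SGD) on a toy least-squares problem: each of K workers runs SGD on its own data shard, and models are averaged only every avg_interval steps, whereas parallel mini-batch SGD would communicate gradients at every step. The data, objective, and hyperparameters here are illustrative assumptions.

```python
# Minimal sketch of local SGD with periodic model averaging (illustrative only).
import numpy as np

rng = np.random.default_rng(0)
K, d, n_per_worker = 4, 10, 256          # workers, dimension, samples per worker
lr, steps, avg_interval = 0.05, 200, 8   # average models every 8 local steps

# Synthetic least-squares data, one shard per worker.
w_true = rng.normal(size=d)
X = [rng.normal(size=(n_per_worker, d)) for _ in range(K)]
y = [Xk @ w_true + 0.1 * rng.normal(size=n_per_worker) for Xk in X]

def stochastic_grad(w, Xk, yk, batch=16):
    # Mini-batch gradient of the least-squares loss on one worker's shard.
    idx = rng.integers(len(yk), size=batch)
    Xb, yb = Xk[idx], yk[idx]
    return Xb.T @ (Xb @ w - yb) / batch

w_local = [np.zeros(d) for _ in range(K)]
for t in range(1, steps + 1):
    for k in range(K):                        # each worker takes a local SGD step
        w_local[k] -= lr * stochastic_grad(w_local[k], X[k], y[k])
    if t % avg_interval == 0:                 # communicate only every avg_interval steps
        w_avg = sum(w_local) / K
        w_local = [w_avg.copy() for _ in range(K)]

w_final = sum(w_local) / K
print("distance to w_true:", np.linalg.norm(w_final - w_true))
```

Setting avg_interval to 1 recovers the per-step communication pattern of parallel mini-batch SGD (in its model-averaging form), which makes the communication savings of larger intervals explicit.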
Traditional approaches to ACE event extraction usually depend on manually annotated data, which is laborious to create and limited in size. Therefore, in addition to the difficulty of event extraction itself, insufficient training data hinders learning as well. To promote event extraction, we first propose an event extraction model that overcomes the role overlap problem by separating argument prediction according to roles. Moreover, to address the problem of insufficient training data, we propose a method that automatically generates labeled data by editing prototypes and screens out low-quality generated samples by ranking their quality. Experiments on the ACE2005 dataset demonstrate that our extraction model surpasses most existing extraction methods, and incorporating our generation method yields a further significant improvement. It obtains new state-of-the-art results on the event extraction task, pushing the F1 score of trigger classification to 81.1% and that of argument classification to 58.9%.
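To make the role-separation idea concrete, below is a minimal sketch based on our reading of the abstract rather than the authors' released code: argument prediction uses one independent binary head per role, so a single candidate span can fill several roles at once (the overlap case). The encoder is omitted, and the hidden size, role inventory, and span representations are placeholder assumptions.

```python
# Sketch of per-role argument prediction with independent binary heads (illustrative).
import torch
import torch.nn as nn

class PerRoleArgumentClassifier(nn.Module):
    def __init__(self, hidden_size: int, num_roles: int):
        super().__init__()
        # One independent binary classifier per role.
        self.role_heads = nn.ModuleList(
            [nn.Linear(hidden_size, 1) for _ in range(num_roles)]
        )

    def forward(self, span_reprs: torch.Tensor) -> torch.Tensor:
        # span_reprs: (num_spans, hidden_size) candidate argument representations.
        # Returns (num_spans, num_roles) independent probabilities, so the same
        # span may score highly for more than one role.
        logits = torch.cat([head(span_reprs) for head in self.role_heads], dim=-1)
        return torch.sigmoid(logits)

# Illustrative usage with random span representations.
model = PerRoleArgumentClassifier(hidden_size=768, num_roles=5)
probs = model(torch.randn(12, 768))   # 12 candidate spans, 5 hypothetical roles
print(probs.shape)                    # torch.Size([12, 5])
```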
In conventional resting-state functional MRI (R-fMRI) analysis, functional connectivity is assumed to be temporally stationary, overlooking neural activities or interactions that may happen within the scan duration. Dynamic changes in neural interactions can be reflected by variations in topology and correlation strength across temporally correlated functional connectivity networks, and these networks may capture subtle yet short neural connectivity disruptions induced by disease pathologies. Accordingly, we are motivated to utilize disrupted temporal network properties to improve control-patient classification performance. Specifically, a sliding-window approach is first employed to generate a sequence of overlapping R-fMRI sub-series. Based on these sub-series, sliding-window correlations, which characterize the neural interactions between brain regions, are then computed to construct a series of temporal networks. Estimating these temporal networks individually with conventional network construction approaches fails to account for the intrinsic temporal smoothness among successive overlapping R-fMRI sub-series. To preserve this temporal smoothness, we propose to jointly estimate the temporal networks by maximizing a penalized log-likelihood using a fused sparse learning algorithm, which encourages temporally correlated networks to have similar topology and correlation strengths. We design a disease identification framework based on the estimated temporal networks; group-level differences in network properties and classification results demonstrate the importance of including temporally dynamic R-fMRI information to improve diagnosis accuracy for mild cognitive impairment patients.
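For illustration, here is a minimal sketch of the sliding-window step only (window length, stride, and data sizes are assumptions, and the joint fused sparse estimation is not shown): overlapping R-fMRI sub-series are extracted and a correlation matrix is computed per window, yielding the sequence of temporal networks that the paper then estimates jointly.

```python
# Sketch of building temporal connectivity networks via sliding-window correlations.
import numpy as np

def sliding_window_networks(ts, window_len=50, stride=10):
    """ts: (num_timepoints, num_regions) regional mean time series."""
    networks = []
    for start in range(0, ts.shape[0] - window_len + 1, stride):
        sub_series = ts[start:start + window_len]                 # overlapping sub-series
        networks.append(np.corrcoef(sub_series, rowvar=False))    # region-by-region correlations
    return np.stack(networks)                                     # (num_windows, R, R)

# Illustrative usage on synthetic data: 230 timepoints, 90 regions.
rng = np.random.default_rng(0)
temporal_nets = sliding_window_networks(rng.normal(size=(230, 90)))
print(temporal_nets.shape)   # (19, 90, 90) for these settings
```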