User alignment across social networks can facilitate more information/knowledge transferring across networks and thereby benefit several applications, including social link prediction, cross-domain recommendation, and information diffusion. Several works try to learn a common subspace for networks by preserving the structural proximities, such that different contribution weights of neighbors are ignored as users were always connected by unweighted edges. In this paper, we propose an attention-based network embedding model that exploits the social structures for user alignment. In particular, two main components are contained in our model framework: a masked graph attention mechanism which tries to learn the alignment task driven attention weights by the supervision of pre-aligned user pairs, and an embedding algorithm tries to learn a common vector space by explicitly modeling the weighted contribution probabilities between follower-ships and followee-ships. With the learned weights and embeddings transferring between these two components, we construct a unified model for user embedding and alignment. Stochastic gradient descent and negative sampling are adopted for efficient learning and scalability. The extensive experiments on real-world social network data sets demonstrate the effectiveness and robustness of the proposed model compared with several state-of-the-art methods. INDEX TERMS Attention mechanism, network embedding, social network analysis, user alignment.