Xiao Liu scite author profile

Xiao Liu

3 Publications

11 Citation Statements Received

128 Citation Statements Given


Affiliations

Tsinghua University

Publications


WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

Liu, Lai, et al. 2023


We present WebGLM, a web-enhanced question-answering system based on the General Language Model (GLM). Its goal is to augment a pre-trained large language model (LLM) with web search and retrieval capabilities while being efficient for real-world deployments. To achieve this, we develop WebGLM with strategies for the LLM-augmented retriever, bootstrapped generator, and human preference-aware scorer. Specifically, we identify and address the limitations of WebGPT (OpenAI), through which WebGLM is enabled with accuracy, efficiency, and cost-effectiveness advantages. In addition, we propose systematic criteria for evaluating web-enhanced QA systems. We conduct multi-dimensional human evaluation and quantitative ablation studies, which suggest the outperformance of the proposed WebGLM designs over existing systems. WebGLM with the 10-billion-parameter GLM (10B) is shown to perform better than the similar-sized WebGPT (13B) and even comparably to WebGPT (175B) in human evaluation. The code, demo, and data are at https://github.com/THUDM/WebGLM.

CCS Concepts: Computing methodologies → Natural language generation; Software and its engineering → Development frameworks and environments.


GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner

Hou, Cen, et al. 2023


Graph self-supervised learning (SSL), including contrastive and generative approaches, offers great potential to address the fundamental challenge of label scarcity in real-world graph data. Among both sets of graph SSL techniques, the masked graph autoencoders (e.g., GraphMAE), one type of generative method, have recently produced promising results. The idea behind this is to reconstruct the node features (or structures) that are randomly masked from the input with the autoencoder architecture. However, the performance of masked feature reconstruction naturally relies on the discriminability of the input features and is usually vulnerable to disturbance in the features. In this paper, we present a masked self-supervised learning framework, GraphMAE2, with the goal of overcoming this issue. The idea is to impose regularization on feature reconstruction for graph SSL. Specifically, we design the strategies of multi-view random re-mask decoding and latent representation prediction to regularize the feature reconstruction. The multi-view random re-mask decoding is to introduce randomness into reconstruction in the feature space, while the latent representation prediction is to enforce the reconstruction in the embedding space. Extensive experiments show that GraphMAE2 can consistently generate top results on various public datasets, including at least 2.45% improvements over state-of-the-art baselines on ogbn-Papers100M with 111M nodes and 1.6B edges.


OAG: Linking Entities Across Large-Scale Heterogeneous Knowledge Graphs

Zhang, Liu, Tang, et al. 2023

IEEE Trans. Knowl. Data Eng.



