Derek Thomas scite author profile

Derek Thomas

3Publications

8Citation Statements Received

65Citation Statements Given

How they've been cited

How they cite others

Affiliations

Inception Institute of Artificial Intelligence

Publications

Order By: Most citations

Autoencoding Keyword Correlation Graph for Document Clustering

Chiu¹,

Sahu²,

Thomas³

et al. 2020

View full text Add to dashboard Cite

Document clustering requires a deep understanding of the complex structure of longtext; in particular, the intra-sentential (local) and inter-sentential features (global). Existing representation learning models do not fully capture these features. To address this, we present a novel graph-based representation for document clustering that builds a graph autoencoder (GAE) on a Keyword Correlation Graph. The graph is constructed with topical keywords as nodes and multiple local and global features as edges. A GAE is employed to aggregate the two sets of features by learning a latent representation which can jointly reconstruct them. Clustering is then performed on the learned representations, using vector dimensions as features for inducing document classes. Extensive experiments on two datasets show that the features learned by our approach can achieve better clustering performance than other existing features, including term frequency-inverse document frequency and average embedding.

show abstract

Relation Extraction with Self-determined Graph Convolutional Network

Sahu

Thomas²,

Chiu

et al. 2020

View full text Add to dashboard Cite

Relation Extraction is a way of obtaining the semantic relationship between entities in text. The state-of-the-art methods use linguistic tools to build a graph for the text in which the entities appear and then a Graph Convolutional Network (GCN) is employed to encode the pre-built graphs. Although their performance is promising, the reliance on linguistic tools results in a non end-to-end process. In this work, we propose a novel model, the Self-determined Graph Convolutional Network (SGCN), which determines a weighted graph using a self-attention mechanism, rather using any linguistic tool. Then, the self-determined graph is encoded using a GCN. We test our model on the TACRED dataset and achieve the state-of-the-art result. Our experiments show that SGCN outperforms the traditional GCN, which uses dependency parsing tools to build the graph. CCS CONCEPTS • Computing methodologies → Information extraction.

show abstract

Attending to Inter-sentential Features in Neural Text Classification

Chiu

Sahu

Sengupta

et al. 2020

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Derek Thomas

Autoencoding Keyword Correlation Graph for Document Clustering

Relation Extraction with Self-determined Graph Convolutional Network

Attending to Inter-sentential Features in Neural Text Classification

Contact Info

Product

Resources

About