AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding

Yan, Jun; Zalmout, Nasser; Liang, Yan; Grant, Christan; Ren, Xiang; Dong, Xin Luna

doi:10.18653/v1/2021.acl-long.362

Cited by 18 publications

(16 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Aghajanyan et al [3] train a hyper-text language model based on BART [24] on a largescale web crawl for various downstream tasks. More recently, several attribute extraction approaches [47,49,53] have been proposed, which treat each field as an attribute of interest and extract its corresponding value from clean object context such as web title. Chen et al [9] formulate the web information extraction problem as structural reading comprehension and build a BERT [15] based model to extract structured fields from the web documents.…”

Section: Related Work 21 Information Extractionmentioning

confidence: 99%

See 1 more Smart Citation

WebFormer: The Web-page Transformer for Structure Information Extraction

Wang¹,

Fang²,

Ravula³

et al. 2022

Preprint

View full text Add to dashboard Cite

Structure information extraction refers to the task of extracting structured text fields from web pages, such as extracting a product offer from a shopping page including product title, description, brand and price. It is an important research topic which has been widely studied in document understanding and web search. Recent natural language models with sequence modeling have demonstrated state-of-the-art performance on web information extraction. However, effectively serializing tokens from unstructured web pages is challenging in practice due to a variety of web layout patterns. Limited work has focused on modeling the web layout for extracting the text fields. In this paper, we introduce WebFormer, a Web-page transFormer model for structure information extraction from web documents. First, we design HTML tokens for each DOM node in the HTML by embedding representations from their neighboring tokens through graph attention. Second, we construct rich attention patterns between HTML tokens and text tokens, which leverages the web layout for effective attention weight computation. We conduct an extensive set of experiments on SWDE and Common Crawl benchmarks. Experimental results demonstrate the superior performance of the proposed approach over several state-of-the-art methods. CCS CONCEPTS• Computing methodologies → Information extraction.

show abstract

Section: Related Work 21 Information Extractionmentioning

confidence: 99%

“…Most previous sequence modeling approaches [2,53] only encode the text sequence of the web document without utilizing the HTML layout structure. In this work, we jointly model the text sequence with the HTML layout in a unified Transformer model.…”

Section: Input Layermentioning

confidence: 99%

WebFormer: The Web-page Transformer for Structure Information Extraction

Wang¹,

Fang²,

Ravula³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…With a limited budget, the development set would allow us to evaluate the performance of different models on 100 product types. Note that for most prior works on product attribute mining [14,18,19], the authors use the same method for gold-standard evaluation. While in this paper, the development set serves the purpose of relative performance comparison.…”

Section: Experiments 61 Datasetsmentioning

confidence: 99%

“…Inspired by named entity recognition models, earlier work leverage statistical models [9] for extraction. With the advancement of deep learning, the most prominent systems designed in recent years adopt BiLSTM-CRF [19,22] or BERT-BiLSTM-CRF [18] architectures for attribute value extraction. Supervised method combined with active learning [22] was explored in OpenTag [22], while follow up works typically settle on distant supervision [14,18,19].…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision

Zhang,

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

Automatic extraction of product attributes from their textual descriptions is essential for online shopper experience. One inherent challenge of this task is the emerging nature of e-commerce products -we see new types of products with their unique set of new attributes constantly. Most prior works on this matter mine new values for a set of known attributes but cannot handle new attributes that arose from constantly changing data. In this work, we study the attribute mining problem in an open-world setting to extract novel attributes and their values. Instead of providing comprehensive training data, the user only needs to provide a few examples for a few known attribute types as weak supervision. We propose a principled framework that first generates attribute value candidates and then groups them into clusters of attributes. The candidate generation step probes a pre-trained language model to extract phrases from product titles. Then, an attribute-aware fine-tuning method optimizes a multitask objective and shapes the language model representation to be attribute-discriminative. Finally, we discover new attributes and values through the self-ensemble of our framework, which handles the open-world challenge. We run extensive experiments on a large distantly annotated development set and a gold standard human-annotated test set that we collected. Our model significantly outperforms strong baselines and can generalize to unseen attributes and product types. CCS CONCEPTS• Information systems → Web mining.

show abstract

Knowledge graphs: Introduction, history, and perspectives

et al. 2022

Self Cite

View full text Add to dashboard Cite

Knowledge graphs (KGs) have emerged as a compelling abstraction for organizing the world's structured knowledge and for integrating information extracted from multiple data sources. They are also beginning to play a central role in representing information extracted by AI systems, and for improving the predictions of AI systems by giving them knowledge expressed in KGs as input. The goals of this article are to (a) introduce KGs and discuss important areas of application that have gained recent prominence; (b) situate KGs in the context of the prior work in AI; and (c) present a few contrasting perspectives that help in better understanding KGs in relation to related technologies.

show abstract

AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding

Cited by 18 publications

References 21 publications

WebFormer: The Web-page Transformer for Structure Information Extraction

WebFormer: The Web-page Transformer for Structure Information Extraction

OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision

Knowledge graphs: Introduction, history, and perspectives

Contact Info

Product

Resources

About