Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer 2021
DOI: 10.18653/v1/2021.acl-long.362
|View full text |Cite
|
Sign up to set email alerts
|

AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding

Abstract: Automatic extraction of product attribute values is an important enabling technology in e-Commerce platforms. This task is usually modeled using sequence labeling architectures, with several extensions to handle multi-attribute extraction. One line of previous work constructs attribute-specific models, through separate decoders or entirely separate models. However, this approach constrains knowledge sharing across different attributes. Other contributions use a single multiattribute model, with different techn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 18 publications
(16 citation statements)
references
References 21 publications
0
16
0
Order By: Relevance
“…Aghajanyan et al [3] train a hyper-text language model based on BART [24] on a largescale web crawl for various downstream tasks. More recently, several attribute extraction approaches [47,49,53] have been proposed, which treat each field as an attribute of interest and extract its corresponding value from clean object context such as web title. Chen et al [9] formulate the web information extraction problem as structural reading comprehension and build a BERT [15] based model to extract structured fields from the web documents.…”
Section: Related Work 21 Information Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…Aghajanyan et al [3] train a hyper-text language model based on BART [24] on a largescale web crawl for various downstream tasks. More recently, several attribute extraction approaches [47,49,53] have been proposed, which treat each field as an attribute of interest and extract its corresponding value from clean object context such as web title. Chen et al [9] formulate the web information extraction problem as structural reading comprehension and build a BERT [15] based model to extract structured fields from the web documents.…”
Section: Related Work 21 Information Extractionmentioning
confidence: 99%
“…Most previous sequence modeling approaches [2,53] only encode the text sequence of the web document without utilizing the HTML layout structure. In this work, we jointly model the text sequence with the HTML layout in a unified Transformer model.…”
Section: Input Layermentioning
confidence: 99%
“…With a limited budget, the development set would allow us to evaluate the performance of different models on 100 product types. Note that for most prior works on product attribute mining [14,18,19], the authors use the same method for gold-standard evaluation. While in this paper, the development set serves the purpose of relative performance comparison.…”
Section: Experiments 61 Datasetsmentioning
confidence: 99%
“…Inspired by named entity recognition models, earlier work leverage statistical models [9] for extraction. With the advancement of deep learning, the most prominent systems designed in recent years adopt BiLSTM-CRF [19,22] or BERT-BiLSTM-CRF [18] architectures for attribute value extraction. Supervised method combined with active learning [22] was explored in OpenTag [22], while follow up works typically settle on distant supervision [14,18,19].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation