2022
DOI: 10.48550/arxiv.2202.00217
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

WebFormer: The Web-page Transformer for Structure Information Extraction

Abstract: Structure information extraction refers to the task of extracting structured text fields from web pages, such as extracting a product offer from a shopping page including product title, description, brand and price. It is an important research topic which has been widely studied in document understanding and web search. Recent natural language models with sequence modeling have demonstrated state-of-the-art performance on web information extraction. However, effectively serializing tokens from unstructured web… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 37 publications
(55 reference statements)
0
0
0
Order By: Relevance