PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents

Lo, Kyle; Shen, Zejiang; Newman, Benjamin; Chang, Joseph; Authur, Russell; Bransom, Erin; Candra, Stefan; Chandrasekhar, Yoganand; Huff, Regan; Kuehl, Bailey; Singh, Amanpreet; Wilhelm, Chris; Zamarron, Angele; Hearst, Marti A.; Weld, Daniel; Downey, Doug; Soldaini, Luca

doi:10.18653/v1/2023.emnlp-demo.45

Cited by 4 publications

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

FoRC@NSLP2024: Overview and Insights from the Field of Research Classification Shared Task

Abu Ahmad,

Borisova,

Rehm

2024

Lecture Notes in Computer Science

View full text Add to dashboard Cite

This article provides an overview of the Field of Research Classification (FoRC) shared task conducted as part of the Natural Scientific Language Processing Workshop (NSLP) 2024. The FoRC shared task encompassed two subtasks: the first was a single-label multi-class classification of scholarly papers across a taxonomy of 123 fields, while the second focused on fine-grained multi-label classification within computational linguistics, using a taxonomy of 170 (sub-)topics. The shared task received 13 submissions for the first subtask and two for the second, with teams surpassing baseline performance metrics in both subtasks. The winning team for subtask I employed a multi-modal approach integrating metadata, full-text, and images from publications, achieving a weighted F1 score of 0.75, while the winning team for the second subtask leveraged a weakly supervised X-transformer model enriched with automatically labelled data, achieving a micro F1 score of 0.56 and a macro F1 of 0.43.

show abstract

FoRC@NSLP2024: Overview and Insights from the Field of Research Classification Shared Task

Abu Ahmad,

Borisova,

Rehm

2024

Lecture Notes in Computer Science

View full text Add to dashboard Cite

show abstract

Single-Label Multi-modal Field of Research Classification

Ruosch,

Vasu,

Wang

et al. 2024

Lecture Notes in Computer Science

View full text Add to dashboard Cite

The automated field of research classification for scientific papers is still challenging, even with modern tools such as large language models. As part of a shared task tackling this problem, this paper presents our contribution SLAMFORC, an approach to single-label classification using multi-modal data. We combined the metadata of papers with their full text and, where available, images into a pipeline to predict their field of research with an ensemble voting on traditional classifiers and large language models. We evaluated our approach on the shared task dataset and scored the highest values for two of the four metrics used in the evaluation of the competition, with the other two being the second highest.

show abstract

LitAI: Enhancing Multimodal Literature Understanding and Mining with Generative AI

Medisetti,

Compson,

Fan

et al. 2024

2024 IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR)

View full text Add to dashboard Cite

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents

Cited by 4 publications

References 27 publications

FoRC@NSLP2024: Overview and Insights from the Field of Research Classification Shared Task

FoRC@NSLP2024: Overview and Insights from the Field of Research Classification Shared Task

Single-Label Multi-modal Field of Research Classification

LitAI: Enhancing Multimodal Literature Understanding and Mining with Generative AI

Contact Info

Product

Resources

About