Ieva Staliūnaitė scite author profile

Ieva Staliūnaitė

5Publications

33Citation Statements Received

19Citation Statements Given

How they've been cited

How they cite others

Affiliations

Huawei Technologies (Sweden), Huawei Technologies (United Kingdom)

Publications

Order By: Most citations

Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA

Staliūnaitė¹,

Iacobacci²

2020

View full text Add to dashboard Cite

Many NLP tasks have benefited from transferring knowledge from contextualized word embeddings, however the picture of what type of knowledge is transferred is incomplete. This paper studies the types of linguistic phenomena accounted for by language models in the context of a Conversational Question Answering (CoQA) task. We identify the problematic areas for the finetuned RoBERTa, BERT and DistilBERT models through systematic error analysis -basic arithmetic (counting phrases), compositional semantics (negation and Semantic Role Labeling), and lexical semantics (surprisal and antonymy). When enhanced with the relevant linguistic knowledge through multitask learning, the models improve in performance. Ensembles of the enhanced models yield a boost between 2.2 and 2.7 points in F1 score overall, and up to 42.1 points in F1 on the hardest question classes. The results show differences in ability to represent compositional and lexical information between RoBERTa, BERT and DistilBERT.

show abstract

Getting to “Hearer-old”: Charting Referring Expressions Across Time

Staliūnaitė¹,

Rohde²,

Webber³

et al. 2018

View full text Add to dashboard Cite

When a reader is first introduced to an entity, its referring expression must describe the entity. For entities that are widely known, a single word or phrase often suffices. This paper presents the first study of how expressions that refer to the same entity develop over time. We track thousands of person and organization entities over 20 years of New York Times (NYT). As entities move from hearernew (first introduction to the NYT audience) to hearer-old (common knowledge) status, we show empirically that the referring expressions along this trajectory depend on the type of the entity, and exhibit linguistic properties related to becoming common knowledge (e.g., shorter length, less use of appositives, more definiteness). These properties can also be used to build a model to predict how long it will take for an entity to reach hearer-old status. Our results reach 10-30% absolute improvement over a majority-class baseline.

show abstract

Breaking Sentiment Analysis of Movie Reviews

Staliūnaitė¹,

Bonfil²

2017

View full text Add to dashboard Cite

show abstract

Auxiliary Capsules for Natural Language Understanding

Staliūnaitė

Iacobacci

2020

View full text Add to dashboard Cite

Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

Staliūnaitė¹,

Gorinski²,

Iacobacci³

2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.