Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics 2023
DOI: 10.18653/v1/2023.eacl-main.268
|View full text |Cite
|
Sign up to set email alerts
|

Representation biases in sentence transformers

Dmitry Nikolaev,
Sebastian Padó

Abstract: Variants of the BERT architecture specialised for producing full-sentence representations often achieve better performance on downstream tasks than sentence embeddings extracted from vanilla BERT. However, there is still little understanding of what properties of inputs determine the properties of such representations. In this study, we construct several sets of sentences with pre-defined lexical and syntactic structures and show that SOTA sentence transformers have a strong nominal-participant-set bias: cosin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0
2

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 14 publications
0
2
0
2
Order By: Relevance
“…La obra de Paul Chilton en "Analysing Political Discourse" (2004) ya advertía sobre las limitaciones inherentes a estos tipos de discursos (Chilton, 2004). En el mismo tenor, el estudio de Nikolaev et al (2023) sobre la estimación multilingüe del posicionamiento de partidos políticos ha corroborado estas deficiencias metodológicas (Nikolaev et al, 2023).…”
Section: El Análisis De Discursounclassified
See 1 more Smart Citation
“…La obra de Paul Chilton en "Analysing Political Discourse" (2004) ya advertía sobre las limitaciones inherentes a estos tipos de discursos (Chilton, 2004). En el mismo tenor, el estudio de Nikolaev et al (2023) sobre la estimación multilingüe del posicionamiento de partidos políticos ha corroborado estas deficiencias metodológicas (Nikolaev et al, 2023).…”
Section: El Análisis De Discursounclassified
“…Mario Abdo Benítez (2018-2023 Miembro del Partido Colorado y asumió la presidencia a través de elecciones democráticas. El padre de Abdo, Mario Abdo Benítez Sr., fue secretario privado de Stroessner y se le conocía como un cercano aliado del régimen.…”
unclassified
“…Sentence transformers (Reimers and Gurevych, 2019) implement architecture and training regime changes on BERT to optimize sentence embeddings for downstream tasks. Nikolaev and Padó (2023) analyze the relation between specific sentence properties (e.g. the contribution of different POS) and the geometry of the embedding space of sentence transformers.…”
Section: Related Workmentioning
confidence: 99%
“…Specific grammatical phenomena are often studied on specifically designed or selected datasets (e.g. (Nikolaev and Padó, 2023;Linzen et al, 2016)). We use BLM-AgrF .…”
Section: Datamentioning
confidence: 99%