Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019
DOI: 10.18653/v1/d19-1380
Efficient Sentence Embedding using Discrete Cosine Transform

Abstract: Vector averaging remains one of the most popular sentence embedding methods in spite of its obvious disregard for syntactic structure. While more complex sequential or convolutional networks potentially yield superior classification performance, the improvements in classification accuracy are typically mediocre compared to the simple vector averaging. As an efficient alternative, we propose the use of discrete cosine transform (DCT) to compress word sequences in an order-preserving manner. The lower order DCT …
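As a concrete illustration of the approach the abstract describes, the sketch below stacks word embeddings into a (seq_len, dim) matrix, applies the DCT-II along the sequence axis, and keeps the k lowest-order coefficients per dimension as a fixed-length, order-aware sentence vector. This is a minimal sketch assuming SciPy's scipy.fft.dct; the function name, the zero-padding rule for short sentences, and k=4 are illustrative choices, not the authors' released implementation.

```python
# Minimal sketch of DCT-based sentence embedding (illustrative, not the
# authors' code): treat each embedding dimension as a 1-D signal over the
# token sequence, apply DCT-II, and keep the k lowest-order coefficients.
import numpy as np
from scipy.fft import dct

def dct_sentence_embedding(word_vectors: np.ndarray, k: int = 4) -> np.ndarray:
    """Compress a (seq_len, dim) word-embedding matrix into a k * dim vector."""
    seq_len, dim = word_vectors.shape
    # DCT-II along the sequence (time) axis, one transform per dimension.
    coeffs = dct(word_vectors, type=2, norm="ortho", axis=0)
    # Zero-pad the coefficient matrix when the sentence has fewer than k
    # tokens (an assumed convention so every sentence maps to the same size).
    if seq_len < k:
        coeffs = np.pad(coeffs, ((0, k - seq_len), (0, 0)))
    # Low-order coefficients summarize the coarse feature patterns; flatten
    # them into a single fixed-length sentence vector.
    return coeffs[:k].reshape(-1)

# Usage with stand-in random embeddings: a 7-token sentence of 300-d vectors.
rng = np.random.default_rng(0)
sentence = rng.normal(size=(7, 300))
print(dct_sentence_embedding(sentence, k=4).shape)  # (1200,) = k * dim
```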

Cited by 19 publications (20 citation statements). References 20 publications.
“…Rücklé et al. (2018) improved the average pooling method by concatenating different power means of word embeddings. Almarwani et al. (2019) proposed the use of a Discrete Cosine Transform (DCT) to compress word vectors into sentence embeddings while retaining word order information.…”
Section: Sentence Embedding Methods
confidence: 99%
“…Queen (Mikolov et al., 2013). Thus, given word embeddings, a sentence can be cast as a multidimensional signal over time, in which DCT is used to summarize the general feature patterns in word sequences and compress them into fixed-length vectors (Kayal and Tsatsaronis, 2019; Almarwani et al., 2019).…”
Section: DCT As Sentence Encoder
confidence: 99%
“…However, most of these models, including averaging, disregard structure information, which is an important aspect of any given language. Recently, Almarwani et al. (2019) proposed a structure-sensitive sentence encoder, which utilizes the Discrete Cosine Transform (DCT) as an efficient alternative to averaging. The authors show that this approach is versatile and scalable because it relies only on word embeddings, which can be easily obtained from large unlabeled data.…”
Section: Introduction
confidence: 99%
“…DCT is a way to generate document-level representations in an order-preserving manner, adapted from image compression to NLP by Almarwani et al. (2019). After mapping an input sequence of real numbers to the coefficients of orthogonal cosine basis functions, the low-order coefficients can be used as document embeddings, outperforming vector averaging on most tasks, as shown by the authors.…”
Section: Aggregators
confidence: 99%
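The excerpt above contrasts low-order DCT embeddings with vector averaging. One reason the comparison is natural is that, for an orthonormal DCT-II, the lowest-order coefficient c0 is exactly a scaled vector average, so the DCT representation subsumes the averaging baseline. The snippet below checks this numerically; the random embedding matrix is a stand-in and SciPy's scipy.fft.dct is an assumed tool, not part of the cited work.

```python
# Check that the 0th-order DCT-II coefficient is a scaled vector average:
# with norm="ortho", c0 = sum(x) / sqrt(N) = sqrt(N) * mean(x).
import numpy as np
from scipy.fft import dct

rng = np.random.default_rng(1)
words = rng.normal(size=(9, 50))  # 9 tokens, 50-d stand-in embeddings

c0 = dct(words, type=2, norm="ortho", axis=0)[0]
scaled_mean = np.sqrt(len(words)) * words.mean(axis=0)
print(np.allclose(c0, scaled_mean))  # True
```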