Lukás̆ Svoboda scite author profile

Lukás̆ Svoboda

5Publications

115Citation Statements Received

105Citation Statements Given

How they've been cited

152

113

How they cite others

104

Affiliations

Czech Technical University in Prague, University of West Bohemia, Na Homolce Hospital

Publications

Order By: Most citations

UWB at SemEval-2016 Task 1: Semantic Textual Similarity using Lexical, Syntactic, and Semantic Information

Brychcín

Svoboda

2016

View full text Add to dashboard Cite

We present our UWB system for Semantic Textual Similarity (STS) task at SemEval 2016. Given two sentences, the system estimates the degree of their semantic similarity. We use state-of-the-art algorithms for the meaning representation and combine them with the best performing approaches to STS from previous years. These methods benefit from various sources of information, such as lexical, syntactic, and semantic. In the monolingual task, our system achieve mean Pearson correlation 75.7% compared with human annotators. In the cross-lingual task, our system has correlation 86.3% and is ranked first among 26 systems.

show abstract

UWB at SemEval-2016 Task 5: Aspect Based Sentiment Analysis

Hercig¹,

Brychcín²,

Svoboda³

et al. 2016

View full text Add to dashboard Cite

This paper describes our system used in the Aspect Based Sentiment Analysis (ABSA) task of SemEval 2016. Our system uses Maximum Entropy classifier for the aspect category detection and for the sentiment polarity task. Conditional Random Fields (CRF) are used for opinion target extraction. We achieve state-of-the-art results in 9 experiments among the constrained systems and in 2 experiments among the unconstrained systems.

show abstract

New Word Analogy Corpus for Exploring Embeddings of Czech Words

Svoboda

Brychcín

2018

View full text Add to dashboard Cite

The word embedding methods have been proven to be very useful in many tasks of NLP (Natural Language Processing). Much has been investigated about word embeddings of English words and phrases, but only little attention has been dedicated to other languages. Our goal in this paper is to explore the behavior of state-of-the-art word embedding methods on Czech, the language that is characterized by very rich morphology. We introduce new corpus for word analogy task that inspects syntactic, morphosyntactic and semantic properties of Czech words and phrases. We experiment with Word2Vec and GloVe algorithms and discuss the results on this corpus. The corpus is available for the research community.

show abstract

Unsupervised Methods to Improve Aspect-Based Sentiment Analysis in Czech

Hercig¹,

Brychcín²,

Svoboda³

et al. 2016

CyS

View full text Add to dashboard Cite

We examine the effectiveness of several unsupervised methods for latent semantics discovery as features for aspect-based sentiment analysis (ABSA). We use the shared task definition from SemEval 2014. In our experiments we use labeled and unlabeled corpora within the restaurants domain for two languages: Czech and English. We show that our models improve the ABSA performance and prove that our approach is worth exploring. Moreover, we achieve new state-of-the-art results for Czech. Another important contribution of our work is that we created two new Czech corpora within the restaurant domain for the ABSA task: one labeled for supervised training, and the other (considerably larger) unlabeled for unsupervised training. The corpora are available to the research community.

show abstract

Cross-lingual word analogies using linear transformations between semantic spaces

Brychcín

Taylor

Svoboda

2019

Expert Systems with Applications

View full text Add to dashboard Cite

We generalize the word analogy task across languages, to provide a new intrinsic evaluation method for cross-lingual semantic spaces. We experiment with six languages within different language families, including English, German, Spanish, Italian, Czech, and Croatian. State-of-the-art monolingual semantic spaces are transformed into a shared space using dictionaries of word translations. We compare several linear transformations and rank them for experiments with monolingual (no transformation), bilingual (one semantic space is transformed to another), and multilingual (all semantic spaces are transformed onto English space) versions of semantic spaces. We show that tested linear transformations preserve relationships between words (word analogies) and lead to impressive results. We achieve average accuracy of 51.1%, 43.1%, and 38.2% for monolingual, bilingual, and multilingual semantic spaces, respectively. this does not make sense because w a 1 and w a 2 are in a different language. Thus we discard only w b 3 from the search.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Lukás̆ Svoboda

UWB at SemEval-2016 Task 1: Semantic Textual Similarity using Lexical, Syntactic, and Semantic Information

UWB at SemEval-2016 Task 5: Aspect Based Sentiment Analysis

New Word Analogy Corpus for Exploring Embeddings of Czech Words

Unsupervised Methods to Improve Aspect-Based Sentiment Analysis in Czech

Cross-lingual word analogies using linear transformations between semantic spaces

Contact Info

Product

Resources

About