Timo Möller scite author profile

In this work we present the experiments which lead to the creation of our BERT and ELECTRA based German language models, GBERT and GELECTRA. By varying the input training data, model size, and the presence of Whole Word Masking (WWM) we were able to attain SoTA performance across a set of document classification and named entity recognition (NER) tasks for both models of base and large size. We adopt an evaluation driven approach in training these models and our results indicate that both adding more data and utilizing WWM improve model performance. By benchmarking against existing German models, we show that these models are the best German models to date. Our trained models will be made publicly available to the research community.

show abstract

German's Next Language Model

Chan¹,

Schweter²,

Möller³

2020

Preprint

View full text Add to dashboard Cite

show abstract

How Entrepreneurial Firms Profit from Pricing Capabilities: An Examination of Technology–Based Ventures

Flatten

Engelen

Möller³

et al. 2015

Entrepreneurship Theory and Practice

View full text Add to dashboard Cite

This study sheds light on the evolution of the specialized marketing capability, pricing. We develop an operationalization for and empirically validate the pricing–capability dimensions from the perspective of the resource–based view. Using a sample of 420 technology–based ventures, we examine the relationship between pricing capabilities and firm performance, with a particular focus on how age and uncertainty impact these relationships to determine how entrepreneurial firms can profit. We deconstruct pricing capability into four dimensions—price discrimination, dynamic orientation, performance goal orientation, and value delivery—to show, for example, that young companies should focus on their price discrimination capability to improve performance.

show abstract

Semantic Answer Similarity for Evaluating Question Answering Models

Risch¹,

Möller²,

Gutsch³

et al. 2021

Preprint

View full text Add to dashboard Cite

The evaluation of question answering models compares ground-truth annotations with model predictions. However, as of today, this comparison is mostly lexical-based and therefore misses out on answers that have no lexical overlap but are still semantically similar, thus treating correct answers as false. This underestimation of the true performance of models hinders user acceptance in applications and complicates a fair comparison of different models. Therefore, there is a need for an evaluation metric that is based on semantics instead of pure string similarity. In this short paper, we present SAS, a cross-encoder-based metric for the estimation of semantic answer similarity, and compare it to seven existing metrics. To this end, we create an English and a German three-way annotated evaluation dataset containing pairs of answers along with human judgment of their semantic similarity, which we release along with an implementation of the SAS metric and the experiments. We find that semantic similarity metrics based on recent transformer models correlate much better with human judgment than traditional lexical similarity metrics on our two newly created datasets and one dataset from related work.

show abstract

GermanQuAD and GermanDPR: Improving Non-English Question Answering and Passage Retrieval

Möller¹,

Risch²,

Pietsch³

2021

View full text Add to dashboard Cite

A major challenge of research on non-English machine reading for question answering (QA) is the lack of annotated datasets. In this paper, we present GermanQuAD, a dataset of 13,722 extractive question/answer pairs. To improve the reproducibility of the dataset creation approach and foster QA research on other languages, we summarize lessons learned and evaluate reformulation of question/answer pairs as a way to speed up the annotation process. An extractive QA model trained on GermanQuAD significantly outperforms multilingual models and also shows that machine-translated training data cannot fully substitute hand-annotated training data in the target language. Finally, we demonstrate the wide range of applications of German-QuAD by adapting it to GermanDPR, a training dataset for dense passage retrieval (DPR), and train and evaluate one of the first non-English DPR models.5 https://www.crowdguru.de/ 6 GermanQuAD ID: 57716 7 GermanQuAD ID: 56712 8 SQuAD ID: 5725cd6a89a1e219009abef7 9 SQuAD ID: 5723d010f6b826140030fc8e

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Timo Möller

German’s Next Language Model

German's Next Language Model

How Entrepreneurial Firms Profit from Pricing Capabilities: An Examination of Technology–Based Ventures

Semantic Answer Similarity for Evaluating Question Answering Models

GermanQuAD and GermanDPR: Improving Non-English Question Answering and Passage Retrieval

Contact Info

Product

Resources

About