The main goal of the project is to create the English-Polish-Belarusian Literary Parallel Corpus (EPB corpus) and present its applications in several linguistic disciplines, including translation studies and discourse analysis. The thesis provides an outline of corpus linguistics research and corpus linguistics as a methodology, then addresses the problem of the differences in the development of corpus linguistics in the three languages: English (as a lingua franca), Polish (a statutory national language) and Belarusian (a minority language). The analysis of available tools and resources for each of these languages proves the need for the EPB corpus in order to develop useful new resources for Belarusian in particular.

A substantial part of the thesis presents the documentation of the process of creating the corpus. Various aspects of corpus design, text collection and text encoding are discussed in the context of the availability and usability of numerous tools. Special attention is paid to the tools specifically designed for each language and to the solutions that enable the data processed by these tools to be merged.

Using corpus linguistics techniques (e.g. linguistic distribution, lexical density, vector-based semantic similarity measures) the thesis goes on to explore the application of the EPB corpus in investigating translation universals, in exploring the dependency between the author’s and the translator’s style, in supporting translation students and professionals, and in analysis of gender discourse. These case studies clearly show the practical value of the resource.

Finally, the thesis provides a detailed overview of the plans and possibilities for further development of the project in the broader context of the evolution of Polish and Belarusian corpus linguistics.