A new metric method for word sense disambiguation is put forward. It is formulated by phenomenological analogy with the wave function of an observable quantity in quantum mechanics: the actual meaning of a multivalued word (a homonym or a polysemant) is determined by its context. The actualized sense is the one with the minimal semantic distance between the semantics of the possible word senses and that of the context, where the meanings of the word in question and of the context follow from their semantic fields based on lexicographic hyperchains. In contrast to the common models, our method accounts for semantic polarity. The formulated method showed good results in disentangling polysemy, which could not be achieved with the contextualized embedding approach based on bidirectional encoder representations from transformers (BERT).
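The minimal-distance selection rule can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the metric or how semantic fields are built from hyperchains, so the sketch assumes each semantic field is a sparse `{term: weight}` dictionary, uses cosine distance as a stand-in metric, and lets signed weights stand in for semantic polarity. The names `cosine_distance`, `disambiguate`, `SENSE_FIELDS`, and `CONTEXT`, as well as all weights, are hypothetical.

```python
from math import sqrt


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two sparse {term: weight} vectors.

    Assumption: cosine distance is only a placeholder for the paper's metric.
    Signed weights are allowed, so opposite polarity increases the distance.
    """
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0  # an empty field is treated as unrelated to everything
    return 1.0 - dot / (norm_a * norm_b)


def disambiguate(sense_fields, context_field):
    """Pick the sense whose semantic field is closest to the context field."""
    return min(
        sense_fields,
        key=lambda sense: cosine_distance(sense_fields[sense], context_field),
    )


# Hypothetical semantic fields for two senses of "bank".
SENSE_FIELDS = {
    "bank/finance": {"money": 1.0, "institution": 0.8, "loan": 0.6},
    "bank/river": {"river": 1.0, "shore": 0.8, "water": 0.6},
}

# Hypothetical semantic field of the surrounding context.
CONTEXT = {"water": 1.0, "river": 0.7, "fishing": 0.5}

chosen = disambiguate(SENSE_FIELDS, CONTEXT)
print(chosen)
```

With these toy fields the context shares terms only with the river sense, so that sense has the smaller distance and is selected. Any other metric over semantic fields could be substituted for `cosine_distance` without changing `disambiguate`.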
This article aims to study the grammatical and social aspects of the use of feminatives in the Ukrainian language on the basis of terminology science. It is shown that adding extra feminatives to a text simply in opposition to masculinitives may result in a number of inconsistencies, leading to violations of grammatical and phonological rules and to the degradation of the language's expressive ability. While doing little to make women more visible, this opposition, in particular, leaves no room for correctly identifying non-binary persons and personified objects.
As a possible solution, we propose recognizing a general gender that combines the simple genders by a logical “OR”, the existence of which follows from an analysis of language practice. The corresponding nouns should be used instead of the masculine names in the generalizing function.
The article analyzes the alphabets used for the Ukrainian language in the course of its evolution and proposes a system of their simple-correspondent (isomorphic) transliteration into the Latin script that possesses the property of reversibility and is the basis for the Ukrainian transliteration standard DSTU 9112:2021. It is stated that, due to the complete letter register, compatibility with language processing procedures, and the more appropriate phonological motivation, the proposed system is more viable than the Romanization tables of ISO 9:1995 and of the Library of Congress. In addition, a phonemic transcription mechanism for modern Ukrainian is proposed, which is necessary for machine translation, cross-lingual information retrieval, language transfer, speech generation, and other Natural Language Processing tasks. The obtained results are helpful for the effective inclusion of Ukraine and other Slavic countries in international cooperation in the field of preservation of historical written monuments, information and linguistic technologies, corpus linguistics and computational lexicography, machine translation, and library science.
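The reversibility property of a simple-correspondent transliteration can be illustrated with a minimal sketch. The table below is a toy subset chosen for illustration only; it is not the DSTU 9112:2021 table, and the names `TOY_TABLE`, `build_tables`, and `transliterate` are hypothetical. The point shown is that a one-to-one letter mapping can be inverted mechanically, so a round trip restores the original text.

```python
# Toy one-to-one letter table (an assumption for illustration only;
# NOT the DSTU 9112:2021 table).
TOY_TABLE = {
    "а": "a", "к": "k", "м": "m", "о": "o",
    "т": "t", "ш": "š", "ж": "ž", "ч": "č",
}


def build_tables(table):
    """Return (forward, backward) mappings, adding upper-case letters.

    Raises ValueError if the mapping is not one-to-one, because such a
    mapping could not be reversed unambiguously.
    """
    forward = dict(table)
    forward.update({src.upper(): dst.upper() for src, dst in table.items()})
    if len(set(forward.values())) != len(forward):
        raise ValueError("mapping is not one-to-one, so it cannot be reversed")
    backward = {dst: src for src, dst in forward.items()}
    return forward, backward


FORWARD, BACKWARD = build_tables(TOY_TABLE)


def transliterate(text, table):
    """Map each character through the table; unknown characters pass through."""
    return "".join(table.get(ch, ch) for ch in text)


latin = transliterate("Шматок", FORWARD)
restored = transliterate(latin, BACKWARD)
print(latin, restored)
```

The round trip is guaranteed only for text whose letters are all covered by the table. Characters passed through unchanged, such as Latin letters already present in the source, would be mapped to Cyrillic on the way back, which is one reason a complete letter register matters for a reversible system.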