Thiago Dias Bispo scite author profile

In this study, we will be presenting LUMPAC (LUMinescence PACkage), which was developed with the objective of making possible the theoretical study of lanthanide-based luminescent systems. This is the first software that allows the study of luminescent properties of lanthanide-based systems. Besides being a computationally efficient software, LUMPAC is user friendly and can be used by researchers who have no previous experience in theoretical chemistry. With this new tool, we hope to enable research groups to use theoretical tools on projects involving systems that contain lanthanide ions.

show abstract

Long Short-Term Memory Model for Classification of English-PtBR Cross-Lingual Hate Speech

Bispo¹,

Macedo²,

Santos³

et al. 2019

Journal of Computer Science

View full text Add to dashboard Cite

show abstract

ParamGULP: An efficient Python code for obtaining interatomic potential parameters for General Utility Lattice Program

Dutra

Bispo

Freitas

et al. 2021

Computer Physics Communications

View full text Add to dashboard Cite

Morphological Skip-Gram: Replacing FastText characters n-gram with morphological knowledge

Santos

Bispo

Macedo

et al. 2021

View full text Add to dashboard Cite

Natural language processing systems have attracted much interest of the industry. This branch of study is composed of some applications such as machine translation, sentiment analysis, named entity recognition, question and answer, and others. Word embeddings (i.e., continuous word representations) are an essential module for those applications generally used as word representation to machine learning models. Some popular methods to train word embeddings are GloVe and Word2Vec. They achieve good word representations, despite limitations: both ignore morphological information of the words and consider only one representation vector for each word. This approach implies the word embeddings does not consider different word contexts properly and are unaware of its inner structure. To mitigate this problem, the other word embeddings method FastText represents each word as a bag of characters n-grams. Hence, a continuous vector describes each n-gram, and the final word representation is the sum of its characters n-grams vectors. Nevertheless, the use of all n-grams character of a word is a poor approach since some n-grams have no semantic relation with their words and increase the amount of potentially useless information. This approach also increase the training phase time. In this work, we propose a new method for training word embeddings, and its goal is to replace the FastText bag of character n-grams for a bag of word morphemes through the morphological analysis of the word. Thus, words with similar context and morphemes are represented by vectors close to each other. To evaluate our new approach, we performed intrinsic evaluations considering 15 different tasks, and the results show a competitive performance compared to FastText. Moreover, the proposed model is $40\%$ faster than FastText in the training phase. We also outperform the baseline approaches in extrinsic evaluations through Hate speech detection and NER tasks using different scenarios.

show abstract

LaNa2

Guimarães

Bispo

Azevedo

et al. 2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.