Alfredo Maldonado scite author profile

Alfredo Maldonado

5 Publications

63 Citation Statements Received

53 Citation Statements Given


Affiliations

Trinity College Dublin

Publications


Measuring Gender Bias in Word Embeddings across Domains and Discovering New Gender Bias Word Categories

Chaloner, Maldonado. 2019


Prior work has shown that word embeddings capture human stereotypes, including gender bias. However, there is a lack of studies testing the presence of specific gender bias categories in word embeddings across diverse domains. This paper aims to fill this gap by applying the WEAT bias detection method to four sets of word embeddings trained on corpora from four different domains: news, social networking, biomedical and a gender-balanced corpus extracted from Wikipedia (GAP). We find that some domains are definitely more prone to gender bias than others, and that the categories of gender bias present also vary for each set of word embeddings. We detect some gender bias in GAP. We also propose a simple but novel method for discovering new bias categories by clustering word embeddings. We validate this method through WEAT's hypothesis testing mechanism and find it useful for expanding the relatively small set of well-known gender bias word categories commonly used in the literature.
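For readers unfamiliar with WEAT, the sketch below shows how its effect size is commonly computed: each target word gets an association score (mean cosine similarity to one attribute set minus mean cosine similarity to the other), and the difference between the two target sets' mean scores is divided by the standard deviation over all target words. This is an illustrative sketch, not the paper's code. The two-dimensional vectors are invented toy data, the function names are hypothetical, and the use of the sample standard deviation is an assumption.

```python
import math
from statistics import mean, stdev


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def association(w, attrs_a, attrs_b):
    """How much closer word vector w is to attribute set A than to set B."""
    sim_a = [cosine(w, a) for a in attrs_a]
    sim_b = [cosine(w, b) for b in attrs_b]
    return mean(sim_a) - mean(sim_b)


def weat_effect_size(targets_x, targets_y, attrs_a, attrs_b):
    """Difference of mean associations, normalised by the spread of all scores."""
    s_x = [association(x, attrs_a, attrs_b) for x in targets_x]
    s_y = [association(y, attrs_a, attrs_b) for y in targets_y]
    # Assumption: sample standard deviation over the union of both target sets.
    return (mean(s_x) - mean(s_y)) / stdev(s_x + s_y)


# Invented toy vectors: X leans towards attribute A, Y towards attribute B.
targets_x = [[1.0, 0.1], [0.9, 0.2]]
targets_y = [[0.1, 1.0], [0.2, 0.9]]
attrs_a = [[1.0, 0.0]]
attrs_b = [[0.0, 1.0]]

effect = weat_effect_size(targets_x, targets_y, attrs_a, attrs_b)
print(f"WEAT effect size: {effect:.3f}")
```

A positive value means the first target set is more strongly associated with the first attribute set. WEAT's hypothesis test, which the abstract mentions, additionally estimates significance by permuting the target sets; that step is omitted here.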


Detection of Verbal Multi-Word Expressions via Conditional Random Fields with Syntactic Dependency Features and Semantic Re-Ranking

Maldonado, Han, Moreau et al. 2017


This paper describes a system for identifying Verbal Multi-Word Expressions (VMWEs) in running text. The system mainly exploits universal syntactic dependency features through a Conditional Random Fields (CRF) sequence model. The system competed in the Closed Track at the PARSEME VMWE Shared Task 2017, ranking 2nd in most languages on full VMWE-based evaluation and 1st in three languages on token-based evaluation. In addition, this paper presents an option to re-rank the 10 best CRF-predicted sequences via semantic vectors, boosting its scores above other systems in the competition. We also show that all systems in the competition would struggle to beat a simple lookup baseline system and argue for a more purpose-specific evaluation scheme.


ADAPT at SemEval-2018 Task 9: Skip-Gram Word Embeddings for Unsupervised Hypernym Discovery in Specialised Corpora

Maldonado, Klubička. 2018


This paper describes a simple but competitive unsupervised system for hypernym discovery. The system uses skip-gram word embeddings with negative sampling, trained on specialised corpora. Candidate hypernyms for an input word are predicted based on cosine similarity scores. Two sets of word embedding models were trained separately on two specialised corpora: a medical corpus and a music industry corpus. Our system scored highest in the medical domain among the competing unsupervised systems but performed poorly on the music industry domain. Our approach does not depend on any external data other than raw specialised corpora.
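The ranking step described in this abstract, predicting candidate hypernyms by cosine similarity to the input word, can be sketched as follows. This is a minimal illustration, not the authors' system: the three-dimensional embeddings are invented toy values standing in for trained skip-gram vectors, and `rank_candidates` and `top_k` are hypothetical names.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_candidates(query, embeddings, top_k=3):
    """Return the top_k vocabulary words most similar to the query word."""
    query_vec = embeddings[query]
    scored = [
        (word, cosine(query_vec, vec))
        for word, vec in embeddings.items()
        if word != query  # never propose the input word itself
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Invented toy embeddings; a real system would load vectors trained on a
# specialised corpus.
toy_embeddings = {
    "aspirin": [0.9, 0.1, 0.0],
    "drug": [0.8, 0.2, 0.1],
    "guitar": [0.0, 0.1, 0.9],
    "instrument": [0.1, 0.2, 0.8],
}

ranking = rank_candidates("aspirin", toy_embeddings)
for word, score in ranking:
    print(f"{word}\t{score:.3f}")
```

Note that cosine similarity is symmetric and measures relatedness in general, so nothing in this sketch distinguishes a hypernym from any other closely related word.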


Semantic reranking of CRF label sequences for verbal multiword expression identification

Moreau, Alsulaimani, Maldonado et al. 2018


Statistical Forecast of the Total Number of Storms per Season for the Eastern Pacific (Pronóstico estadístico del número total de tormentas por temporada para el Pacífico Oriental)

Moyne, Maldonado. 1978

Geofis Int


Using frequencies instead of simple frequencies makes it possible to develop a forecasting procedure by analyzing the trend in the total number of storms per season at whichever stage of development the storm reaches; the analysis is applied to cyclones in the Pacific Ocean near the Mexican coast. The method has a minimum average accuracy of 85%.


