This paper presents the results of research on typographical error analysis in two specialised bilingual paper dictionaries: Diccionario de términos económicos, financieros y comerciales/ A Dictionary of Economic, Financial and Commercial Terms (Ariel, 2012), and Diccionario de términos jurídicos/A Dictionary of Legal Terms (Ariel, 2012). A model of errors is described, including similar errors and errors that are repeated both intratextually and intertextually. The error frequency in A Dictionary of Economic, Financial and Commercial Terms is higher than the average error frequency in a reference corpus of fourteen dictionaries (mainly first editions). This indicates that repeated editions do not always guarantee a higher level of formal correctness. Our results also show that a high frequency of errors does not necessarily entail a high intratextual error repetition rate. On the other hand, we establish a relationship between typographical errors and the access function in dictionaries, as that kind of error can interfere with access to accurate lexicographical information and data retrieval (especially when they occur in lemmas or sublemmas).
It is only through an extreme concern for accuracy and the understanding of typographical errors that authors can turn specialised dictionaries into high quality reference works. This paper describes patterns of typographical error reproduction in three specialised English-Spanish dictionaries. We approach intratextual error reproduction (within a particular dictionary), either through related subentries or through non-related subentries. In addition, we compare the frequency of errors between dictionaries written by institutional lexicographers and works written by freelance professionals. The purpose is to provide a model for typographical error detection and analysis that may contribute to formal correctness in reference works. The reason is twofold: a) dictionaries are expected to be high-standard primary tools for language professionals; b) data quality is essential for a wide variety of utilities, ranging from dictionary writing systems and writing assistants to corpus tools. Keywords: data quality, data reusability, specialised bilingual lexicography, typographical error reproduction.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.