Key words: Schistosoma mansoni -bioinformatics -expressed sequences tag -clustering analysis -metabolismSchistosoma mansoni is a dioiceous trematode and one of the etiologic agents of schistosomiasis, the second more significant tropical disease concerning public health. Despite recent efforts undertaken to contain its progress, the disease is still endemic in several countries, with around 200 million people infected by the parasite (http://www.who.int/ctd/schisto/epidemio.htm). The study of S. mansoni is, therefore, very important in human parasitology. Gaining knowledge on the genome of this parasite is essential for a better understanding of its metabolism and biology and will help to elucidate important aspects of the mechanisms of drug resistance and antigenic variation that allow it to escape from the host immune system (Franco et al. 2000).The size of S. mansoni genome is estimated in 270Mb with the number of expressed genes ranging from 15000 to 20000 (Simpson et al. 1982, Franco & Simpson 2001. Although some genomic sequences of S. mansoni have been produced, the Schistosoma Genome Network (SGN) has chosen as first priority the sequencing of cDNA using the expressed sequence tags (ESTs) strategy, from which is possible to obtain fast and relevant information Although resulting in fast and very important information, ESTs available from public databases, such as dbEST, show some degree of redundancy and present a great number of errors, because they are single pass sequences (Miller et al. 1999). To overcome these problems and to increase the length of the sequences, facilitating identification by homology searches, clustering procedures are performed (Oliveira & Johnston 2001). In this kind of procedure, sequences that have some region of similarity are joined into a cluster. Therefore, sequences possessing overlapping regions and representing a single gene are joined into the same cluster, decreasing redundancy. Sequences of each cluster are then aligned to generate a consensus sequence. In this approach, the base (and, if available, the quality value designated by the base caller program) present in each sequence position is considered in the construction of a high quality consensus (Huang & Madan 1999). The clustering procedure can, therefore, have two outcomes: consensus are generated by the alignment of the sequences of a cluster and singlets result from sequences that have not been grouped to any others. Theoretically, each sequence (either a consensus or a singlet) should represent an individual gene, and so, these sequences are called uniques. As is expected that each sequence represents a single gene, the comparison of the number of uniques with the total number of predicted genes make it possible to know, approximately, how many genes have not been discovered yet.
Nesse artigo apresentamos um mapeamento de vetores numa máquina SIMD em malha bidimensional, a Zephyr. O mapeamento proposto permite que algoritmos projetados para uma máquina SIMD vetorial sejam executados eficientemente na Zephyr, explorando seus recursos de modo racional. As alterações necessárias para implementação do mapeamento são simples e estão descritas no texto. Os resultados obtidos para o algoritmo de cálculo de similaridade de sequências de DNA mostram que nossa abordagem tem a execução bem mais eficiente do que a conseguida quando o mapeamento automático da máquina é utilizado.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.