Due to the increasing amount of information present on the Web, Automatic Document Classification (ADC) has become an important research topic. ADC usually follows a standard supervised learning strategy, where we first build a model using pre-classified documents and then use it to classify new unseen documents. One major challenge for ADC in many scenarios is that the characteristics of the documents and the classes to which they belong may change over time. However, most of the current techniques for ADC are applied without taking into account the temporal evolution of the collection of documents.In this work, we perform a detailed study of the temporal evolution in the ADC, introducing an analysis methodology. We discuss that temporal evolution may be explained by three factors: 1) class distribution; 2) term distribution; and 3) class similarity. We employ metrics and experimental strategies capable of isolating each of these factors in order to analyze them separately, using two very different document collections: the ACM Digital Library and the Medline medical collections. Moreover, we present some preliminary results of potential gains that could be obtained by varying the training set to find the ideal size that minimizes the time effects. We show that by using just 69% of the ACM database, we are able to have an accuracy of 89.76%, and with only 25% of the Medline, an accuracy of 87.57%, which means gains of up to 20% in accuracy with much smaller training sets.
This work describes classification, functions, location, inhibition, activation, and therapeutic applications of proteases from snake venoms and vegetables. Snake venoms and vegetables can present toxins that unchain necrosis or proteolysis due to the direct cytotoxic action of venom proteases. These proteases are potential tools in the development of drugs for the prevention and treatment of several illnesses. We report herein mainly fibrinogenolytic metallo proteases and serine proteases ("thrombin-like"). These enzymes are extensively used in the treatment and prevention of thrombotic disorders, since they serve as defibrinogenating agents. The therapeutic uses of fibrin(ogen)olytic metallo proteases hold promise for clinical application due to potential in reversing the effects of thrombosis; this has been shown to be an alternative approach to the prevention and treatment of cardiovascular disorders, which are among the most prominent causes of mortality around the world. Plant proteases can be utilized for many cellular and molecular activities, in antibacterial and anticancer therapies, and in the treatment of snakebites, inhibiting snake venom activities such as blood-clotting, defibrinogenation, and fibrin(ogen)olytic and hemorrhagic actions. These toxins also display potential for clinical use in the treatment of hemostatic disorders.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.