Information Retrieval constitutes an important task for extracting meaningful data in huge amounts of texts since the majority of information in the world exist electronically, and in unstructured and plain text form. Most datamining applications assimilate only structured information, this means before any predictive model can be applied the data must be prepared in a special way. This paper presents a web-based tagger system for named entities detection in plain text to help users tagging texts resources. The advantage of using this tool is the possibility of tagging any plain text file and with any type of class for the entities, allowing to treat any domain and thus obtaining a wider recognition of named entities. In addition, a small application shows the facility in the training of NER classifier models as an advantage of corpora constructed with the web-based tagger.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.