Mathematical expression retrieval is a key means of searching scientific contents in web and digital libraries. Because mathematical expressions have many different attributes compared with ordinary text, it is necessary to study the special retrieval methods including the indexing and matching model of mathematical expressions. In this paper, on the basis of the introduction of the existing math searching methods and the FDS based index, a mathematical expression matching model was proposed which realized the exact matching of formulas with three query modes called global query mode, local query mode and operational query mode. The algorithms of the query modes were given respectively. A prototype system based on the proposed model was implemented and the comparison experiments were carried out. The experimental results show that the proposed matching model simultaneously realized matching formulas in exact mode and reducing time and space consumption of retrieving to an acceptable degree. It is effective for searching math content in relative digital mathematics library.
Abstract-It is quite inadequate in providing formula retrieval function by traditional retrieval techniques used in full-text information retrieval system. The main reason is that there are many difficulties to extract the keywords of the mathematical formulas. In this paper, a detailed analysis of the structural characteristics of mathematical formulas and existing index mechanism of mathematical formula searching engine is fulfilled. Then a full-text index (named SLIndex) of mathematical formulas with B+ tree structure is designed and implemented which extracts the structured logic sub-tree feature as keywords of formulas and employs inverted index. Finally, a formula search engine model based on SLIndex is implemented in Apache 2.0 web server.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.