This paper investigates the utility of the Inclusion Index, the Jaccard Index and the Cosine Index for calculating similarities of documents, as used for mapping science and technology. It is shown that, provided that the same content is searched across various documents, the Inclusion Index generally delivers more exact results, in particular when computing the degree of similarity based on citation data. In addition, various methodologies such as co-word analysis, Subject-Action-Object (SAO) structures, bibliographic coupling, co-citation analysis, and selfcitation links are compared. We find that the two former ones tend to describe rather semantic similarities that differ from knowledge flows as expressed by the citation-based methodologies.
In this paper, we are going to present a method for detecting the risks of patent infringement by evaluating similarities between patent documents on the basis of semantic patent analysis. This approach enables the user to visualize similarities in the contents on a semantic patent map by means of multi‐dimensional scaling. The effectiveness of the semantic patent map has already been demonstrated by Dressler (2006) with regard to patents of seal technology, in which documents are commonly kept short and the extracted contents are concise. This paper will open out to the field of biotechnology, where patents can easily comprise several hundreds of pages. The method presented here conveys an interdisciplinary approach and combines computer‐aided natural language processing with domain‐specific expertise of biochemical processes. This is illustrated by an authentic case of infringement involving two manufacturers of DNA chips. Our experiment will show how the infringement case is visualized on a patent map based on semantic patent analysis. This experiment can be compared with the search for a needle in a haystack, the two competitive patents representing significantly conflicting ‘needles.’ From an approximate number of 4,000 patents in the current US Class 435/6, a set of patents was selected that included the ‘needles’ mentioned. This paper will point out how such mutual interference can be detected by way of semantic patent analysis, and what advice may be given to R&D managers who are faced with the risk of patent infringement.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.