Biocomputing 2007 2006
DOI: 10.1142/9789812772435_0029
|View full text |Cite
|
Sign up to set email alerts
|

Mining Patents Using Molecular Similarity Search

Abstract: Text analytics is becoming an increasingly important tool used in biomedical research. While advances continue to be made in the core algorithms for entity identification and relation extraction, a need for practical applications of these technologies arises. We developed a system that allows users to explore the US Patent corpus using molecular information. The core of our system contains three main technologies: A high performing chemical annotator which identifies chemical terms and converts them to structu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
25
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(25 citation statements)
references
References 9 publications
0
25
0
Order By: Relevance
“…[14][15][16][17]), often a rate limiting step in workflows [15]. It was so in our use of a patent data base generated by using Blue Gene to read automatically all US patents [16,17]. The 6.7 million records comprise SMILES code [18] for compounds mentioned 1 along with assignee, and also the patent reference by which other data can be joined.…”
Section: Introductionmentioning
confidence: 99%
“…[14][15][16][17]), often a rate limiting step in workflows [15]. It was so in our use of a patent data base generated by using Blue Gene to read automatically all US patents [16,17]. The 6.7 million records comprise SMILES code [18] for compounds mentioned 1 along with assignee, and also the patent reference by which other data can be joined.…”
Section: Introductionmentioning
confidence: 99%
“…Several chemo-informatics tools to analyze chemical similarities between small-molecules are available (Medina-Franco et al, 2007;Miller, 2002;Rhodes et al, 2007).…”
Section: Introduction Imentioning
confidence: 99%
“…the main problem is the relevant, prevalent, and perennial one of what is meant by the similarity of compounds. The general discipline tackling these and related issues is often called molecule mining [14] for a comprehensive bibliography, and [15].…”
Section: Information From Patentsmentioning
confidence: 99%
“…This turns out to be insightful, nonetheless, in regard to readdressing the concepts of similarity and novelty. The initial aim of our project was to provide complementary tools to support patent based chemoinformatics systems developed by our colleagues [15,18]. The overall study with IBM colleagues involved using very high performance computing to read all US patents at that time, and to analyze a patent data base generated consisting of 6.7 million compounds re-expressed in SMILES codes [19] as character strings that represent the chemical formulae of compounds, alongside assignee and patent reference.…”
Section: Scope and Utilitymentioning
confidence: 99%