2012
DOI: 10.1007/978-3-642-33492-4_24
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing Patent Expertise through Automatic Matching with Scientific Papers

Abstract: International audienceThis paper focuses on a subtask of the QUAERO1 research program, a major innovating research project related to the automatic processing of multimedia and multilingual content. The objective discussed in this article is to propose a new method for the classification of scientific papers, developed in the context of an international patents classification plan related to the same field. The practical purpose of this work is to provide an assistance tool to experts in their task of evaluati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 24 publications
0
3
0
Order By: Relevance
“…A common solution to deal with unbalance in dataset is undersampling majority classes and oversampling minority classes. However, sampling that introduces redundancy in dataset does not improve the performance in this dataset, as it has been shown in [12]. We thus propose hereafter to prune irrelevant features and to contrast the relevant ones as an alternative solution.…”
Section: Experimental Data and Resultsmentioning
confidence: 93%
See 1 more Smart Citation
“…A common solution to deal with unbalance in dataset is undersampling majority classes and oversampling minority classes. However, sampling that introduces redundancy in dataset does not improve the performance in this dataset, as it has been shown in [12]. We thus propose hereafter to prune irrelevant features and to contrast the relevant ones as an alternative solution.…”
Section: Experimental Data and Resultsmentioning
confidence: 93%
“…The source data contains 6387 patents in XML format, grouped into 15 subclasses of the A61K class (medical preparation). 25887 citations have been extracted from 6387 patents [12]. Then the Medline database is queried with extracted citations for related scientific articles.…”
Section: Experimental Data and Resultsmentioning
confidence: 99%
“…Han et al [21] used a weight-adjusted cosine similarity measure to classify several real data sets and showed that this is more effective than traditional approaches. Their weightadjusted kNN outperformed traditional kNN approaches, decision tree techniques and rule-based systems.…”
Section: Literature Reviewmentioning
confidence: 99%