2003
DOI: 10.1007/978-3-540-24580-3_50
|View full text |Cite
|
Sign up to set email alerts
|

A Preliminary Approach to the Multilabel Classification Problem of Portuguese Juridical Documents

Abstract: Abstract. Portuguese juridical documents from Supreme Courts and the Attorney General's Office are manually classified by juridical experts into a set of classes belonging to a taxonomy of concepts. In this paper, a preliminary approach to develop techniques to automatically classify these juridical documents, is proposed. As basic strategy, the integration of natural language processing techniques with machine learning ones is used. Support Vector Machines (SVM) are used as learning algorithm and the obtained… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
25
0
1

Year Published

2004
2004
2022
2022

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 46 publications
(26 citation statements)
references
References 4 publications
0
25
0
1
Order By: Relevance
“…Binary relevance [9], a straightforward problem transformation approach, predicts positively the label sets of an unknown instance by N binary classifiers. Independent classifiers are commonly individual selection [10,11] and fusing selection [12]. This one-against-all strategy has been criticized to ignore the correlations among labels [13].…”
Section: Introductionmentioning
confidence: 99%
“…Binary relevance [9], a straightforward problem transformation approach, predicts positively the label sets of an unknown instance by N binary classifiers. Independent classifiers are commonly individual selection [10,11] and fusing selection [12]. This one-against-all strategy has been criticized to ignore the correlations among labels [13].…”
Section: Introductionmentioning
confidence: 99%
“…In [6], SVM performance is compared with other Machine Learning algorithms and in [7] a thorough study on some preprocessing techniques (feature reduction, feature subset selection and term weighting) is made over European Portuguese and English datasets. The impact of using linguistic information on the preprocessing phase is reported in [15] over a Brazilian dataset.…”
Section: Introductionmentioning
confidence: 99%
“…On previous work, we evaluated SVM performance compared with other Machine Learning algorithms [2] and performed a preliminary study on the impact of using linguistic information to reduce the number of features [3]. In this paper, we extend that work using IR techniques to weight and normalise features.…”
Section: Introductionmentioning
confidence: 99%