2009
DOI: 10.1016/j.knosys.2008.06.002

Rich document representation and classification: An analysis

Cited by 27 publications
(13 citation statements)
References 19 publications
“…The vector space model (VSM) is one of the simplest and most common models for representing documents and is widely used in document classification [75]. In this model, a document is typically represented as bag of words where each word/term is represented as a dimension in a vector space and independent to other terms in the same document.…”
Section: Background and Motivation
mentioning
confidence: 99%
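
The bag-of-words view described in the statement above can be illustrated with a minimal sketch. It assumes whitespace tokenization, lowercasing, and raw term counts as weights; the function names `build_vocabulary` and `to_vector` and the two sample documents are hypothetical and are not taken from the cited papers.

```python
from collections import Counter


def build_vocabulary(docs):
    """Assign one vector dimension to each distinct term (sorted for determinism)."""
    terms = sorted({term for doc in docs for term in doc.lower().split()})
    return {term: index for index, term in enumerate(terms)}


def to_vector(doc, vocab):
    """Represent a document as raw term counts; word order is discarded."""
    counts = Counter(doc.lower().split())
    vector = [0] * len(vocab)
    for term, count in counts.items():
        if term in vocab:  # terms outside the vocabulary are ignored
            vector[vocab[term]] = count
    return vector


# Hypothetical sample documents, used only for illustration.
docs = ["the cat sat on the mat", "the dog sat"]
vocab = build_vocabulary(docs)
vectors = [to_vector(doc, vocab) for doc in docs]
```

Because each term is counted independently of its neighbours, any reordering of the same words yields the same vector, which is the term-independence assumption the statement refers to.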
“…There are a number of representation techniques that have evolved over through research work done by various researchers in diverse domains. The various data representation models that have been proposed: Bag of Word (BoW) or Vector Space Model (VSM), term weighting approach [4][5], n-grams and n-multigrams approach [6], n-gram graph model [7], keywords or key-phrases approach, Latent Semantic Indexing (LSI) [8], Concise Semantic Analysis (CSA) [9], Rich Data Representation (RDR) [10].…”
Section: Data Representation Models
mentioning
confidence: 99%
“…The problem of efficiently capturing the dependence between terms so that it can be incorporated into the representation of documents is still a major research challenge and has attracted attention recently (Keikha et al., 2009; Figueiredo et al., 2011; Farahat and Kamel, 2011; Kalogeratos and Likas, 2012; Cheng et al., 2013b; Gao et al., 2013). In addition, several document representation models have been proposed in the literature to capture the dependence between terms, as discussed in Section 3.1.3.…”
Section: Topic Extraction with Dependent Terms for Representation…
unclassified
“…Once the documents have been structured, conventional data mining algorithms can be applied to extract knowledge and information through patterns detected across the whole document collection (Aggarwal and Zhai, 2012; Rios, 2013). In this context, the quality of the results obtained with automatic approaches for extracting knowledge from text is strongly tied to the quality of the attributes used to represent the document collection (Shafiei et al., 2007; Keikha et al., 2009; Aggarwal and Zhai, 2012).…”
Section: Introduction
unclassified