2015
DOI: 10.14257/ijmue.2015.10.5.24
|View full text |Cite
|
Sign up to set email alerts
|

A Graph Theoretical Preprocessing Step for Text Compression

Abstract: This paper presents CSGM 2 , a text preprocessing technique for compression purposes. It converts the original text into a word net (graph representation) and can retain the detailed contextual information such as word proximity. Specific directed graph is proposed to model this word net where words are stored in vertices and edges represent word transitions. The word net is fully capable of holding the natural word order in the original text and hence can be used directly for encoding purposes.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…It seems to be obvious from the above definition that every binary relation on a finite set can be represented by a digraph without parallel edges. The composite graph model (CGM) (22) which was modeled by this author in the year 2012 which represents a web document as a directed and completely labeled graph. The CGM was developed with help of the Tag Sensitive Graph Model (TSGM) (22) and Context-Sensitive Graph model (CSGM) (22) .…”
Section: Web Document Content Mining Processmentioning
confidence: 99%
See 1 more Smart Citation
“…It seems to be obvious from the above definition that every binary relation on a finite set can be represented by a digraph without parallel edges. The composite graph model (CGM) (22) which was modeled by this author in the year 2012 which represents a web document as a directed and completely labeled graph. The CGM was developed with help of the Tag Sensitive Graph Model (TSGM) (22) and Context-Sensitive Graph model (CSGM) (22) .…”
Section: Web Document Content Mining Processmentioning
confidence: 99%
“…The composite graph model (CGM) (22) which was modeled by this author in the year 2012 which represents a web document as a directed and completely labeled graph. The CGM was developed with help of the Tag Sensitive Graph Model (TSGM) (22) and Context-Sensitive Graph model (CSGM) (22) . In the composite graph representation, we are using the TSGM to represent three sections of a general web page namely head, link and address.…”
Section: Web Document Content Mining Processmentioning
confidence: 99%