2009 IEEE International Conference on Information Reuse &Amp; Integration 2009
DOI: 10.1109/iri.2009.5211539
|View full text |Cite
|
Sign up to set email alerts
|

Visual integration tool for heterogeneous data type by unified vectorization

Abstract: Abstract-Data integration is the problem of combining data residing at different sources, and providing the user with a unified view of these data. One of the critical issues of data integration is the detection of similar entities based on the content. This complexity is due to three factors: the data type of the databases are heterogenous, the schema of databases are unfamiliar and heterogenous as well, and the amount of records is voluminous and time consuming to analyze. As solution to these problems we ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 10 publications
0
6
0
Order By: Relevance
“…The following list is not an exhaustive list of applications, but it provides an overview about the most recent areas of research where visualization became essential: a) Topic summarization: e.g., understanding newspaper articles, stories, reporting events, investigating crime reports, finding patterns in blogs, following the development of political campaigns, or observing topic trends in the bibliography of research approaches [7,3,25]; b) Visual Analysis of Social Networks: e.g., analyzing dynamic groups memberships in temporal social networks by using graphical representations [10,12,17,6,29]; c) Visual Clustering Analysis: e.g., using data mining techniques to find patterns in data to generate group of data based on (dis)similarity. Several visualization tools have been developed in this domain and gained great popularity, to mention some [21,2,28,5,30]; d) Semantic Visual Analysis: e.g., visual analysis of webpage/documents based on the semantic representation of text in a "semantic graph" [23,8,9,22,31], or exploring data in folksonomy systems based on a hierarchical semantic representation, "semantic cloud or tags" [11,14,24,23,4,15,16,22,26] …”
Section: Visual Analytic Applicationsmentioning
confidence: 99%
“…The following list is not an exhaustive list of applications, but it provides an overview about the most recent areas of research where visualization became essential: a) Topic summarization: e.g., understanding newspaper articles, stories, reporting events, investigating crime reports, finding patterns in blogs, following the development of political campaigns, or observing topic trends in the bibliography of research approaches [7,3,25]; b) Visual Analysis of Social Networks: e.g., analyzing dynamic groups memberships in temporal social networks by using graphical representations [10,12,17,6,29]; c) Visual Clustering Analysis: e.g., using data mining techniques to find patterns in data to generate group of data based on (dis)similarity. Several visualization tools have been developed in this domain and gained great popularity, to mention some [21,2,28,5,30]; d) Semantic Visual Analysis: e.g., visual analysis of webpage/documents based on the semantic representation of text in a "semantic graph" [23,8,9,22,31], or exploring data in folksonomy systems based on a hierarchical semantic representation, "semantic cloud or tags" [11,14,24,23,4,15,16,22,26] …”
Section: Visual Analytic Applicationsmentioning
confidence: 99%
“…It appears that when two data types, such as numerical and textual, are simultaneously processed by HDM, the use of unified vectorization (UV) leads to better convergent semantic clustering results [1]. In spite of good results, the development and the use of similar data weighting measures to represent these heterogeneous data types in a unified VSM matrix improves the clustering results [2]. Let us examine these pre-processing steps.…”
Section: Pre-processingmentioning
confidence: 99%
“…For example, the 3-grams of the term "hello" are: "hel", "ell", and "llo". More details regarding the tokenization and the vectorization processes can be found in [2].…”
Section: Tokenization Of the Datamentioning
confidence: 99%
See 2 more Smart Citations