2015
DOI: 10.1007/978-3-319-21024-7_17

Applying Clustering Analysis to Heterogeneous Data Using Similarity Matrix Fusion (SMF)

Abstract: We define a heterogeneous dataset as a set of complex objects, that is, those defined by several data types including structured data, images, free text or time series. We envisage this could be extensible to other data types. There are currently research gaps in how to deal with such complex data. In our previous work, we have proposed an intermediary fusion approach called SMF which produces a pairwise matrix of distances between heterogeneous objects by fusing the distances between the individual data types…
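
The fusion idea in the abstract can be illustrated with a minimal sketch. It assumes each data type already has its own pairwise distance matrix over the same objects, and it fuses them with a normalised weighted average. The function name `fuse_distance_matrices`, the max-scaling step, and the weights are hypothetical choices made for illustration; the abstract is truncated and does not specify how SMF actually combines the matrices.

```python
import numpy as np


def fuse_distance_matrices(matrices, weights=None):
    """Fuse per-data-type distance matrices into one pairwise matrix.

    Illustrative only: a weighted average of max-scaled matrices,
    not necessarily the fusion rule used by SMF.
    """
    # Stack the k (n x n) matrices into a single (k, n, n) array.
    stacked = np.stack([np.asarray(m, dtype=float) for m in matrices])
    if weights is None:
        weights = np.ones(len(matrices))
    weights = np.asarray(weights, dtype=float)

    # Rescale each matrix to [0, 1] so no data type dominates by scale.
    maxima = stacked.max(axis=(1, 2), keepdims=True)
    maxima[maxima == 0] = 1.0  # avoid dividing by zero for all-zero matrices
    normalised = stacked / maxima

    # Weighted average across data types, giving one (n x n) matrix.
    return np.tensordot(weights / weights.sum(), normalised, axes=1)


# Hypothetical distances between three objects for two data types.
structured = [[0, 2, 4], [2, 0, 2], [4, 2, 0]]
free_text = [[0.0, 0.5, 0.1], [0.5, 0.0, 0.4], [0.1, 0.4, 0.0]]
fused = fuse_distance_matrices([structured, free_text])
```

The fused matrix could then be passed to any clustering method that accepts precomputed distances, which is the role the abstract assigns to the pairwise matrix.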

Cited by 9 publications (7 citation statements)
References 15 publications

“…Aalaa et al [10] applied the machine learning clustering method on heterogeneous (not necessarily multimedia though) datasets. Due to the fact that there were not many heterogeneous datasets publicly available, they created their own heterogeneous datasets, which contained different types of media.…”
Section: Related Work (mentioning)
confidence: 99%
“…Additional analysis of the shape of the curves represented (for example, clustering of biomarker trends) is also possible using this framework and some work has already been done in this area using fusion methods [57]. …”
Section: Knowledge Discovery Support (mentioning)
confidence: 99%
“…Important notation used in this paper from this point is summarised in Table 1. A definition of our problem has been given in (Mojahed and De La Iglesia, 2014; Mojahed et al, 2015) but we reproduce it here to aid the reader in following the discussion. The formal definition of a heterogeneous dataset, H, is a set of objects such that H = {O_1, O_2, .…”
Section: Problem Statement (mentioning)
confidence: 99%
“…For this grouping we used death indicators after conducting some changes on the values of the corresponding attribute in the data preparation stage (for details see Mojahed et al (2015)). …”
Section: Mortality Grouping (mentioning)
confidence: 99%