2019 IEEE 31st International Conference on Tools With Artificial Intelligence (ICTAI) 2019
DOI: 10.1109/ictai.2019.00131
|View full text |Cite
|
Sign up to set email alerts
|

Metric Learning from Imbalanced Data

Abstract: A key element of any machine learning algorithm is the use of a function that measures the dis/similarity between data points. Given a task, such a function can be optimized with a metric learning algorithm. Although this research field has received a lot of attention during the past decade, very few approaches have focused on learning a metric in an imbalanced scenario where the number of positive examples is much smaller than the negatives. Here, we address this challenging task by designing a new Mahalanobi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(14 citation statements)
references
References 29 publications
0
14
0
Order By: Relevance
“…One of the issues that faces building analytic models for crash severity, is the imbalance of data [18] where the occurrence fatality which is infrequent or rare event compared to no or minor injury accidents. Due to the extreme imbalance of accident data most algorithms will not produce good predictive models and perform poorly will likely missclassify the fatal accidents as it is not prevalent in the dataset [19]. For imbalanced data sets such as traffic accidents data, sampling techniques can help improve classifier accuracy [19].…”
Section: B Data Preprocessingmentioning
confidence: 99%
See 1 more Smart Citation
“…One of the issues that faces building analytic models for crash severity, is the imbalance of data [18] where the occurrence fatality which is infrequent or rare event compared to no or minor injury accidents. Due to the extreme imbalance of accident data most algorithms will not produce good predictive models and perform poorly will likely missclassify the fatal accidents as it is not prevalent in the dataset [19]. For imbalanced data sets such as traffic accidents data, sampling techniques can help improve classifier accuracy [19].…”
Section: B Data Preprocessingmentioning
confidence: 99%
“…Due to the extreme imbalance of accident data most algorithms will not produce good predictive models and perform poorly will likely missclassify the fatal accidents as it is not prevalent in the dataset [19]. For imbalanced data sets such as traffic accidents data, sampling techniques can help improve classifier accuracy [19]. Two sampling techniques; undersampling and oversampling techniques will be discussed below.…”
Section: B Data Preprocessingmentioning
confidence: 99%
“…Imbalanced Metric Learning (IML) (Gautheron et al, 2019) is a spectral metric learning method which handles imbalanced classes by further decomposition of the similar set S and dissimilar set D. Suppose the dataset is composed of two classes c 0 and c 1 . Let S 0 and S 1 denote the similarity sets for classes c 0 and c 1 , respectively.…”
Section: Imbalanced Metric Learning (Iml)mentioning
confidence: 99%
“…Let S 0 and S 1 denote the similarity sets for classes c 0 and c 1 , respectively. We define pairs of points taken randomly from these sets to have similarity and dissimilarity sets (Gautheron et al, 2019):…”
Section: Imbalanced Metric Learning (Iml)mentioning
confidence: 99%
“…Consequently, some researchers capture the distance between samples by finding a transformation that can increase the distance between dissimilar samples and reduce the distance of similar samples (Köstinger et al, 2012 ). When training on the imbalance datasets, metric learning also suffers from imbalance problems (Gautheron et al, 2019 ). It needs to be modified before training on the imbalance datasets.…”
Section: Introductionmentioning
confidence: 99%