2015
DOI: 10.1007/s10994-015-5491-2

Soft-max boosting

Abstract: The standard multi-class classification risk, based on the binary loss, is rarely minimized directly. This is due to (1) the lack of convexity and (2) the lack of smoothness (and even continuity). The classic approach consists in minimizing a convex surrogate instead. In this paper, we propose to replace the usually considered deterministic decision rule by a stochastic one, which allows obtaining a smooth risk (generalizing the expected binary loss, and more generally the cost-sensitive loss). Practically, th…
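As a sketch of the construction the abstract describes, with the soft-max form of the stochastic rule assumed here for illustration: the deterministic rule predicts the arg-max score, whose risk under the binary loss

\[
  R(f) = \mathbb{E}_{(x,y)}\big[\mathbf{1}\{\arg\max_k f_k(x) \neq y\}\big]
\]

is neither convex nor smooth; drawing the predicted label from a soft-max distribution instead,

\[
  \pi_f(k \mid x) = \frac{e^{f_k(x)}}{\sum_j e^{f_j(x)}},
  \qquad
  R_s(f) = \mathbb{E}_{(x,y)}\big[1 - \pi_f(y \mid x)\big],
\]

yields a risk that is smooth in f and equals the expected binary loss of the stochastic rule.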

Cited by 6 publications (4 citation statements)
References 20 publications
“…The algorithm constructs an additive expansion of the objective function by minimizing a loss function, a technique that it shares with gradient boosting [39]. In the context of multi-class classification problems, XGBoost employs a variant of gradient boosting called Softmax Boosting to optimize a softmax cross-entropy loss function [40]. The softmax function is utilized to transform the model outputs into a probability distribution over the classes.…”
Section: Multi-class Classification Using XGBoost (mentioning)
confidence: 99%
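As context for the statement above, a minimal sketch of a softmax cross-entropy objective as it plugs into stage-wise gradient boosting (function names and the NumPy encoding are illustrative, not XGBoost's internal API):

import numpy as np

def softmax(scores):
    # scores: (n, K) raw margins accumulated by the additive ensemble
    z = scores - scores.max(axis=1, keepdims=True)  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def softmax_cross_entropy(scores, y):
    # y: (n,) integer class labels in {0, ..., K-1}
    p = softmax(scores)
    n = len(y)
    loss = -np.mean(np.log(p[np.arange(n), y]))
    grad = (p - np.eye(scores.shape[1])[y]) / n  # dL/dscores = softmax(scores) - one_hot(y)
    return loss, grad

Each boosting round would fit the next set of trees to the negative of this gradient, one column per class; the softmax function is what turns the raw ensemble outputs into the probability distribution the quote refers to.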
“…These works demonstrate a lack of noise tolerance for boosting and empirical risk minimization based on convex losses, and suggest that any approach based on convex risk minimization will require modification of the loss, [...]"
- [9] "For example, the random noise (Long and Servedio 2010) defeats all convex potential boosters [...]"
- [24] "Long and Servedio (2010) proved that any convex potential loss is not robust to uniform or symmetric label noise."
- [27] "We previously [23] showed that any boosting algorithm that works by stagewise minimization of a convex "potential function" cannot tolerate random classification noise"
- [41] "However, the convex loss functions are shown to be prone to mistakes when outliers exist [25]."
- [85] "[...] However, Long and Servedio (2010) pointed out that any boosting algorithm with convex loss functions is highly susceptible to a random label noise model."…”
Section: What the Papers Say (mentioning)
confidence: 99%
“… - [48] "This is as opposed to most boosting algorithms that are highly susceptible to outliers [24]."
- [56] "Moreover, in the case of boosting, it has been shown that convex boosters are necessarily sensitive to noise (Long and Servedio 2010 [...]"
- [25] "Ostensibly, this result establishes that convex losses are not robust to symmetric label noise, and motivates using non-convex losses [40,31,17,15,30]."
- [77] "Interestingly, (Long and Servedio, 2010) established a lower bound against potential-based convex boosting techniques in the presence of RCN."…”
Section: What the Papers Say (mentioning)
confidence: 99%
“…By functional gradient we mean the derivative in the Fréchet sense, for the appropriate Hilbert space, that is, the set of equivalence classes of functions g ∈ ℝ^(S×A) such that Σ_s ν(s) Σ_a g(s,a)² is finite, equipped with the inner product ⟨g₁, g₂⟩ = Σ_s ν(s) Σ_a g₁(s,a) g₂(s,a). See for example (Geist, 2015) for more details on this type of space.…”
Section: Connection to Conservative Policy Iteration (unclassified)
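For a finite state-action space, the ν-weighted inner product above reduces to a plain weighted sum; a minimal sketch, assuming ν and the functions g are encoded as NumPy arrays (an illustrative choice, not the cited paper's notation):

import numpy as np

def inner_product(nu, g1, g2):
    # nu: (S,) state weights ν(s); g1, g2: (S, A) functions on state-action pairs
    return float(np.sum(nu[:, None] * g1 * g2))

def norm_sq(nu, g):
    # finiteness of ⟨g, g⟩ is the membership condition defining the Hilbert space
    return inner_product(nu, g, g)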