Designing multi-label classifiers that maximize F measures: State of the art

Pillai, Ignazio; Fumera, Giorgio; Roli, Fabio

doi:10.1016/j.patcog.2016.08.008

Cited by 63 publications

(28 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Nowadays, the F 1 measure is widely used in most application areas of machine learning, not only in the binary scenario, but also in multiclass cases. In multiclass cases, researchers can employ the F 1 micro/macro averaging procedure [55–60], which can be even targeted for ad-hoc optimization [61].…”

Section: Introductionmentioning

confidence: 99%

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

2020

View full text Add to dashboard Cite

BackgroundTo evaluate binary classifications and their confusion matrices, scientific researchers can employ several statistical rates, accordingly to the goal of the experiment they are investigating. Despite being a crucial issue in machine learning, no widespread consensus has been reached on a unified elective chosen measure yet. Accuracy and F1 score computed on confusion matrices have been (and still are) among the most popular adopted metrics in binary classification tasks. However, these statistical measures can dangerously show overoptimistic inflated results, especially on imbalanced datasets.ResultsThe Matthews correlation coefficient (MCC), instead, is a more reliable statistical rate which produces a high score only if the prediction obtained good results in all of the four confusion matrix categories (true positives, false negatives, true negatives, and false positives), proportionally both to the size of positive elements and the size of negative elements in the dataset.ConclusionsIn this article, we show how MCC produces a more informative and truthful score in evaluating binary classifications than accuracy and F1 score, by first explaining the mathematical properties, and then the asset of MCC in six synthetic use cases and in a real genomics scenario. We believe that the Matthews correlation coefficient should be preferred to accuracy and F1 score in evaluating binary classification tasks by all scientific communities.

show abstract

Section: Introductionmentioning

confidence: 99%

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

2020

View full text Add to dashboard Cite

show abstract

“…Specifically, Otsu Fig. 13: Errors between the planned and flown paths The comparison results can be quantitatively evaluated via the F -measure, a compromise between recall and precision [43]. Let t p and t n be the correctly reported positive and negative results, whereas f p and f n be the falsely reported positive and negative results.…”

Section: Surface Inspection Resultsmentioning

confidence: 99%

System Architecture for Real-Time Surface Inspection Using Multiple UAVs

Hoang

Phung

Dinh

et al. 2020

IEEE Systems Journal

View full text Add to dashboard Cite

This paper presents a real-time control system for surface inspection using multiple unmanned aerial vehicles (UAVs). The UAVs are coordinated in a specific formation to collect data of the inspecting objects. The communication platform for data transmission is based on the Internet of Things (IoT). In the proposed architecture, the UAV formation is established via using the angle-encoded particle swarm optimisation to generate an inspecting path and redistribute it to each UAV where communication links are embedded with an IoT board for network and data processing capabilities. Data collected are transmitted in real time through the network to remote computational units. To detect potential damage or defects, an online image processing technique is proposed and implemented based on histograms. Extensive simulation, experiments and comparisons have been conducted to verify the validity and performance of the proposed system.

show abstract

“…Note that since our GO-CC predictions follow from MAP estimates, we do not expect they are able to optimize more elaborate performance metrics like the F-score. We note, however, that the design of optimal F-score classification algorithms remain a challenging computational problem even for the prediction of flat multiclass/multilabel categories 74 .…”

Section: Methodsmentioning

confidence: 99%

Consistent prediction of GO protein localization

Spetale

Arce

Krsticevic

et al. 2018

Sci Rep

View full text Add to dashboard Cite

The GO-Cellular Component (GO-CC) ontology provides a controlled vocabulary for the consistent description of the subcellular compartments or macromolecular complexes where proteins may act. Current machine learning-based methods used for the automated GO-CC annotation of proteins suffer from the inconsistency of individual GO-CC term predictions. Here, we present FGGA-CC+, a class of hierarchical graph-based classifiers for the consistent GO-CC annotation of protein coding genes at the subcellular compartment or macromolecular complex levels. Aiming to boost the accuracy of GO-CC predictions, we make use of the protein localization knowledge in the GO-Biological Process (GO-BP) annotations to boost the accuracy of GO-CC prediction. As a result, FGGA-CC+ classifiers are built from annotation data in both the GO-CC and GO-BP ontologies. Due to their graph-based design, FGGA-CC+ classifiers are fully interpretable and their predictions amenable to expert analysis. Promising results on protein annotation data from five model organisms were obtained. Additionally, successful validation results in the annotation of a challenging subset of tandem duplicated genes in the tomato non-model organism were accomplished. Overall, these results suggest that FGGA-CC+ classifiers can indeed be useful for satisfying the huge demand of GO-CC annotation arising from ubiquitous high throughout sequencing and proteomic projects.

show abstract

Designing multi-label classifiers that maximize F measures: State of the art

Cited by 63 publications

References 15 publications

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

System Architecture for Real-Time Surface Inspection Using Multiple UAVs

Consistent prediction of GO protein localization

Contact Info

Product

Resources

About