Ivan-Alejandro Garcia-Ramirez scite author profile

Ivan-Alejandro Garcia-Ramirez

2Publications

1Citation Statement Received

60Citation Statements Given

How they've been cited

How they cite others

Affiliations

Center for Research and Advanced Studies of the National Polytechnic Institute, Instituto Politécnico Nacional

Publications

Order By: Most citations

A Novel Framework for Fast Feature Selection Based on Multi-Stage Correlation Measures

Garcia-Ramirez

Calderon-Mora

Méndez-Vázquez

et al. 2022

MAKE

View full text Add to dashboard Cite

Datasets with thousands of features represent a challenge for many of the existing learning methods because of the well known curse of dimensionality. Not only that, but the presence of irrelevant and redundant features on any dataset can degrade the performance of any model where training and inference is attempted. In addition, in large datasets, the manual management of features tends to be impractical. Therefore, the increasing interest of developing frameworks for the automatic discovery and removal of useless features through the literature of Machine Learning. This is the reason why, in this paper, we propose a novel framework for selecting relevant features in supervised datasets based on a cascade of methods where speed and precision are in mind. This framework consists of a novel combination of Approximated and Simulate Annealing versions of the Maximal Information Coefficient (MIC) to generalize the simple linear relation between features. This process is performed in a series of steps by applying the MIC algorithms and cutoff strategies to remove irrelevant and redundant features. The framework is also designed to achieve a balance between accuracy and speed. To test the performance of the proposed framework, a series of experiments are conducted on a large battery of datasets from SPECTF Heart to Sonar data. The results show the balance of accuracy and speed that the proposed framework can achieve.

show abstract

A Novel Tool For Fast Feature Selection

Garcia-Ramirez

Calderón

Méndez

et al. 2020

Preprint

View full text Add to dashboard Cite

Motivation: Datasets with high dimensionality represent a challenge to existing learning methods. The presence of irrelevant and redundant features in a dataset can degrade the performance of the models inferred from it. In large datasets, manual management of features tends to be impractical. Therefore, the development of automatic discovery techniques to remove useless features has attracted increasing interest. In this paper, we propose a novell framework to select relevant features in supervised datasets. Availability: This tool can be downloaded from https://github.com/ivangarcia88/selectionResults: This tool allow to identify relevant and remove redundant features, reducing computation time on training a machine learning model while improving the performance.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.