As a data preprocessing step, feature selection has proven effective in preparing high-dimensional data for many machine learning tasks. The proliferation of high-dimensional, large-volume data, however, poses major challenges to existing feature-selection techniques, e.g., computational complexity and instability on noisy data. This paper introduces a novel neural network-based feature selection architecture, dubbed Attention-based Feature Selection (AFS). AFS consists of two detachable modules: an attention module for feature weight generation and a learning module for problem modeling. The attention module formulates the correlation between features and the supervision target as a binary classification problem, supported by a shallow attention net for each feature. Feature weights are generated from the distribution of each feature's selection pattern, which is adjusted by backpropagation during training. The detachable structure allows existing off-the-shelf models to be reused directly, which greatly reduces training time, training-data demands, and expertise requirements. A hybrid initialization method is also introduced to boost selection accuracy on datasets with too few samples for feature weight generation. Experimental results show that AFS achieves the best accuracy and stability in comparison to several state-of-the-art feature selection algorithms on MNIST, noisy MNIST, and several small-sample datasets.
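To make the architecture described above concrete, the following is a minimal PyTorch sketch of the AFS idea, not the authors' reference implementation: one shallow attention net per feature emits a two-way (select/discard) distribution, the "select" probability re-weights that feature, and a stand-in learning module consumes the re-weighted input. All names and hyperparameters (AttentionModule, hidden_dim, the MLP learner) are illustrative assumptions.

```python
# Illustrative sketch of attention-based feature selection (AFS-style).
# Hypothetical names and hyperparameters; not the paper's reference code.
import torch
import torch.nn as nn

class AttentionModule(nn.Module):
    """One shallow attention net per feature; each emits select/discard
    logits, and the softmax 'select' probability becomes that feature's
    weight."""
    def __init__(self, n_features: int, hidden_dim: int = 16):
        super().__init__()
        self.nets = nn.ModuleList(
            nn.Sequential(
                nn.Linear(n_features, hidden_dim),
                nn.Tanh(),
                nn.Linear(hidden_dim, 2),  # binary selection logits
            )
            for _ in range(n_features)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Weight for feature i = P(select) under that feature's softmax.
        weights = [net(x).softmax(dim=-1)[:, :1] for net in self.nets]
        return torch.cat(weights, dim=-1)  # shape: (batch, n_features)

class AFS(nn.Module):
    """Detachable design: the learning module below is a stand-in MLP and
    could be replaced by any off-the-shelf model that accepts re-weighted
    features."""
    def __init__(self, n_features: int, n_classes: int):
        super().__init__()
        self.attention = AttentionModule(n_features)
        self.learner = nn.Sequential(
            nn.Linear(n_features, 64), nn.ReLU(),
            nn.Linear(64, n_classes),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        a = self.attention(x)
        # Backpropagation through the joint loss adjusts the per-feature
        # selection patterns, which is how the weights are learned.
        return self.learner(x * a)

# Usage: train end to end, then rank features by their mean attention weight.
model = AFS(n_features=784, n_classes=10)
x = torch.randn(32, 784)                         # dummy batch
logits = model(x)
feature_scores = model.attention(x).mean(dim=0)  # average weight per feature
top_k = feature_scores.topk(50).indices          # indices of selected features
```

After training, the attention module can be detached and its averaged weights used as a feature ranking for any downstream model, which reflects the detachable, reuse-friendly structure the abstract describes.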