Image captioning is a popular topic in the domains of computer vision and natural language processing (NLP). Recent advancements in deep learning (DL) models have enabled the improvement of the overall performance of the image captioning approach. This study develops a metaheuristic optimization with a deep learning-enabled automated image captioning technique (MODLE-AICT). The proposed MODLE-AICT model focuses on the generation of effective captions to the input images by using two processes involving encoding unit and decoding unit. Initially, at the encoding part, the salp swarm algorithm (SSA), with a HybridNet model, is utilized to generate effectual input image representation using fixed-length vectors, showing the novelty of the work. Moreover, the decoding part includes a bidirectional gated recurrent unit (BiGRU) model used to generate descriptive sentences. The inclusion of an SSA-based hyperparameter optimizer helps in attaining effectual performance. For inspecting the enhanced performance of the MODLE-AICT model, a series of simulations were carried out, and the results are examined under several aspects. The experimental values suggested the betterment of the MODLE-AICT model over recent approaches.
A brain tumor (BT) is an abnormal development of brain cells that causes damage to the nerves and blood vessels. An accurate and early diagnosis of BT is important to prevent future complications. Precise segmentation of the BT provides a basis for surgical and planning treatment to physicians. Manual detection utilizing MRI images is computationally difficult. Due to significant variation in their structure and location, viz., ambiguous boundaries and irregular shapes, computerized tumor diagnosis is still a challenging task. The application of a convolutional neural network (CNN) helps radiotherapists categorize the types of BT from magnetic resonance images (MRI). This study designs an evolutional algorithm with a deep learning-driven brain tumor MRI image classification (EADL-BTMIC) model. The presented EADL-BTMIC model aims to accurately recognize and categorize MRI images to identify BT. The EADL-BTMIC model primarily applies bilateral filtering (BF) based noise removal and skull stripping as a pre-processing stage. In addition, the morphological segmentation process is carried out to determine the affected regions in the image. Moreover, sooty tern optimization (STO) with the Xception model is exploited for feature extraction. Furthermore, the attention-based long short-term memory (ALSTM) technique is exploited for the classification of BT into distinct classes. To portray the increased performance of the EADL-BTMIC model, a series of simulations were carried out on the benchmark dataset. The experimental outcomes highlighted the enhancements of the EADL-BTMIC model over recent models.
Biomedical engineering is the application of the principles and problem-solving methods of engineering to biology along with medicine. Computation intelligence is the study of design of intelligent agents which are systems acting perceptively. The computation intelligence paradigm offers more advantages to the enhancement and maintenance of the field of biomedical engineering. Liver cancer is the major reason of mortality worldwide. Earlier-stage diagnosis and treatment might increase the survival rate of liver cancer patients. Manual recognition of the cancer tissue is a time-consuming and difficult task. Hence, a computer-aided diagnosis (CAD) is employed in decision making procedures for accurate diagnosis and effective treatment. In contrast to classical image-dependent “semantic” feature evaluation from human expertise, deep learning techniques could learn feature representation automatically from sample images using convolutional neural network (CNN). This study introduces a Hybrid Rider Optimization with Deep Learning Driven Biomedical Liver Cancer Detection and Classification (HRO-DLBLCC) model. The proposed HRO-DLBLCC model majorly focuses on the identification of liver cancer in the medical images. To do so, the proposed HRO-DLBLCC model employs preprocessing in two stages, namely, Gabor filtering (GF) based noise removal and watershed transform based segmentation. In addition, the proposed HRO-DLBLCC model involves NAdam optimizer with DenseNet-201 based feature extractor to generate an optimal set of feature vectors. Finally, the HRO algorithm with recurrent neural network–long short-term memory (RNN-LSTM) model is applied for liver cancer classification, in which the hyperparameters of the RNN-LSTM model are tuned by the use of HRO algorithm. The HRO-DLBLCC model is experimentally validated and compared with existing models. The experimental results assured the promising performance of the HRO-DLBLCC model over recent approaches.
Infinite Gaussian mixture process is a model that computes the Gaussian mixture parameters with order. This process is a probability density distribution with adequate training data that can converge to the input density curve. In this paper, we propose a data mining model namely Beta hierarchical distribution that can solve axial data modeling. A novel hierarchical Two-Hyper-Parameter Poisson stochastic process is developed to solve grouped data modelling. The solution uses data mining techniques to link datum in groups by linking their components. The learning techniques are novel presentations of Gaussian modelling that use prior knowledge of the representation hyper-parameters and approximate them in a closed form. Experiments are performed on axial data modeling of Arabic Script classification and depict the effectiveness of the proposed method using a hand written benchmark dataset which contains complex handwritten Arabic patterns. Experiments are also performed on the application of facial expression recognition and prove the accuracy of the proposed method using a benchmark dataset which contains eight different facial expressions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.