2020
DOI: 10.1109/access.2020.3038605
|View full text |Cite
|
Sign up to set email alerts
|

A Gentle Introduction to Reinforcement Learning and its Application in Different Fields

Abstract: Due to the recent progress in Deep Neural Networks, Reinforcement Learning (RL) has become one of the most important and useful technology. It is a learning method where a software agent interacts with an unknown environment, selects actions, and progressively discovers the environment dynamics. RL has been effectively applied in many important areas of real life. This article intends to provide an in-depth introduction of the Markov Decision Process, RL and its algorithms. Moreover, we present a literature re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
64
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
2
1

Relationship

3
6

Authors

Journals

citations
Cited by 142 publications
(64 citation statements)
references
References 295 publications
(282 reference statements)
0
64
0
Order By: Relevance
“…This case is regularly encountered in wireless optimization problems, which have too many local optima. [1], [12], [15], [16], [21], [23], [24], [67], [68]. Next generation wireless networks will generate a tremendous amount of data related to network statistics, such as user traffic, channel occupancy, channel quality, etc.…”
Section: Limitation Of Conventional Rram Techniquesmentioning
confidence: 99%
See 1 more Smart Citation
“…This case is regularly encountered in wireless optimization problems, which have too many local optima. [1], [12], [15], [16], [21], [23], [24], [67], [68]. Next generation wireless networks will generate a tremendous amount of data related to network statistics, such as user traffic, channel occupancy, channel quality, etc.…”
Section: Limitation Of Conventional Rram Techniquesmentioning
confidence: 99%
“…To overcome this, GANs are utilized, which generate large amounts of realistic datasets synthetically by expanding the available limited amounts of real-time datasets. From a DRL perspective, GANs-generated synthetic data is more effective and reliable than traditional augmentation methods [67]. This is because DRL agents will be exposed to various extreme challenging and practical situations by merging the realistic and synthetic data, enabling DRL models to be trained on unpredicted and rare events.…”
Section: Future Research Directionsmentioning
confidence: 99%
“…Recently, the development in computing technology and the introduction of new machine learning algorithms e.g. reinforcement learning [23], neural network [21] the goal of Artificial Intelligence (AI) has become a step closer. AI has important application in diverse fields including:healthcare [12], [22], robotics and autonomous control, vision enhancing method for low vision impairments [19], natural language processing, dynamic normative environments [31], risk management [26], intelligent environments [10], games and self-organized system [11], ambient assisted living techniques to improve the quality of life of elderly [13], Social humanoid robot [9] can help to monitor indoor environmental quality [29] and distributed fuzzy system able to infer in real-time critical situations [30].…”
Section: Introductionmentioning
confidence: 99%
“…Recently, the development in computing technology and the introduction of new machine learning algorithms e.g. reinforcement learning [22], text mining the goal of Artificial Intelligence (AI) has become a step closer. AI has important application in diverse fields including:healthcare [5]and [21], robotics and autonomous control [26] and [2],dynamic normative environments [28], ambient assisted living techniques for improvement in the quality of life of elder persons [8], drug identification [20], intelligent environments [3], games and self-organized system [4], vision enhancing method for low vision impairments [19] scheduling and management and configuration of resources, distributed fuzzy system for inferring in real-time critical situations [27], risk management [23] and computer vision.…”
Section: Introductionmentioning
confidence: 99%