2nd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2022) 2022
DOI: 10.1117/12.2641848

Researches advanced in the application of reinforcement learning

Abstract: Reinforcement learning has always been a research hotspot in the machine learning community, which aims to model the process of investigating the interaction between agents and the environment, making sequential decisions, optimizing strategies, and maximizing cumulative returns. With the rapid development of artificial intelligence technology, the huge research value and application potential of reinforcement learning have gradually become prominent. In this paper, we first introduce the development backgroun…

citations
Cited by 2 publications
(2 citation statements)
references
References 12 publications
“…[74,75] BERT (Bidirectional Encoder Representations from Transformers): A pre-trained natural language processing model based on transformer architecture. BERT is particularly effective in understanding the context of words in a sentence and is used for various language-related tasks.…”
Section: LSTM (Long Short-Term Memory)
Citation type: mentioning
confidence: 99%
“…Reinforcement Learning (RL): An area of machine learning where an agent learns to make decisions by interacting with an environment. The agent receives feedback in the form of rewards or penalties, allowing it to learn optimal strategies over time [73,74].…”
Section: Artificial Intelligence (AI)
Citation type: mentioning
confidence: 99%