Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing

Shen, Wei; He, Xiaonan; Zhang, Chuheng; Ni, Qiang; Dou, Wanchun; Wang, Yan

doi:10.1145/3340531.3411913

Cited by 15 publications

(3 citation statements)

References 31 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Contrastive learning [29] is another way to accelerate feature representation learning with auxiliary self-supervised learning tasks [19]. Some other algorithms utilise auxiliary tasks of explicit predictive models, such as predicting the forward model [16][17][18], predicting the backward model [30], predicting the reward function [31] and introducing other prior knowledge about the model [32,33]. In order to extract task-relevant low-dimensional features from original images.…”

Section: Feature Representation For Deep Reinforcement Learningmentioning

confidence: 99%

Deep reinforcement learning using least‐squares truncated temporal‐difference

Ren

Lan

et al. 2023

CAAI Trans on Intel Tech

View full text Add to dashboard Cite

Policy evaluation (PE) is a critical sub‐problem in reinforcement learning, which estimates the value function for a given policy and can be used for policy improvement. However, there still exist some limitations in current PE methods, such as low sample efficiency and local convergence, especially on complex tasks. In this study, a novel PE algorithm called Least‐Squares Truncated Temporal‐Difference learning (LST2D) is proposed. In LST2D, an adaptive truncation mechanism is designed, which effectively takes advantage of the fast convergence property of Least‐Squares Temporal Difference learning and the asymptotic convergence property of Temporal Difference learning (TD). Then, two feature pre‐training methods are utilised to improve the approximation ability of LST2D. Furthermore, an Actor‐Critic algorithm based on LST2D and pre‐trained feature representations (ACLPF) is proposed, where LST2D is integrated into the critic network to improve learning‐prediction efficiency. Comprehensive simulation studies were conducted on four robotic tasks, and the corresponding results illustrate the effectiveness of LST2D. The proposed ACLPF algorithm outperformed DQN, ACER and PPO in terms of sample efficiency and stability, which demonstrated that LST2D can be applied to online learning control problems by incorporating it into the actor‐critic architecture.

show abstract

Section: Feature Representation For Deep Reinforcement Learningmentioning

confidence: 99%

Deep reinforcement learning using least‐squares truncated temporal‐difference

Ren

Lan

et al. 2023

CAAI Trans on Intel Tech

View full text Add to dashboard Cite

show abstract

“…In view of the different goals in different mobile crowdsourcing systems, Shen et al [26] designed a participant selection algorithm, which was applied to different mobile crowdsourcing systems to achieve multiple goals and formulate the participant selection problem as a reinforcement learning problem.…”

Section: Mobile Crowdsourcingmentioning

confidence: 99%

Deep Learning for Mobile Crowdsourcing Techniques, Methods, and Challenges: A Survey

Liu

Zhong

Xie

et al. 2021

Mobile Information Systems

View full text Add to dashboard Cite

With the ever-increasing popularity of mobile computing technology and the wide adoption of outsourcing strategy in labour-intensive industrial domains, mobile crowdsourcing has recently emerged as a promising resolution for solving complex computational tasks with quick response requirements. However, the complexity of a mobile crowdsourcing task makes it hard to pursue an optimal resolution with limited computing resources, as well as various task constraints. In this situation, deep learning has provided a promising way to pursue such an optimal resolution by training a set of optimal parameters. In the past decades, many researchers have devoted themselves to this hot topic and brought various cutting-edge resolutions. In view of this, we review the current research status of deep learning for mobile crowdsourcing from the perspectives of techniques, methods, and challenges. Finally, we list a group of remaining challenges that call for an intensive study in future research.

show abstract

“…(3 and 4) Using features at different historical timepoints is a common practice in statistical learning, especially in time-series modelling (Christ et al, 2018). Lastly, predicting future labels as auxiliary tasks can help in learning (Caruana et al, 1996;Cooper et al, 2005;Trinh et al, 2018;Zhu et al, 2020;Shen et al, 2020). We propose using historical and future (up to four talkturns ago or later) target labels as auxiliary targets.…”

Section: Related Work and Hypothesesmentioning

confidence: 99%

Transfer Learning in Conversational Analysis through Reusing Preprocessing Data as Supervisors

Kim¹,

Liu²,

Yacef³

2021

Preprint

View full text Add to dashboard Cite

Conversational analysis systems are trained using noisy human labels and often require heavy preprocessing during multi-modal feature extraction. Using noisy labels in single-task learning increases the risk of over-fitting. Auxiliary tasks could improve the performance of the primary task learning during the same training -this approach sits in the intersection of transfer learning and multi-task learning (MTL). In this paper, we explore how the preprocessed data used for feature engineering can be re-used as auxiliary tasks, thereby promoting the productive use of data. Our main contributions are: (1) the identification of sixteen beneficially auxiliary tasks, (2) studying the method of distributing learning capacity between the primary and auxiliary tasks, and (3) studying the relative supervision hierarchy between the primary and auxiliary tasks. Extensive experiments on IEMOCAP and SE-MAINE data validate the improvements over single-task approaches, and suggest that it may generalize across multiple primary tasks.

show abstract

Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing

Cited by 15 publications

References 31 publications

Deep reinforcement learning using least‐squares truncated temporal‐difference

Deep reinforcement learning using least‐squares truncated temporal‐difference

Deep Learning for Mobile Crowdsourcing Techniques, Methods, and Challenges: A Survey

Transfer Learning in Conversational Analysis through Reusing Preprocessing Data as Supervisors

Contact Info

Product

Resources

About