Shiro Masuda scite author profile

Shiro Masuda

4Publications

4Citation Statements Received

68Citation Statements Given

How they've been cited

How they cite others

Affiliations

Tokyo Metropolitan University

Publications

Order By: Most citations

Profit Sharing Introducing the Judgement of Incomplete Perception

Saito¹,

Masuda²

2004

Transactions of the Japanese Society for Artificial Intelligenc

View full text Add to dashboard Cite

keywords: reinforcement learning, POMDPs, profit sharing (PS), extended on-line profit sharing (EOPS), extended on-line profit sharing with judgement (EOPSwJ) SummaryTo apply reinforcement learning to difficult classes such as real-environment learning, we need to use a method robust to perceptual aliasing problem. The exploitation-oriented methods such as Profit Sharing can deal with the perceptual aliasing problem to a certain extent. However, when the agent needs to select different actions at the same sensory input, the learning efficiency worsens. To overcome the problem, several state partition methods using history information of state-action pairs are proposed. These methods try to convert a POMDP environment into an MDP environment, and thus they are sometimes very useful. However, their computation cost is very high especially in large state spaces. In contrast, memory-less approaches try to escape from the aliased states by outputting actions stochastically. However, these methods output actions stochastically even in unaliased states, and thus the learning efficiency is bad. If we desire to guarantee the rationality in POMDPs, it is efficient to output actions stochastically only in the aliased states and to output one action deterministically in the other unaliased states. Hence, to discriminate between aliased states and unaliased states, the utilization of χ 2 -goodness-of-fit test is proposed by Miyazaki et al. They point out that, in aliased states, the distributions of the state transitions by random search and a particular policy are different. This difference doesn't occur owing to non-deterministic actions. Hence, if the agent can collect enough samples to implement the test, the agent can distinguish between aliased states and unaliased states well. However, such a test needs a large amount of data, and it's a problem how the agent collects samples without worsening learning efficiency. If the agent uses random search in the course of learning, the learning efficiency worsens especially in unaliased states. Therefore, in this research, we propose a new method called Extended On-line Profit Sharing with Judgement (EOPSwJ) to detect important incomplete perception, which doesn't need large computation cost and numerous samples. We use two criterions for detecting important incomplete perceptions to attain a task. One is the rate of transitions to each state, and the other is the deterministic rate of actions. We confirm the availability of EOPSwJ using two simulations.

show abstract

Multi-Scopic Neuro-Cognitive Adaptation for Legged Locomotion Robots

Saputra

Wada

Masuda

et al. 2021

Preprint

View full text Add to dashboard Cite

Dynamic locomotion is realized through a simultaneous integration of adaptability and optimality. This article proposes a neuro-cognitive model for multi-legged locomotion robot that can seamlessly integrate multi-modal sensing, ecological perception, and cognition through the coordination of interoceptive and exteroceptive sensory information. Importantly, cognitive models can be discussed as micro-, meso-, and macro-scopic; these concepts correspond to sensing, perception, and cognition; and short-, medium-, and long-term adaptation (in terms of ecological psychology). The proposed neuro-cognitive model integrates these intelligent functions from a multi-scopic point of view. Macroscopic-level presents an attention mechanism with short-term adaptive locomotion control conducted by lower-level sensorimotor coordination-based model. Macrosopic-level serves environmental cognitive map featuring higher-level behavior planning. Mesoscopic level shows integration between the microscopic and macroscopic approaches, enabling the model to reconstruct a map and conduct localization using bottom-up facial environmental information and top-down map information, generating intention towards the ultimate goal at the macroscopic level. The experiments demonstrated that adaptability and optimality of multi-legged locomotion could be achieved using the proposed multi-scale neuro-cognitive model, from short to long-term adaptation, with efficient computational usage. Future research directions can be implemented not only in robotics contexts but also in the context of interdisciplinary studies incorporating cognitive science and ecological psychology.

show abstract

Multi-scopic neuro-cognitive adaptation for legged locomotion robots

Saputra

Wada

Masuda

et al. 2022

Sci Rep

View full text Add to dashboard Cite

Dynamic locomotion is realized through a simultaneous integration of adaptability and optimality. This article proposes a neuro-cognitive model for a multi-legged locomotion robot that can seamlessly integrate multi-modal sensing, ecological perception, and cognition through the coordination of interoceptive and exteroceptive sensory information. Importantly, cognitive models can be discussed as micro-, meso-, and macro-scopic; these concepts correspond to sensing, perception, and cognition; and short-, medium-, and long-term adaptation (in terms of ecological psychology). The proposed neuro-cognitive model integrates these intelligent functions from a multi-scopic point of view. Macroscopic-level presents an attention mechanism with short-term adaptive locomotion control conducted by a lower-level sensorimotor coordination-based model. Macrosopic-level serves environmental cognitive map featuring higher-level behavior planning. Mesoscopic level shows integration between the microscopic and macroscopic approaches, enabling the model to reconstruct a map and conduct localization using bottom-up facial environmental information and top-down map information, generating intention towards the ultimate goal at the macroscopic level. The experiments demonstrated that adaptability and optimality of multi-legged locomotion could be achieved using the proposed multi-scale neuro-cognitive model, from short to long-term adaptation, with efficient computational usage. Future research directions can be implemented not only in robotics contexts but also in the context of interdisciplinary studies incorporating cognitive science and ecological psychology.

show abstract

Simultaneous Determination of OP and OQ for Various Demand Models

Murase¹,

Masuda²,

Fukuda³

2004

Transactions of the Institute of Systems, Control and Informati

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shiro Masuda

Profit Sharing Introducing the Judgement of Incomplete Perception

Multi-Scopic Neuro-Cognitive Adaptation for Legged Locomotion Robots

Multi-scopic neuro-cognitive adaptation for legged locomotion robots

Simultaneous Determination of OP and OQ for Various Demand Models

Contact Info

Product

Resources

About