October 10, 2018 feature

A new developmental reinforcement learning approach for sensorimotor space enlargement

by Ingrid Fadelli, Tech Xplore, Tech Xplore

Researchers at the University of Lorraine have recently devised a new type of transfer learning based on model-free deep reinforcement learning with continuous sensorimotor space enlargement. Their approach, presented in a paper published during the eighth Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, and freely available on HAL archives-ouvertes, is inspired by child development, particularly by the growth of the sensorimotor space that occurs as a child is acquiring helpful new strategies.

"The formal framework of reinforcement learning can be used to model a wide range of problems," said Matthieu Zimmer, one of the researchers who carried out the study. "In this framework, an agent uses a trial-and-error method to slowly learn what sequence of actions is the most appropriate to reach a desired goal. If some requisites are met, then theory tells us that we have algorithms that the agent can use to find the optimal solution to the problem, yet this can take long periods of time. To speed up this process, we explored ways for an agent to attain good performance in fewer trials, even when it has nearly no knowledge of the task it will have to solve."

The transfer learning method proposed by Zimmer and his colleagues adds developmental layers to neural networks, allowing them to develop new strategies to complete tasks, especially when these tasks are somehow related. These developmental layers progressively uncover some dimensions of the sensorimotor space, following an intrinsic motivation heuristic.

To mitigate the effects of "catastrophic forgetting," a common issue in the development of neural networks, the researchers took inspiration from elastic weight consolidation theory, using it to regulate the learning of the neural controller.

"The basic idea of our work is for the agent to start with very limited perception and action capabilities and then develop these in a developmental way, inspired by how a child learns," said Alain Dutech, another researcher who carried out the study. "The space in which the agent searches for a solution is thus reduced, and this solution, albeit to a degraded problem, can be found more easily. Then we increase the capabilities of the agent, taking advantage of the previous solution found."

To better explain how their transfer learning approach works, the researchers use the example of a child learning to grab a pen. Initially, the child might only use her elbow and shoulder, learning how to touch the pen. Successively, she might decide to start using the hand and fingers, having grasped the basics of how to best make initial contact with the pen. This entails a gradual learning process, in which the child acquires sensorimotor strategies step by step, without having to learn too many things at once.

The researchers validated their new approach using two state-of-the-art deep learning algorithms, namely DDPG and NFAC, tested on Half-Cheetah and Humanoid, two high-dimensional environment benchmarks. Their results suggest that searching for a suboptimal solution in a subset of the parameter space before considering the full space is a helpful technique to bootstrap learning algorithms, achieving better performance with shorter training.

"In the very active and stimulating field of deep-reinforcement learning, we have shown that developmental methods like ours, as well as other similar ones explored by other researchers, could be combined with deep-learning methods to allow learning from scratch, with little prior knowledge," Zimmer said.

Despite its promising results, the study carried out by Zimmer and his colleagues also highlighted the gap that still exists between the abilities of deep neural networks and human beings. In fact, even when using developmental reinforcement learning, most existing agents are still far less versatile and efficient than humans.

"Sometimes, humans can learn in just one trial, yet even the most efficient artificial learning will require a complex combination of different algorithms to learn, estimate, memorize, compare, and optimize," Zimmer said. "Moreover, some of these algorithms are still not clearly defined."

Dutech and his colleagues are now exploring new horizons within the field of AI and deep learning. For instance, they would like to develop new ways for a learning agent to properly categorize the stimuli it perceives.

"Learning is much more efficient when the agent can interpret what is 'sees' or 'feels'," Dutech explained. "Today, the trend is to use deep-learning and neural networks to do this. We are now exploring other methods of extracting pertinent and useful information from the raw perception of artificial agents, which are less dependent on having a huge corpus of examples; such as unsupervised learning and self-organization."

More information: Developmental reinforcement learning through sensorimotor space enlargement. HAL Id: hal-01876995. hal.archives-ouvertes.fr/hal-01876995/document

Deep developmental reinforcement learning repo: github.com/matthieu637/ddrl

More resources: matthieu-zimmer.net/publicatio … /icdl2018_slides.zip

Provided by Tech Xplore

Citation: A new developmental reinforcement learning approach for sensorimotor space enlargement (2018, October 10) retrieved 19 April 2024 from https://techxplore.com/news/2018-10-developmental-approach-sensorimotor-space-enlargement.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Using reinforcement learning to achieve human-like balance control strategies in robots

463 shares

Feedback to editors

Team develops a way to teach a computer to type like a human

7 hours ago

Universal 'cocktail electrolyte' developed for 4.6 V ultra-stable fast charging of commercial lithium-ion batteries

8 hours ago

Garbage could replace a quarter of petroleum-based jet fuel every year

8 hours ago

For more open and equitable public discussions on social media, try 'meronymity'

10 hours ago

Mess is best: Disordered structure of battery-like devices improves performance

10 hours ago

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

11 hours ago

An ink for 3D-printing flexible devices without mechanical joints

11 hours ago

Floating solar's potential to support sustainable development

12 hours ago

Harvesting vibrational energy from 'colored noise'

13 hours ago

New understanding of energy losses in emerging light source

13 hours ago

Load comments (0)

A new developmental reinforcement learning approach for sensorimotor space enlargement

Team develops a way to teach a computer to type like a human

Universal 'cocktail electrolyte' developed for 4.6 V ultra-stable fast charging of commercial lithium-ion batteries

Garbage could replace a quarter of petroleum-based jet fuel every year

For more open and equitable public discussions on social media, try 'meronymity'

Mess is best: Disordered structure of battery-like devices improves performance

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

An ink for 3D-printing flexible devices without mechanical joints

Floating solar's potential to support sustainable development

Harvesting vibrational energy from 'colored noise'

New understanding of energy losses in emerging light source

Using reinforcement learning to achieve human-like balance control strategies in robots

DeepMind uses neural network to help explain meta-learning in people

A model-free deep reinforcement learning approach to tackle neural control problems

A new developmental framework could allow robots to optimize hyper-parameters autonomously

Using a deep learning neural network to allow a car to learn to drive itself in just 20 minutes

Scientists improve deep learning method for neural networks

For more open and equitable public discussions on social media, try 'meronymity'

Researchers develop energy-efficient probabilistic computer by combining CMOS with stochastic nanomagnet

New computer vision tool can count damaged buildings in crisis zones and accurately estimate bird flock sizes

Game theory research shows AI can evolve into more selfish or cooperative personalities

Proof-of-principle demonstration of 3D magnetic recording could lead to enhanced hard disk drives

Tech companies want to build artificial general intelligence. But who decides when AGI is attained?

Phys.org

Medical Xpress

Science X

A new developmental reinforcement learning approach for sensorimotor space enlargement

Team develops a way to teach a computer to type like a human

Universal 'cocktail electrolyte' developed for 4.6 V ultra-stable fast charging of commercial lithium-ion batteries

Garbage could replace a quarter of petroleum-based jet fuel every year

For more open and equitable public discussions on social media, try 'meronymity'

Mess is best: Disordered structure of battery-like devices improves performance

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

An ink for 3D-printing flexible devices without mechanical joints

Floating solar's potential to support sustainable development

Harvesting vibrational energy from 'colored noise'

New understanding of energy losses in emerging light source

Related Stories

Using reinforcement learning to achieve human-like balance control strategies in robots

DeepMind uses neural network to help explain meta-learning in people

A model-free deep reinforcement learning approach to tackle neural control problems

A new developmental framework could allow robots to optimize hyper-parameters autonomously

Using a deep learning neural network to allow a car to learn to drive itself in just 20 minutes

Scientists improve deep learning method for neural networks

Recommended for you

For more open and equitable public discussions on social media, try 'meronymity'

Researchers develop energy-efficient probabilistic computer by combining CMOS with stochastic nanomagnet

New computer vision tool can count damaged buildings in crisis zones and accurately estimate bird flock sizes

Game theory research shows AI can evolve into more selfish or cooperative personalities

Proof-of-principle demonstration of 3D magnetic recording could lead to enhanced hard disk drives

Tech companies want to build artificial general intelligence. But who decides when AGI is attained?

Your Privacy