September 24, 2018 feature

Using reinforcement learning to achieve human-like balance control strategies in robots

by Ingrid Fadelli , Tech Xplore

Researchers at the University of Edinburgh have developed a hierarchical framework based on deep reinforcement learning (RL) that can acquire a variety of strategies for humanoid balance control. Their framework, outlined in a paper pre-published on arXiv and presented at the 2017 International Conference on Humanoid Robotics, could perform far more human-like balancing behaviors than conventional controllers.

When standing or walking, human beings innately and effectively use a number of techniques for under-actuated control that help them to keep their balance. These include toe tilting and heel rolling, which create better foot-ground clearance. Replicating similar behaviors in humanoid robots could greatly improve their motor and movement capabilities.

"Our research focuses on using deep RL to solve dynamic locomotion of humanoid robots," Dr. Zhibin Li, a lecturer in robotics and control at the University of Edinburgh, who carried out the study, told TechXplore. "In the past, locomotion was mainly done using conventional analytical approaches—model based, which are limited because they require human effort and knowledge, and demand high computing power to run online."

Requiring less human effort and manual tuning, machine learning techniques could lead to the development of more effective and specific controllers than traditional engineering approaches. A further advantage of using RL is that the computation for these tools can also be outsourced offline, resulting in faster online performance for high dimensional control systems, such as humanoid robots.

"Given the increasingly powerful deep RL algorithms, an increasing number of research studies have started using deep RL to solve control tasks, as the recent progress in deep RL algorithms designed for continuous action domain has brought forward the possibility to apply reinforcement learning continuous control tasks that involve complicated dynamics," Dr. Li explained. "The main objective of our research was to explore the possibilities of using deep reinforcement learning to acquire versatile control policies comparable or better than analytical approaches while using less human effort."

The framework developed by Dr. Li, in collaboration with Dr. Taku Komura and Ph.D. student Chuanyu Yang, uses deep RL to attain high-level control policies. Constantly receiving feedback of the robot's state, these strategies enable desired joint angles at a lower frequency.

"At the low-level, proportional and derivative (PD) controllers are used at a much higher control frequency to guarantee the stable joint motions," Ph.D. student Chuanyu said. "The inputs for the low-level PD controller are desired joint angles produced by the high-level neural network, and the outputs are the desired torques for joint motors."

The researchers tested the performance of their algorithm and achieved highly promising results. They found that transferring human knowledge from control engineering methods to the reward design for RL algorithms enabled balance control strategies that resembled those used by humans. Moreover, as RL algorithms improve through a trial and error process, automatically adapting to new situations, their framework requires little hand tuning or other interventions by human engineers.

"Our study shows that deep reinforcement learning can be a powerful tool to produce comparable balancing results to that of a human-engineered controller with less manual tuning effort and shorter time," Dr. Li said. "The deep reinforcement learning algorithm we developed is even capable of learning emerged human-like behaviors such as tilting around toes or heels, which most engineering methods are unable to perform."

Dr. Li and his colleagues are now working on an extension of their study that applies RL to a full body Valkyrie robot in a 3-D simulation. In this new research effort, they were able to generalize human-resembling balancing strategies to walking and other locomotion tasks.

"Eventually, we would like to apply this hierarchical framework of combining machine learning and robot control to real humanoid robots, as well as to other robotic platforms," Dr. Li said.

More information: Emergence of human-comparable balancing behaviors by deep reinforcement learning. arXiv: 1809.02074v1 [cs.RO]. arxiv.org/abs/1809.02074

Citation: Using reinforcement learning to achieve human-like balance control strategies in robots (2018, September 24) retrieved 26 April 2024 from https://techxplore.com/news/2018-09-human-like-strategies-robots.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

A model-free deep reinforcement learning approach to tackle neural control problems

118 shares

Feedback to editors

Proof of concept study shows path to easier recycling of solar modules

40 minutes ago

New circuit boards can be repeatedly recycled

2 hours ago

Researchers develop an automated benchmark for language-based task planners

2 hours ago

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

2 hours ago

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

2 hours ago

Researchers outline path forward for tandem solar cells

4 hours ago

Researcher develop high-performance amorphous p-type oxide semiconductor

4 hours ago

Scientists create new atomic clock that is both ultra-precise and sturdy

4 hours ago

A framework to compare lithium battery testing data and results during operation

7 hours ago

New approach could make reusing captured carbon far cheaper, less energy-intensive

11 hours ago

Load comments (0)

Using reinforcement learning to achieve human-like balance control strategies in robots

Proof of concept study shows path to easier recycling of solar modules

New circuit boards can be repeatedly recycled

Researchers develop an automated benchmark for language-based task planners

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

Researchers outline path forward for tandem solar cells

Researcher develop high-performance amorphous p-type oxide semiconductor

Scientists create new atomic clock that is both ultra-precise and sturdy

A framework to compare lithium battery testing data and results during operation

New approach could make reusing captured carbon far cheaper, less energy-intensive

A model-free deep reinforcement learning approach to tackle neural control problems

Baidu researchers develop a new auto-tuning framework for autonomous vehicles

Using a deep learning neural network to allow a car to learn to drive itself in just 20 minutes

Teaching robots how to interact with children with autism

Teaching machines to direct traffic through deep reinforcement learning

Breakthrough software teaches computer characters to walk, run, even play soccer

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Researchers develop an automated benchmark for language-based task planners

Study explores why human-inspired machines can be perceived as eerie

Why can't robots outrun animals?

Virtual sensors help aerial vehicles stay aloft when rotors fail

People, not design features, make a robot social

Phys.org

Medical Xpress

Science X

Using reinforcement learning to achieve human-like balance control strategies in robots

Proof of concept study shows path to easier recycling of solar modules

New circuit boards can be repeatedly recycled

Researchers develop an automated benchmark for language-based task planners

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

Researchers outline path forward for tandem solar cells

Researcher develop high-performance amorphous p-type oxide semiconductor

Scientists create new atomic clock that is both ultra-precise and sturdy

A framework to compare lithium battery testing data and results during operation

New approach could make reusing captured carbon far cheaper, less energy-intensive

Related Stories

A model-free deep reinforcement learning approach to tackle neural control problems

Baidu researchers develop a new auto-tuning framework for autonomous vehicles

Using a deep learning neural network to allow a car to learn to drive itself in just 20 minutes

Teaching robots how to interact with children with autism

Teaching machines to direct traffic through deep reinforcement learning

Breakthrough software teaches computer characters to walk, run, even play soccer

Recommended for you

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Researchers develop an automated benchmark for language-based task planners

Study explores why human-inspired machines can be perceived as eerie

Why can't robots outrun animals?

Virtual sensors help aerial vehicles stay aloft when rotors fail

People, not design features, make a robot social

Your Privacy