February 2, 2018

Researchers develop new algorithms to train robots

by U.S. Army Research Laboratory

Researchers at the U.S. Army Research Laboratory and the University of Texas at Austin have developed new techniques for robots or computer programs to learn how to perform tasks by interacting with a human instructor. The findings of the study will be presented and published at the Association for the Advancement of Artificial Intelligence Conference in New Orleans, Louisiana, Feb. 2-7.

ARL and UT researchers considered a specific case where a human provides real-time feedback in the form of critique. This approach was first introduced as TAMER, or Training an Agent Manually via Evaluative Reinforcement, by collaborator Dr. Peter Stone, a professor at the University of Texas at Austin, and his former doctoral student Brad Knox. Building on that work, the ARL/UT team developed a new algorithm called Deep TAMER.

Deep TAMER is an extension of TAMER that uses deep learning, a class of machine learning algorithms loosely inspired by the brain, to give a robot the ability to learn how to perform tasks by viewing video streams, in a short amount of time, with a human trainer.

According to Army researcher Dr. Garrett Warnell, the team considered situations where a human teaches an agent how to behave by observing it and providing critique, for example "good job" or "bad job," similar to the way a person might train a dog to do a trick. Warnell said the researchers extended earlier work in this field to enable this type of training for robots or computer programs that currently see the world through images, which is an important first step in designing learning agents that can operate in the real world.
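The critique-based training described above can be illustrated with a small toy sketch. It is not the researchers' Deep TAMER implementation: it assumes a five-cell corridor instead of video input, a table instead of a deep network, and a scripted stand-in for the human trainer. All names (`simulated_trainer`, `greedy_action`, `train`) are hypothetical. The agent keeps an estimate of the feedback it expects for each action, nudges that estimate toward each "good job" (+1) or "bad job" (-1) it receives, and always picks the action it expects the trainer to rate highest.

```python
ACTIONS = (-1, +1)   # move left, move right
N_STATES = 5         # corridor cells 0..4 (assumed toy environment)
GOAL = 4             # the trainer wants the agent to reach the right end


def simulated_trainer(state, action):
    """Stand-in for the human: +1 ("good job") for moving toward the goal, else -1."""
    return 1.0 if action == +1 else -1.0


def greedy_action(h, state):
    """Index of the action with the highest predicted human feedback (ties: first)."""
    return max(range(len(ACTIONS)), key=lambda a: h[state][a])


def train(episodes=20, lr=0.5, max_steps=50):
    """Learn a table h[state][action] that predicts the trainer's critique."""
    h = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if state == GOAL:
                break
            a = greedy_action(h, state)
            feedback = simulated_trainer(state, ACTIONS[a])
            # Move the prediction for (state, action) toward the observed critique.
            h[state][a] += lr * (feedback - h[state][a])
            # Apply the move, keeping the agent inside the corridor.
            state = min(max(state + ACTIONS[a], 0), N_STATES - 1)
    return h
```

Calling `train()` and then `greedy_action(h, s)` for each non-goal cell shows the learned behavior: after a few "bad job" signals for moving left, the agent prefers moving right everywhere. In the real system, the table would be replaced by a neural network that reads image frames, and the feedback would come from a person watching the agent.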

Many current techniques in artificial intelligence require robots to interact with their environment for extended periods of time to learn how to optimally perform a task. During this process, the agent might perform actions that are not only wrong, such as a robot running into a wall, but catastrophic, such as a robot running off the side of a cliff. Warnell said help from humans will speed things up for the agents and help them avoid potential pitfalls.

As a first step, the researchers demonstrated Deep TAMER's success by using it with 15 minutes of human-provided feedback to train an agent to perform better than humans on the Atari game Bowling, a task that has proven difficult for even state-of-the-art methods in artificial intelligence. Deep-TAMER-trained agents exhibited superhuman performance, besting both their amateur trainers and, on average, an expert human Atari player.

Within the next one to two years, researchers are interested in exploring the applicability of their newest technique in a wider variety of environments: for example, video games other than Atari Bowling and additional simulation environments to better represent the types of agents and environments found when fielding robots in the real world.

Their work will be published in the AAAI 2018 conference proceedings.

"The Army of the future will consist of Soldiers and autonomous teammates working side-by-side," Warnell said. "While both humans and autonomous agents can be trained in advance, the team will inevitably be asked to perform tasks, for example, search and rescue or surveillance, in new environments they have not seen before. In these situations, humans are remarkably good at generalizing their training, but current artificially-intelligent agents are not."

Deep TAMER is the first step in a line of research that the researchers envision will enable more successful human-autonomy teams in the Army. Ultimately, they want autonomous agents that can quickly and safely learn from their human teammates in a wide variety of styles, such as demonstration, natural language instruction and critique.

Provided by U.S. Army Research Laboratory

Citation: Researchers develop new algorithms to train robots (2018, February 2) retrieved 19 April 2024 from https://techxplore.com/news/2018-02-algorithms-robots.html
