January 17, 2020 report

AI learning technique may illustrate function of reward pathways in the brain

by Bob Yirka , Tech Xplore

AI learning technique found to be applicable to reward pathways in the brain — When the future is uncertain, future reward can be represented as a probability distribution. some possible futures are good (teal), others are bad (red). Distributional reinforcement learning can learn about this distribution over predicted rewards through a variant of the TD algorithm. Credit: *Nature* (2020). DOI: 10.1038/s41586-019-1924-6

A team of researchers from DeepMind, University College and Harvard University has found that lessons learned in applying learning techniques to AI systems may help explain how reward pathways work in the brain. In their paper published in the journal Nature, the group describes comparing distributional reinforcement learning in a computer with dopamine processing in the mouse brain, and what they learned from it.

Prior research has shown that dopamine produced in the brain is involved in reward processing—it is produced when something good happens, and its expression results in feelings of pleasure. Some studies have also suggested that the neurons in the brain that respond to the presence of dopamine all respond in the same ways—an event causes a person or a mouse to feel either good or bad. Other studies have suggested that neuronal response is more of a gradient. In this new effort, the researchers have found evidence supporting the latter theory.

Distributional reinforcement learning is a type of machine learning based on reinforcement. It is often used when designing games such as Starcraft II or Go. It keeps track of good moves versus bad moves and learns to reduce the number of bad moves, improving its performance the more it plays. But such systems do not treat all good and bad moves the same—each move is weighted as it is recorded and the weights are part of the calculations used when making future move choices.

Researchers have noted that humans appear to use a similar strategy to improve their level of play, as well. The researchers in London suspected that the similarities between the AI systems and the way the brain carries out reward processing were likely similar, as well. To find out if they were correct, they carried out experiments with mice. They inserted devices into their brains that were capable of recording responses from individual dopamine neurons. The mice were then trained to carry out a task in which they received rewards for responding in a desired way.

The mouse neuron responses revealed that they did not all respond the same way, as prior theory had predicted. Instead, they responded in reliably different ways—an indication that the levels of pleasure the mice were experiencing were more of a gradient, as the team had predicted.

More information: Will Dabney et al. A distributional code for value in dopamine-based reinforcement learning, Nature (2020). DOI: 10.1038/s41586-019-1924-6

Journal information: Nature

Citation: AI learning technique may illustrate function of reward pathways in the brain (2020, January 17) retrieved 24 April 2024 from https://techxplore.com/news/2020-01-ai-technique-function-reward-pathways.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Sugar changes the chemistry of your brain

148 shares

Feedback to editors

With a game show as his guide, researcher uses AI to predict deception

7 hours ago

Super Mario hackers' tricks could protect software from bugs, study finds

8 hours ago

The world's largest 3D printer is at a university in Maine. It just unveiled an even bigger one

10 hours ago

Researchers develop tiny chip that can safeguard user data while enabling efficient computing on a smartphone

12 hours ago

Personalization has the potential to democratize who decides how LLMs behave

12 hours ago

Aerogel-based phase change materials improve thermal management, reduce microwave emissions in electronic devices

12 hours ago

Holographic displays offer a glimpse into an immersive future

12 hours ago

Researchers develop high-energy-density aqueous battery based on halogen multi-electron transfer

12 hours ago

Extracting high-purity gold from electrical and electronic waste

14 hours ago

How potatoes, corn and beans led to breakthrough in smart windows technology

14 hours ago

Load comments (0)

AI learning technique may illustrate function of reward pathways in the brain

With a game show as his guide, researcher uses AI to predict deception

Super Mario hackers' tricks could protect software from bugs, study finds

The world's largest 3D printer is at a university in Maine. It just unveiled an even bigger one

Researchers develop tiny chip that can safeguard user data while enabling efficient computing on a smartphone

Personalization has the potential to democratize who decides how LLMs behave

Aerogel-based phase change materials improve thermal management, reduce microwave emissions in electronic devices

Holographic displays offer a glimpse into an immersive future

Researchers develop high-energy-density aqueous battery based on halogen multi-electron transfer

Extracting high-purity gold from electrical and electronic waste

How potatoes, corn and beans led to breakthrough in smart windows technology

Sugar changes the chemistry of your brain

Dopamine fasting: an expert reviews the latest craze in Silicon Valley

How our brains remember things depends upon how we learn them

In pursuit of pleasure, brain learns to hit the repeat button

New data suggests nicotine while pregnant alters genes

Unraveling the brain's reward circuits

A new framework to generate human motions from language prompts

Personalization has the potential to democratize who decides how LLMs behave

With a game show as his guide, researcher uses AI to predict deception

Neural networks can mediate between download size and quality, according to researcher

A coffee roastery in Finland has launched an AI-generated blend. The results were surprising

Microsoft teases lifelike avatar AI tech but gives no release date

Phys.org

Medical Xpress

Science X

AI learning technique may illustrate function of reward pathways in the brain

With a game show as his guide, researcher uses AI to predict deception

Super Mario hackers' tricks could protect software from bugs, study finds

The world's largest 3D printer is at a university in Maine. It just unveiled an even bigger one

Researchers develop tiny chip that can safeguard user data while enabling efficient computing on a smartphone

Personalization has the potential to democratize who decides how LLMs behave

Aerogel-based phase change materials improve thermal management, reduce microwave emissions in electronic devices

Holographic displays offer a glimpse into an immersive future

Researchers develop high-energy-density aqueous battery based on halogen multi-electron transfer

Extracting high-purity gold from electrical and electronic waste

How potatoes, corn and beans led to breakthrough in smart windows technology

Related Stories

Sugar changes the chemistry of your brain

Dopamine fasting: an expert reviews the latest craze in Silicon Valley

How our brains remember things depends upon how we learn them

In pursuit of pleasure, brain learns to hit the repeat button

New data suggests nicotine while pregnant alters genes

Unraveling the brain's reward circuits

Recommended for you

A new framework to generate human motions from language prompts

Personalization has the potential to democratize who decides how LLMs behave

With a game show as his guide, researcher uses AI to predict deception

Neural networks can mediate between download size and quality, according to researcher

A coffee roastery in Finland has launched an AI-generated blend. The results were surprising

Microsoft teases lifelike avatar AI tech but gives no release date

Your Privacy