October 30, 2018 feature

A new dynamic ensemble active learning method based on a non-stationary bandit

by Ingrid Fadelli , Tech Xplore

Researchers at the University of Edinburgh, University College London (UCL) and Nara Institute of Science and Technology have developed a new ensemble active learning approach based on a non-stationary multi-armed bandit and an expert advice algorithm. Their method, presented in a paper pre-published on arXiv, could reduce the time and effort invested in the manual annotation of data.

"Conventional supervised machine learning is data-hungry, and labelled data can be a bottleneck when data annotation is expensive," Timothy Hospedales, one of the researchers who carried out the study told Tech Xplore. "Active learning supports supervised learning by predicting the most informative data points to annotate so that good models can be trained with a reduced annotation budget."

Active learning is a particular area of machine learning in which a learning algorithm can actively choose the data it wants to learn from. This typically results in better performance, with significantly smaller training datasets.

Researchers have developed a variety of active learning algorithms that could reduce the costs of annotation, but so far, none of these solutions has proved to be effective for all problems. Other studies have hence used bandit algorithms to identify the best active learning algorithm for a given dataset.

"The term 'bandit' refers to a multi-armed bandit slot machine, which is a convenient mathematical abstraction for exploration/exploitation problems," Hospedales explained. "A bandit algorithm finds a good balance between effort spent on exploring all slot machines to find out which is paying out most, with effort spent on exploiting the best slot machine found so far."

The efficacy of active learning algorithms varies both across problems and over time at different stages of learning. This observation is analogous to playing slot machines, where payout probability changes over time.

"The aim of our study was to develop a new bandit algorithm that improves performance by accounting for this aspect of the active learning problem," Hospedales said.

To tackle this limitation, the researchers proposed a dynamic ensemble active learner (DEAL) based on a non-stationary bandit. This learner builds up an estimate of each active learning algorithm's efficacy online, based on the reward (importance-weighted accuracy) obtained after every annotation of data.

"It does this by using the preference expressed for that point by each active learning algorithm," Kunkun Pang, another researcher who carried out the study, told Tech Xplore. "To deal with the issue of the changing efficacy of active learners over time, we periodically restart the learning algorithm to refresh its active learner preference. With this capability, if the most effective active learning algorithm changes between early and late stages of learning, we can quickly adapt to this change."

The researchers tested their approach on 13 popular datasets, achieving highly encouraging results. Their DEAL algorithm has a mathematical performance guarantee, meaning that it there is a high degree of confidence in how well it will work.

"The guarantee relates the performance of our algorithm, which is that of an ideal oracle that always knows the right choice for the active learner," Hospedales explained. "It provides a bound on the performance gap between such a best-case algorithm and ours."

The empirical evaluation carried out by Hospedales and his colleagues confirmed that their DEAL algorithm improves active learning performance on a suite of benchmarks. It does this by continuously identifying the most effective active learning algorithm for different tasks and at different stages of training.

"Today, while active learning is appealing, its impact on machine learning practices is limited due to the hassle of matching algorithms to problems and to stages of learning," Hospedales said. "DEAL eliminates this difficulty and provides an approach to tackle many problems and all stages of learning. By making active learning easier to use, we hope it can have a bigger impact on reducing annotation cost in machine learning practice."

Despite the very promising results, the technique devised by the researchers still has a significant limitation. DEAL does all the learning within a single problem and this results in a 'cold start,' meaning that the algorithm approaches all new problems with a blank slate.

"In ongoing work, we are learning how to annotate on many different problems and eventually transfer this knowledge to a new problem, in order to perform effective annotation immediately with no warm-up requirements," Pang said. "Our preliminary work on this topic has been published and also won the Best Paper prize at ICML 2018 AutoML workshop."

More information: Dynamic ensemble active learning: A non-stationary bandit with expert advice. arXiv: 1810.07778 [cs.LG]. arxiv.org/abs/1810.07778

Meta-Learning Transferable Active Learning Policies by Deep Reinforcement Learning. arxiv:1806.04798 [cs.LG] arxiv.org/abs/1806.04798

Citation: A new dynamic ensemble active learning method based on a non-stationary bandit (2018, October 30) retrieved 24 April 2024 from https://techxplore.com/news/2018-10-dynamic-ensemble-method-based-non-stationary.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New algorithm limits bias in machine learning

59 shares

Feedback to editors

Ultra-thin, flexible solar cells demonstrate their promise in a commercial quadcopter drone

2 minutes ago

Microsoft claims that small, localized language models can be powerful as well

59 minutes ago

Securing competitiveness of energy-intensive industries through relocation: The pulling power of renewables

1 hour ago

New research demonstrates potential of thin-film electronics for flexible chip design

1 hour ago

A simple 'twist' improves the engine of clean fuel generation

1 hour ago

Storing and utilizing energy with innovative sulfur-based cathodes

1 hour ago

Salt battery harvests osmotic energy where the river meets the sea

4 hours ago

Emulating neurodegeneration and aging in artificial intelligence systems

5 hours ago

With a game show as his guide, researcher uses AI to predict deception

19 hours ago

Super Mario hackers' tricks could protect software from bugs, study finds

20 hours ago

Load comments (0)

A new dynamic ensemble active learning method based on a non-stationary bandit

Ultra-thin, flexible solar cells demonstrate their promise in a commercial quadcopter drone

Microsoft claims that small, localized language models can be powerful as well

Securing competitiveness of energy-intensive industries through relocation: The pulling power of renewables

New research demonstrates potential of thin-film electronics for flexible chip design

A simple 'twist' improves the engine of clean fuel generation

Storing and utilizing energy with innovative sulfur-based cathodes

Salt battery harvests osmotic energy where the river meets the sea

Emulating neurodegeneration and aging in artificial intelligence systems

With a game show as his guide, researcher uses AI to predict deception

Super Mario hackers' tricks could protect software from bugs, study finds

New algorithm limits bias in machine learning

New algorithm can more quickly predict LED materials

Restoring balance in machine learning datasets

Improving machine learning with an old approach

Baidu researchers develop a new auto-tuning framework for autonomous vehicles

Machine learning used for helping farmers select optimal products suited for their operation

Microsoft claims that small, localized language models can be powerful as well

Emulating neurodegeneration and aging in artificial intelligence systems

Personalization has the potential to democratize who decides how LLMs behave

With a game show as his guide, researcher uses AI to predict deception

A new framework to generate human motions from language prompts

Neural networks can mediate between download size and quality, according to researcher

Phys.org

Medical Xpress

Science X

A new dynamic ensemble active learning method based on a non-stationary bandit

Ultra-thin, flexible solar cells demonstrate their promise in a commercial quadcopter drone

Microsoft claims that small, localized language models can be powerful as well

Securing competitiveness of energy-intensive industries through relocation: The pulling power of renewables

New research demonstrates potential of thin-film electronics for flexible chip design

A simple 'twist' improves the engine of clean fuel generation

Storing and utilizing energy with innovative sulfur-based cathodes

Salt battery harvests osmotic energy where the river meets the sea

Emulating neurodegeneration and aging in artificial intelligence systems

With a game show as his guide, researcher uses AI to predict deception

Super Mario hackers' tricks could protect software from bugs, study finds

Related Stories

New algorithm limits bias in machine learning

New algorithm can more quickly predict LED materials

Restoring balance in machine learning datasets

Improving machine learning with an old approach

Baidu researchers develop a new auto-tuning framework for autonomous vehicles

Machine learning used for helping farmers select optimal products suited for their operation

Recommended for you

Microsoft claims that small, localized language models can be powerful as well

Emulating neurodegeneration and aging in artificial intelligence systems

Personalization has the potential to democratize who decides how LLMs behave

With a game show as his guide, researcher uses AI to predict deception

A new framework to generate human motions from language prompts

Neural networks can mediate between download size and quality, according to researcher

Your Privacy