Chemists show how bias can crop up in machine learning algorithm results

a, The proportion by outcome for each reaction, using the outcome scale described in Methods, for the popular and not-popular amines in the human-selected dataset. b, Estimated probability of observing at least one successful reaction (outcome 4) or failure (outcomes 1, 2 and 3) for a given amine, for the N = 27 popular and N = 28 not-popular amines among the human-selected dataset. Centre values indicate observed proportion of outcomes. Error bars indicate a bootstrap estimate of the standard deviation. Credit: Nature (2019). DOI: 10.1038/s41586-019-1540-5
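The error bars in the figure come from a bootstrap estimate of the standard deviation of each observed proportion. As a rough illustration of how such an estimate is computed, here is a minimal Python sketch; the outcome values are made up and the function name is hypothetical, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical outcomes for one amine: 1 = success (outcome 4),
# 0 = failure (outcomes 1-3). These values are illustrative only.
outcomes = np.array([1, 0, 1, 1, 0, 1, 0, 0, 1, 1])

def bootstrap_proportion_sd(data, n_boot=10_000):
    """Bootstrap estimate of the standard deviation of a proportion."""
    n = len(data)
    # Resample with replacement and record the success proportion each time.
    props = np.array([
        rng.choice(data, size=n, replace=True).mean()
        for _ in range(n_boot)
    ])
    return props.std()

print(f"observed proportion: {outcomes.mean():.2f}")
print(f"bootstrap SD (error bar): {bootstrap_proportion_sd(outcomes):.3f}")
```

Repeating this for each amine would yield centre values and error bars of the kind described in the caption.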

A team of materials scientists at Haverford College has shown how human bias in training data can affect the results of machine-learning algorithms used to predict new reagents for making desired products. In their paper published in the journal Nature, the group describes how they tested a machine-learning algorithm on different types of datasets and what they found.

One of the better-known applications of machine-learning algorithms is facial recognition, but such algorithms can go wrong. One problem occurs when a facial-recognition algorithm meant to pick out an individual among many faces has been trained on images of people of just one race. In this new effort, the researchers wondered whether bias, unintentional or otherwise, might be cropping up in the results of machine-learning algorithms used in chemistry applications designed to look for new products.

Such algorithms are trained on data describing the ingredients of reactions that result in the creation of a new product, and the data a system is trained on can have a major impact on its results. The researchers note that such data currently come from published research efforts, which means they are typically generated by humans. The data could have been produced by the researchers themselves or by other groups working on separate efforts; they could even come from a single person recalling a recipe from memory, a professor's suggestion, or a graduate student's bright idea. The point is that the data can be biased by the backgrounds of their sources.

In this new effort, the researchers wanted to know whether such biases might affect the results of machine-learning algorithms used for chemistry applications. To find out, they looked at a specific family of materials called amine-templated vanadium borates. When these are synthesized successfully, crystals form, which provides an easy way to determine whether a reaction worked.

The experiment consisted of training a machine-learning system on data describing the synthesis of vanadium borates, then having the system propose reactions of its own. Some of the training data had been selected by humans, and some had been generated randomly. The researchers report that the system trained on the random data did better at finding ways to synthesize the vanadium borates than the one trained on the human-selected data. They argue that this reveals a clear bias in the human-generated data.
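To illustrate the shape of such a comparison, here is a minimal, self-contained Python sketch using entirely synthetic data and scikit-learn; the feature layout, the toy success rule, and all numbers are hypothetical stand-ins, not the authors' model or dataset. It trains the same classifier on a "human-selected" set that oversamples one popular corner of condition space and on a uniformly sampled set, then evaluates both on unbiased held-out conditions.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(42)

# Hypothetical reaction descriptors (e.g. reagent amounts, temperature, pH)
# and crystal/no-crystal outcomes; real features would come from experiments.
def make_dataset(n, biased=False):
    X = rng.uniform(size=(n, 4))
    if biased:
        # Mimic human selection: crowd half the samples into a "popular"
        # corner of condition space instead of covering it uniformly.
        X[: n // 2] = 0.2 + 0.1 * rng.uniform(size=(n // 2, 4))
    y = (X.sum(axis=1) > 2.0).astype(int)  # toy success rule
    return X, y

X_human, y_human = make_dataset(400, biased=True)     # "human-selected"
X_random, y_random = make_dataset(400, biased=False)  # randomly sampled
X_test, y_test = make_dataset(1000, biased=False)     # unbiased evaluation

for name, (X, y) in {"human-selected": (X_human, y_human),
                     "random": (X_random, y_random)}.items():
    model = RandomForestClassifier(random_state=0).fit(X, y)
    acc = accuracy_score(y_test, model.predict(X_test))
    print(f"{name:>14} training data -> test accuracy {acc:.2f}")
```

In a toy run like this, the model trained on uniformly sampled conditions typically scores higher on the held-out set, mirroring the paper's qualitative finding; exact numbers vary with the random seed.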

More information: Xiwen Jia et al. Anthropogenic biases in chemical reaction data hinder exploratory inorganic synthesis, Nature (2019). DOI: 10.1038/s41586-019-1540-5


© 2019 Science X Network
