February 12, 2018

Browser extension helps the visually impaired interpret online images

Imagine browsing the internet without being able to make sense of images. It's a problem that visually impaired computer users face every day. While screen reading technology gives users audible access to written content, it needs written descriptions to interpret images, and often there are none.

Some website developers include descriptions of images in the code (called "alt text") because it improves their websites' search engine rankings. However, there's no mechanism for determining whether these descriptions are accurate or informative. As a result, developers often enter one-word descriptions such as "image" or "photo," leaving the visually impaired with no useful information about the image.

To help address this problem, an ATLAS Institute researcher developed a system that collects captions and alt text attached to other instances of the same photo elsewhere online and associates those human-authored descriptions with the photo on every website where it appears. Called Caption Crawler, the image captioning system compiles descriptions in a database: if a photo has never been queried, the system offers alt text in about 20 seconds; if the photo has previously been processed, alt text is available almost immediately.
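The difference between a first-time query and a repeat query can be illustrated with a small cache. The following is a minimal sketch, not Caption Crawler's actual code: the `CaptionStore` class, the `fake_crawl` function, and the example URL are all hypothetical, and an in-memory dictionary stands in for the database.

```python
from typing import Callable, Dict, List


class CaptionStore:
    """Hypothetical in-memory stand-in for the caption database."""

    def __init__(self, crawl: Callable[[str], List[str]]) -> None:
        self._crawl = crawl  # slow path: search the web for captions
        self._captions: Dict[str, List[str]] = {}
        self.crawl_count = 0  # how many times the slow path ran

    def lookup(self, image_url: str) -> List[str]:
        # A photo that was never queried triggers the slow crawl once;
        # later requests for the same photo are served from the store.
        if image_url not in self._captions:
            self.crawl_count += 1
            self._captions[image_url] = self._crawl(image_url)
        return self._captions[image_url]


def fake_crawl(image_url: str) -> List[str]:
    # Placeholder for the slow web search; returns made-up captions.
    return ["A golden retriever catching a frisbee in a park"]


store = CaptionStore(fake_crawl)
first = store.lookup("https://example.com/dog.jpg")   # slow path
second = store.lookup("https://example.com/dog.jpg")  # served from the store
```

In this sketch the second lookup never calls the crawl function, which mirrors why a previously processed photo can be answered almost immediately.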

The technology was developed by Darren Guinness, a Ph.D. student in the ATLAS Interactive Robotics and Novel Technologies (IRON) Lab and the Superhuman Computing Lab, working in conjunction with Microsoft Research's Edward Cutrell and Meredith Ringel Morris. The research, which merges the benefits of a fully automated system with the quality of human-authored content, will be presented at the Association for Computing Machinery's (ACM) 2018 Conference on Human Factors in Computing Systems (CHI) in Montreal in April.

Users who want Caption Crawler to replace poor-quality alt text press a keyboard shortcut to request a replacement. The screen reader automatically speaks the new caption, which is the longest caption found for that photo. A second shortcut gives access to any additional captions that were found.
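The "longest caption first" rule is simple to express in code. This is a minimal sketch under stated assumptions: the function name `rank_captions` is hypothetical, and trimming whitespace and dropping duplicates are illustrative choices that the article does not describe.

```python
from typing import List, Optional, Tuple


def rank_captions(captions: List[str]) -> Tuple[Optional[str], List[str]]:
    """Return (caption to speak first, remaining captions), longest first."""
    cleaned: List[str] = []
    for caption in captions:
        text = caption.strip()
        # Assumption: skip empty strings and exact duplicates.
        if text and text not in cleaned:
            cleaned.append(text)
    ordered = sorted(cleaned, key=len, reverse=True)
    if not ordered:
        return None, []
    return ordered[0], ordered[1:]


primary, extras = rank_captions([
    "photo",
    "A dog on a beach",
    "A brown dog running along a sandy beach at sunset",
])
```

Here `primary` holds the caption the screen reader would speak first, and `extras` holds the shorter captions a user could step through with the second shortcut.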

Caption Crawler only works with images used on multiple websites, but the approach is effective because about half of website administrators provide informative photo descriptions, Guinness says.

"Although this approach cannot caption unique images that only appear in a single place online, it can increase the accessibility of many online images," he says. "Caption Crawler is a low latency, incredibly low-cost solution to a big problem. It produces human-quality captioning without incurring additional costs in human labeling time."

Caption Crawler combines a Google Chrome browser extension with a Node.js cloud server. The browser extension searches the Document Object Model (DOM) of the active webpage for image tags and background images, which are then sent to the server for caption retrieval. When Caption Crawler finds a caption for an image, the caption is streamed back to the browser extension, which then associates the caption with the image.
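The page-scanning step can be approximated outside a browser. The sketch below is an illustration, not the extension's code: it uses Python's standard `html.parser` on static HTML rather than a live DOM, it only sees background images declared in inline `style` attributes, and the names `ImageCollector` and `collect_image_urls` are hypothetical.

```python
import re
from html.parser import HTMLParser
from typing import List

# Matches url(...) in an inline "background" or "background-image" declaration.
_BG_URL = re.compile(
    r"background(?:-image)?\s*:[^;]*?url\(\s*['\"]?([^'\")]+)['\"]?\s*\)",
    re.IGNORECASE,
)


class ImageCollector(HTMLParser):
    """Collects image URLs from <img> tags and inline background styles."""

    def __init__(self) -> None:
        super().__init__()
        self.image_urls: List[str] = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "img" and attributes.get("src"):
            self._add(attributes["src"])
        style = attributes.get("style") or ""
        for match in _BG_URL.finditer(style):
            self._add(match.group(1))

    def _add(self, url: str) -> None:
        # Keep each URL once, in the order it first appears on the page.
        if url not in self.image_urls:
            self.image_urls.append(url)


def collect_image_urls(html: str) -> List[str]:
    collector = ImageCollector()
    collector.feed(html)
    collector.close()
    return collector.image_urls


page = """
<div style="background-image: url('https://example.com/hero.jpg')">
  <img src="https://example.com/dog.jpg" alt="image">
  <img src="https://example.com/dog.jpg">
</div>
"""
urls = collect_image_urls(page)
```

A real extension would more likely read computed styles from the live page, so that background images set in stylesheets are found as well. The resulting list of URLs is what would be sent to the server for caption retrieval.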

Research shows that humans produce higher-quality captions than automated computer and machine-learning-based approaches, Guinness says. Caption Crawler uses a hybrid system that draws on both, prioritizing human captions over machine-learning and computer-vision-based approaches. If no human-authored captions can be found, computer-generated captions from Microsoft's CaptionBot are used to describe the image. When text from CaptionBot is read aloud, the screen reader first speaks the word "CaptionBot" so that the user is aware that the caption is not human-authored.
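The priority order described here, human captions first and a labeled machine caption only as a fallback, can be sketched in a few lines. This is an illustration under assumptions: `caption_for_screen_reader` is a hypothetical name, the machine captioner is passed in as a plain function rather than a real CaptionBot call, and the exact "CaptionBot: " prefix format is made up.

```python
from typing import Callable, List


def caption_for_screen_reader(
    human_captions: List[str],
    machine_caption: Callable[[], str],
) -> str:
    """Prefer a human-authored caption; otherwise label the machine one."""
    usable = [c.strip() for c in human_captions if c.strip()]
    if usable:
        # Human-authored text wins; the longest caption is read first.
        return max(usable, key=len)
    # Fallback: prefix the text so the listener knows it is machine-generated.
    return "CaptionBot: " + machine_caption()


with_human = caption_for_screen_reader(
    ["", "A cat asleep on a windowsill"],
    lambda: "a cat sitting on a window",
)
without_human = caption_for_screen_reader(
    [],
    lambda: "a cat sitting on a window",
)
```

Passing the machine captioner as a function means it is only called when no human caption exists, which keeps the automated service out of the common path.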

"Hybrid systems that meld both human-quality text and machine learning approaches hold a lot of promise for improving access to online media," Guinness says.

More information: Caption Crawler: Enabling Reusable Alternative Text Descriptions using Reverse Image Search: www-cs.stanford.edu/~merrie/pa … s/captioncrawler.pdf

Provided by University of Colorado at Boulder

Citation: Browser extension helps the visually impaired interpret online images (2018, February 12) retrieved 25 April 2024 from https://techxplore.com/news/2018-02-browser-extension-visually-impaired-online.html

