SlideShare a Scribd company logo
1 of 82
Lee Stearns1, Ruofei Du1, Uran Oh1, Catherine Jou1, Leah Findlater2, David A. Ross3, Jon E. Froehlich1
University of Maryland: Computer Science1, Information Studies2, Atlanta VA R&D Center for Visual & Neurocognitive Rehabilitation3
Evaluating Haptic and Auditory Guidance to
Assist Blind People in Reading Printed Text
Using Finger-Mounted Cameras
TACCESS | ASSETS 2016
What if printed text could be accessed
through touch in the same way as braille?
*Video Credit: YouTube—Ginny Owens—How I See It (Reading Braille)
What if printed text could be accessed
through touch in the same way as braille?
What if printed text could be accessed
through touch in the same way as braille?
Reading printed materials is still an important but
challenging task for people with visual impairments
POPULAR READING DEVICES
POPULAR READING DEVICES
Scanner | OCR | Screen Reader
POPULAR READING DEVICES
Dedicated devices (e.g., video magnifiers)
POPULAR READING DEVICES
Smartphone apps (e.g., KNFB Reader iOS)
POPULAR READING DEVICES
Wearable Cameras (e.g., OrCam)
Scanner | OCR | Screen Reader Dedicated Devices (e.g., video magifiers)
Smartphone Apps (e.g., KNFB Reader iOS) Wearable Cameras (e.g., OrCam)
POPULAR READING DEVICES
Open Questions (Existing Devices)
1. How to assist with aiming the camera
to capture desired content?
Open Questions (Existing Devices)
1. How to assist with aiming the camera
to capture desired content?
2. How to handle complex documents
and convey layout information?
HANDSIGHT
A vision-augmented touch system
HANDSIGHT
A vision-augmented touch system
Tiny CMOS cameras,
HANDSIGHT
Tiny CMOS cameras,
haptic actuators mounted
on one or more fingers
A vision-augmented touch system
HANDSIGHT
Tiny CMOS cameras,
haptic actuators mounted
on one or more fingers
Smartwatch for power,
processing, speech
and audio output
A vision-augmented touch system
HANDSIGHT
Tiny CMOS cameras,
haptic actuators mounted
on one or more fingers
Smartwatch for power,
processing, speech
and audio output
A vision-augmented touch system * Originally proposed in Stearns et al. 2014
AUGMENTING THE USER’S FINGER
Survey: Digital Digits (Shikrot et al. 2015)
Camera & Optical Mouse Sensor
AUGMENTING THE USER’S FINGER
Magic Finger (Yang et al. 2012)
Camera and
Vibration Motor
AUGMENTING THE USER’S FINGER
FingerReader (Shilkrot et al. 2014, 2015)
Camera
Vibration Motors
AUGMENTING THE USER’S FINGER
HandSight (Stearns et al. 2014)
Processing+Power
Advantages of Finger-Based Reading
1. Does not require framing an overhead camera
Advantages of Finger-Based Reading
1. Does not require framing an overhead camera
2. Allows direct access to spatial information
Advantages of Finger-Based Reading
1. Does not require framing an overhead camera
2. Allows direct access to spatial information
3. Provides better control over pace and rereading
Advantages of Finger-Based Reading
1. Does not require framing an overhead camera
2. Allows direct access to spatial information
3. Provides better control over pace and rereading
Advantages of Finger-Based Reading
1. Does not require framing an overhead camera
2. Allows direct access to spatial information
3. Provides better control over pace and rereading
New Challenges
1. How to precisely trace a line of text?
2. How to support physical navigation?
COMPARING TWO TYPES OF
DIRECTIONAL FINGER GUIDANCE
2. Audio via built-in
or external speakers
1. Finger-mounted
vibration motors
1. Finger-mounted
vibration motors
COMPARING TWO TYPES OF
DIRECTIONAL FINGER GUIDANCE
Move up
1. Finger-mounted
vibration motors
COMPARING TWO TYPES OF
DIRECTIONAL FINGER GUIDANCE
Move down
1. Finger-mounted
vibration motors
COMPARING TWO TYPES OF
DIRECTIONAL FINGER GUIDANCE
2. Audio via built-in
or external speakers
COMPARING TWO TYPES OF
DIRECTIONAL FINGER GUIDANCE
Higher pitch: move up
2. Audio via built-in
or external speakers
COMPARING TWO TYPES OF
DIRECTIONAL FINGER GUIDANCE
Lower pitch: move down
2. Audio via built-in
or external speakers
COMPARING TWO TYPES OF
DIRECTIONAL FINGER GUIDANCE
Research Questions
1. To what extent are finger-based cameras a viable
accessibility solution for reading printed text?
2. What design choices can improve this viability?
Study Overview
Study I: initial iPad study (19 participants) Study II: physical prototype study (4 participants)
Study I: initial iPad study (19 participants) Study II: physical prototype study (4 participants)
Study Overview
Study I: initial iPad study (19 participants)
Study Overview
Study I: initial iPad study (19 participants)
Goals:
Compare audio/haptic
Explore & interpret spatial layouts
Assess reading and comprehension
Study Overview
Study I Method
Used an iPad to focus on user experience, gather finger trace data
Study I Method
Used an iPad to focus on user experience, gather finger trace data
19 participants
Median Age 48 (SD=12, Range=26-67)
Gender 11 Male, 8 Female
Vision Level 10 Totally Blind, 9 Light Sensitive
Study I Method
Used an iPad to focus on user experience, gather finger trace data
19 participants
Within-subjects, two guidance
conditions: audio and haptic
Haptic vibrations
Audio pitch
Study I Method
Used an iPad to focus on user experience, gather finger trace data
19 participants
Within-subjects, two guidance conditions: audio and haptic
Participants read two documents for each condition
plain magazine
Study I Method
Used an iPad to focus on user experience,
gather finger trace data
19 participants
Within-subject, two guidance
conditions: audio and haptic
Participants read two documents for each condition
Analysis: reading speed and accuracy,
comprehension, subjective feedback
audio
haptic
Exploration Mode Reading Mode
System Design: Exploration and Reading Modes
Continuous audio feedback to identify content beneath finger
Flute sound: text
Cello sound: picture
Silence: empty space
Same across both conditions
System Design: Exploration Mode
Flute sound: text
Silence: empty space
Cello sound: picture
Bimanual: right index finger to read, left to anchor start of line
Directional guidance: audio or haptic depending on condition
Used to stay on the line or find the start of the next line
Audio: pitch of continuous audio
Haptic: strength and position of vibration
Additional audio cues (same for both conditions)
Start/end of line or paragraph
Synthesized speech
System Design: Reading Mode
Above the line: downward guidance
(low pitch or lower vibration motor)
Below the line: upward guidance
(high pitch or upper vibration motor)
Start/end of line or paragraph
(short but distinctive audio cues)
Study I Findings
Haptic vs. Audio: Quantitative Performance
Study I Findings
Haptic vs. Audio: Quantitative Performance
Line tracing / magazine documents: Audio significantly more accurate (p = 0.018)
Study I Findings
Haptic vs. Audio: Quantitative Performance
Line tracing / magazine documents: Audio significantly more accurate (p = 0.018)
audio haptic
Example finger traces—Dashed red lines mark drift off of the line
Study I Findings
Haptic vs. Audio: Quantitative Performance
Line tracing / magazine documents: Audio significantly more accurate (p = 0.018)
Comprehension high, no significant differences between conditions
audio haptic
Example finger traces—Dashed red lines mark drift off of the line
Study I Findings
Haptic vs. Audio: Subjective Preference
Preferences split (11 haptic, 7 audio, 1 equal preference)
Study I Findings
Haptic vs. Audio: Subjective Preference
Preferences split (11 haptic, 7 audio, 1 equal preference)
Preferred Haptic
More intuitive
Easier to use
Faster
Less distracting
Study I Findings
Haptic vs. Audio: Subjective Preference
Preferences split (11 haptic, 7 audio, 1 equal preference)
Preferred Haptic Preferred Audio
More intuitive
Easier to use
Faster
Less distracting
Less confusing
More comfortable
No desensitization
Study I Findings
Haptic vs. Audio: Subjective Preference
Preferences split (11 haptic, 7 audio, 1 equal preference)
Preferred Haptic Preferred Audio
More intuitive
Easier to use
Faster
Less distracting
Less confusing
More comfortable
No desensitization
Reflects contradictory findings in Stearns et al. 2014, Shilkrot et al. 2014, 2015
Study I Findings
Overall Reading Experience
Pros
Low learning curve
Flexible
Direct control over speed
Study I Findings
Overall Reading Experience
Pros Cons
Low learning curve
Flexible
Direct control over speed
Hard to use for reading
High cognitive load may
affect comprehension
Study I Findings
Exploration Mode
Participants appreciated direct access to spatial information, and
nearly all able to locate images and count the number of columns.
Study I: initial iPad study (19 participants) Study II: physical prototype study (4 participants)
Study Overview
Study I: initial iPad study (19 participants) Study II: physical prototype study (4 participants)
Study Overview
Goals:
Evaluate HandSight prototype
Gather subjective feedback
Compare with KNFB Reader iOS
Study II: physical prototype study (4 participants)
Study Overview
Finger-mounted camera to read physical documents
Study II: HandSight Prototype System
Study II Method
HandSight:
Each participant used their preferred guidance from Study I
to explore and read two documents
Study II Method
KNFB Reader iOS:
Photograph and read 3 physical documents
Study II Findings
HandSight: Overall Experience
Average reading speed: 45 wpm (SD=19, Range=18-60)
Rated somewhat easy to use, but slow and required concentration
Study II Findings
HandSight: Overall Experience
Average reading speed: 45 wpm (SD=19, Range=18-60)
Rated somewhat easy to use, but slow and required concentration
Participant Quotes:
“I’m very pleased and excited about the system. I think it could make a
great difference in my life.” (P19)
“It seems like a lot of effort for reading text.” (P12)
Study II Findings
HandSight vs. KNFB Reader iOS
Participants unanimously preferred KNFB Reader iOS
HandSight KNFB Reader iOS
Study II Findings
HandSight vs. KNFB Reader iOS
Participants unanimously preferred KNFB Reader iOS
Faster, easier to concentrate on the content of the text
HandSight KNFB Reader iOS
Implications
Pros
Feasibility of a Finger-Based Reading Approach
Implications
Pros
Spatial layout information
Feasibility of a Finger-Based Reading Approach
Implications
Pros
Spatial layout information
Direct control over reading
Feasibility of a Finger-Based Reading Approach
Implications
Pros
Spatial layout information
Direct control over reading
Reduced camera framing issues
Feasibility of a Finger-Based Reading Approach
Implications
Pros
Spatial layout information
Direct control over reading
Reduced camera framing issues
Efficient text detection and recognition
Feasibility of a Finger-Based Reading Approach
Implications
Pros
Spatial layout information
Direct control over reading
Reduced camera framing issues
Efficient text detection and recognition
* We observed these in our studies
Feasibility of a Finger-Based Reading Approach
Implications
Feasibility of a Finger-Based Reading Approach
Pros Cons
Spatial layout information
Direct control over reading
Reduced camera framing issues
Efficient text detection and recognition
* We observed these in our studies
Slower, requires increased
concentration and physical dexterity
Implications
Feasibility of a Finger-Based Reading Approach
Pros Cons
Spatial layout information
Direct control over reading
Reduced camera framing issues
Efficient text detection and recognition
* We observed these in our studies
Slower, requires increased
concentration and physical dexterity
* Consistent with Shilkrot et al. 2014, 2015
Implications
Feasibility of a Finger-Based Reading Approach
Pros Cons
Spatial layout information
Direct control over reading
Reduced camera framing issues
Efficient text detection and recognition
* We observed these in our studies
Slower, requires increased
concentration and physical dexterity
* Consistent with Shilkrot et al. 2014, 2015
Importance of spatial layout
information is unclear
Implications
Feasibility of a Finger-Based Reading Approach
Pros Cons
Spatial layout information
Direct control over reading
Reduced camera framing issues
Efficient text detection and recognition
* We observed these in our studies
Slower, requires increased
concentration and physical dexterity
* Consistent with Shilkrot et al. 2014, 2015
Importance of spatial layout
information is unclear
* Has yet to be investigated in this context
Future Work
Study utility of spatial layout information in everyday use
(e.g., newspapers, menus, maps, graphs)
Future Work
Study utility of spatial layout information
Explore possibilities for camera placement
HANDSIGHT
a vision augmented touch system
Questions?
Contact: lstearns@umd.edu
Thank you to our participants and the Maryland State Library for the Blind and Physically Handicapped.
This research was funded by the Department of Defense.
Lee Stearns1, Ruofei Du1, Uran Oh1, Catherine Jou1, Leah Findlater2, David A. Ross3, Jon E. Froehlich1
University of Maryland: Computer Science1, Information Studies2, Atlanta VA R&D Center for Visual & Neurocognitive Rehabilitation3
Evaluating Haptic and Auditory Guidance to
Assist Blind People in Reading Printed Text
Using Finger-Mounted Cameras
TACCESS | ASSETS 2016

More Related Content

Recently uploaded

Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxnoordubaliya2003
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationColumbia Weather Systems
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx023NiWayanAnggiSriWa
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptJoemSTuliba
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 

Recently uploaded (20)

Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptx
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather Station
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.ppt
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 

Featured

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Evaluating Haptic and Auditory Guidance to Assist Blind People in Reading Printed Text Using Finger-Mounted Cameras

  • 1. Lee Stearns1, Ruofei Du1, Uran Oh1, Catherine Jou1, Leah Findlater2, David A. Ross3, Jon E. Froehlich1 University of Maryland: Computer Science1, Information Studies2, Atlanta VA R&D Center for Visual & Neurocognitive Rehabilitation3 Evaluating Haptic and Auditory Guidance to Assist Blind People in Reading Printed Text Using Finger-Mounted Cameras TACCESS | ASSETS 2016
  • 2. What if printed text could be accessed through touch in the same way as braille? *Video Credit: YouTube—Ginny Owens—How I See It (Reading Braille)
  • 3. What if printed text could be accessed through touch in the same way as braille?
  • 4. What if printed text could be accessed through touch in the same way as braille? Reading printed materials is still an important but challenging task for people with visual impairments
  • 6. POPULAR READING DEVICES Scanner | OCR | Screen Reader
  • 7. POPULAR READING DEVICES Dedicated devices (e.g., video magnifiers)
  • 8. POPULAR READING DEVICES Smartphone apps (e.g., KNFB Reader iOS)
  • 9. POPULAR READING DEVICES Wearable Cameras (e.g., OrCam)
  • 10. Scanner | OCR | Screen Reader Dedicated Devices (e.g., video magifiers) Smartphone Apps (e.g., KNFB Reader iOS) Wearable Cameras (e.g., OrCam) POPULAR READING DEVICES
  • 11. Open Questions (Existing Devices) 1. How to assist with aiming the camera to capture desired content?
  • 12. Open Questions (Existing Devices) 1. How to assist with aiming the camera to capture desired content? 2. How to handle complex documents and convey layout information?
  • 14. HANDSIGHT A vision-augmented touch system Tiny CMOS cameras,
  • 15. HANDSIGHT Tiny CMOS cameras, haptic actuators mounted on one or more fingers A vision-augmented touch system
  • 16. HANDSIGHT Tiny CMOS cameras, haptic actuators mounted on one or more fingers Smartwatch for power, processing, speech and audio output A vision-augmented touch system
  • 17. HANDSIGHT Tiny CMOS cameras, haptic actuators mounted on one or more fingers Smartwatch for power, processing, speech and audio output A vision-augmented touch system * Originally proposed in Stearns et al. 2014
  • 18. AUGMENTING THE USER’S FINGER Survey: Digital Digits (Shikrot et al. 2015)
  • 19. Camera & Optical Mouse Sensor AUGMENTING THE USER’S FINGER Magic Finger (Yang et al. 2012)
  • 20. Camera and Vibration Motor AUGMENTING THE USER’S FINGER FingerReader (Shilkrot et al. 2014, 2015)
  • 21. Camera Vibration Motors AUGMENTING THE USER’S FINGER HandSight (Stearns et al. 2014) Processing+Power
  • 22. Advantages of Finger-Based Reading 1. Does not require framing an overhead camera
  • 23. Advantages of Finger-Based Reading 1. Does not require framing an overhead camera 2. Allows direct access to spatial information
  • 24. Advantages of Finger-Based Reading 1. Does not require framing an overhead camera 2. Allows direct access to spatial information 3. Provides better control over pace and rereading
  • 25. Advantages of Finger-Based Reading 1. Does not require framing an overhead camera 2. Allows direct access to spatial information 3. Provides better control over pace and rereading
  • 26. Advantages of Finger-Based Reading 1. Does not require framing an overhead camera 2. Allows direct access to spatial information 3. Provides better control over pace and rereading New Challenges 1. How to precisely trace a line of text? 2. How to support physical navigation?
  • 27. COMPARING TWO TYPES OF DIRECTIONAL FINGER GUIDANCE 2. Audio via built-in or external speakers 1. Finger-mounted vibration motors
  • 28. 1. Finger-mounted vibration motors COMPARING TWO TYPES OF DIRECTIONAL FINGER GUIDANCE
  • 29. Move up 1. Finger-mounted vibration motors COMPARING TWO TYPES OF DIRECTIONAL FINGER GUIDANCE
  • 30. Move down 1. Finger-mounted vibration motors COMPARING TWO TYPES OF DIRECTIONAL FINGER GUIDANCE
  • 31. 2. Audio via built-in or external speakers COMPARING TWO TYPES OF DIRECTIONAL FINGER GUIDANCE
  • 32. Higher pitch: move up 2. Audio via built-in or external speakers COMPARING TWO TYPES OF DIRECTIONAL FINGER GUIDANCE
  • 33. Lower pitch: move down 2. Audio via built-in or external speakers COMPARING TWO TYPES OF DIRECTIONAL FINGER GUIDANCE
  • 34. Research Questions 1. To what extent are finger-based cameras a viable accessibility solution for reading printed text? 2. What design choices can improve this viability?
  • 35. Study Overview Study I: initial iPad study (19 participants) Study II: physical prototype study (4 participants)
  • 36. Study I: initial iPad study (19 participants) Study II: physical prototype study (4 participants) Study Overview
  • 37. Study I: initial iPad study (19 participants) Study Overview Study I: initial iPad study (19 participants) Goals: Compare audio/haptic Explore & interpret spatial layouts Assess reading and comprehension Study Overview
  • 38. Study I Method Used an iPad to focus on user experience, gather finger trace data
  • 39. Study I Method Used an iPad to focus on user experience, gather finger trace data 19 participants Median Age 48 (SD=12, Range=26-67) Gender 11 Male, 8 Female Vision Level 10 Totally Blind, 9 Light Sensitive
  • 40. Study I Method Used an iPad to focus on user experience, gather finger trace data 19 participants Within-subjects, two guidance conditions: audio and haptic Haptic vibrations Audio pitch
  • 41. Study I Method Used an iPad to focus on user experience, gather finger trace data 19 participants Within-subjects, two guidance conditions: audio and haptic Participants read two documents for each condition plain magazine
  • 42. Study I Method Used an iPad to focus on user experience, gather finger trace data 19 participants Within-subject, two guidance conditions: audio and haptic Participants read two documents for each condition Analysis: reading speed and accuracy, comprehension, subjective feedback audio haptic
  • 43. Exploration Mode Reading Mode System Design: Exploration and Reading Modes
  • 44. Continuous audio feedback to identify content beneath finger Flute sound: text Cello sound: picture Silence: empty space Same across both conditions System Design: Exploration Mode
  • 45. Flute sound: text Silence: empty space Cello sound: picture
  • 46. Bimanual: right index finger to read, left to anchor start of line Directional guidance: audio or haptic depending on condition Used to stay on the line or find the start of the next line Audio: pitch of continuous audio Haptic: strength and position of vibration Additional audio cues (same for both conditions) Start/end of line or paragraph Synthesized speech System Design: Reading Mode
  • 47. Above the line: downward guidance (low pitch or lower vibration motor) Below the line: upward guidance (high pitch or upper vibration motor) Start/end of line or paragraph (short but distinctive audio cues)
  • 48. Study I Findings Haptic vs. Audio: Quantitative Performance
  • 49. Study I Findings Haptic vs. Audio: Quantitative Performance Line tracing / magazine documents: Audio significantly more accurate (p = 0.018)
  • 50. Study I Findings Haptic vs. Audio: Quantitative Performance Line tracing / magazine documents: Audio significantly more accurate (p = 0.018) audio haptic Example finger traces—Dashed red lines mark drift off of the line
  • 51. Study I Findings Haptic vs. Audio: Quantitative Performance Line tracing / magazine documents: Audio significantly more accurate (p = 0.018) Comprehension high, no significant differences between conditions audio haptic Example finger traces—Dashed red lines mark drift off of the line
  • 52. Study I Findings Haptic vs. Audio: Subjective Preference Preferences split (11 haptic, 7 audio, 1 equal preference)
  • 53. Study I Findings Haptic vs. Audio: Subjective Preference Preferences split (11 haptic, 7 audio, 1 equal preference) Preferred Haptic More intuitive Easier to use Faster Less distracting
  • 54. Study I Findings Haptic vs. Audio: Subjective Preference Preferences split (11 haptic, 7 audio, 1 equal preference) Preferred Haptic Preferred Audio More intuitive Easier to use Faster Less distracting Less confusing More comfortable No desensitization
  • 55. Study I Findings Haptic vs. Audio: Subjective Preference Preferences split (11 haptic, 7 audio, 1 equal preference) Preferred Haptic Preferred Audio More intuitive Easier to use Faster Less distracting Less confusing More comfortable No desensitization Reflects contradictory findings in Stearns et al. 2014, Shilkrot et al. 2014, 2015
  • 56. Study I Findings Overall Reading Experience Pros Low learning curve Flexible Direct control over speed
  • 57. Study I Findings Overall Reading Experience Pros Cons Low learning curve Flexible Direct control over speed Hard to use for reading High cognitive load may affect comprehension
  • 58. Study I Findings Exploration Mode Participants appreciated direct access to spatial information, and nearly all able to locate images and count the number of columns.
  • 59. Study I: initial iPad study (19 participants) Study II: physical prototype study (4 participants) Study Overview
  • 60. Study I: initial iPad study (19 participants) Study II: physical prototype study (4 participants) Study Overview
  • 61. Goals: Evaluate HandSight prototype Gather subjective feedback Compare with KNFB Reader iOS Study II: physical prototype study (4 participants) Study Overview
  • 62. Finger-mounted camera to read physical documents Study II: HandSight Prototype System
  • 63. Study II Method HandSight: Each participant used their preferred guidance from Study I to explore and read two documents
  • 64. Study II Method KNFB Reader iOS: Photograph and read 3 physical documents
  • 65. Study II Findings HandSight: Overall Experience Average reading speed: 45 wpm (SD=19, Range=18-60) Rated somewhat easy to use, but slow and required concentration
  • 66. Study II Findings HandSight: Overall Experience Average reading speed: 45 wpm (SD=19, Range=18-60) Rated somewhat easy to use, but slow and required concentration Participant Quotes: “I’m very pleased and excited about the system. I think it could make a great difference in my life.” (P19) “It seems like a lot of effort for reading text.” (P12)
  • 67. Study II Findings HandSight vs. KNFB Reader iOS Participants unanimously preferred KNFB Reader iOS HandSight KNFB Reader iOS
  • 68. Study II Findings HandSight vs. KNFB Reader iOS Participants unanimously preferred KNFB Reader iOS Faster, easier to concentrate on the content of the text HandSight KNFB Reader iOS
  • 69. Implications Pros Feasibility of a Finger-Based Reading Approach
  • 70. Implications Pros Spatial layout information Feasibility of a Finger-Based Reading Approach
  • 71. Implications Pros Spatial layout information Direct control over reading Feasibility of a Finger-Based Reading Approach
  • 72. Implications Pros Spatial layout information Direct control over reading Reduced camera framing issues Feasibility of a Finger-Based Reading Approach
  • 73. Implications Pros Spatial layout information Direct control over reading Reduced camera framing issues Efficient text detection and recognition Feasibility of a Finger-Based Reading Approach
  • 74. Implications Pros Spatial layout information Direct control over reading Reduced camera framing issues Efficient text detection and recognition * We observed these in our studies Feasibility of a Finger-Based Reading Approach
  • 75. Implications Feasibility of a Finger-Based Reading Approach Pros Cons Spatial layout information Direct control over reading Reduced camera framing issues Efficient text detection and recognition * We observed these in our studies Slower, requires increased concentration and physical dexterity
  • 76. Implications Feasibility of a Finger-Based Reading Approach Pros Cons Spatial layout information Direct control over reading Reduced camera framing issues Efficient text detection and recognition * We observed these in our studies Slower, requires increased concentration and physical dexterity * Consistent with Shilkrot et al. 2014, 2015
  • 77. Implications Feasibility of a Finger-Based Reading Approach Pros Cons Spatial layout information Direct control over reading Reduced camera framing issues Efficient text detection and recognition * We observed these in our studies Slower, requires increased concentration and physical dexterity * Consistent with Shilkrot et al. 2014, 2015 Importance of spatial layout information is unclear
  • 78. Implications Feasibility of a Finger-Based Reading Approach Pros Cons Spatial layout information Direct control over reading Reduced camera framing issues Efficient text detection and recognition * We observed these in our studies Slower, requires increased concentration and physical dexterity * Consistent with Shilkrot et al. 2014, 2015 Importance of spatial layout information is unclear * Has yet to be investigated in this context
  • 79. Future Work Study utility of spatial layout information in everyday use (e.g., newspapers, menus, maps, graphs)
  • 80. Future Work Study utility of spatial layout information Explore possibilities for camera placement
  • 82. Questions? Contact: lstearns@umd.edu Thank you to our participants and the Maryland State Library for the Blind and Physically Handicapped. This research was funded by the Department of Defense. Lee Stearns1, Ruofei Du1, Uran Oh1, Catherine Jou1, Leah Findlater2, David A. Ross3, Jon E. Froehlich1 University of Maryland: Computer Science1, Information Studies2, Atlanta VA R&D Center for Visual & Neurocognitive Rehabilitation3 Evaluating Haptic and Auditory Guidance to Assist Blind People in Reading Printed Text Using Finger-Mounted Cameras TACCESS | ASSETS 2016

Editor's Notes

  1. Hello everyone! My name is Lee Stearns and I’m from the University of Maryland. Today I’m going to be presenting my group’s research, titled “Evaluating Haptic and Auditory Guidance to Assist Blind People in Reading Printed Text Using Finger-Mounted Cameras”.
  2. We began this project with a question: what if printed text could be accessed through touch in the same way as braille? Touch-based reading is a highly tactile experience that enables easy exploration of spatial layouts and direct control over reading speed or order.
  3. But relatively little material is available in Braille, and even with the increasing availability of digital information and screen reader software, reading printed text materials is still an important but challenging task for people with visual impairments.
  4. But relatively little material is available in Braille, and even with the increasing availability of digital information and screen reader software, reading printed text materials is still an important but challenging task for people with visual impairments.
  5. We looked at several existing devices that offer reading solutions,
  6. from stationary scanners and screen reader software [next slide] to dedicated devices such as desktop video magnifiers, [next slide] to mobile phone applications like KNFB Reader [next slide] and more recently to wearable cameras such as OrCam.
  7. from stationary scanners and screen reader software [next slide] to dedicated devices such as desktop video magnifiers, [next slide] to mobile phone applications like KNFB Reader [next slide] and more recently to wearable cameras such as OrCam.
  8. from stationary scanners and screen reader software [next slide] to dedicated devices such as desktop video magnifiers, [next slide] to mobile phone applications like KNFB Reader [next slide] and more recently to wearable cameras such as OrCam.
  9. from stationary scanners and screen reader software [next slide] to dedicated devices such as desktop video magnifiers, [next slide] to mobile phone applications like KNFB Reader [next slide] and more recently to wearable cameras such as OrCam.
  10. These devices have gone a long way toward improving the accessibility of printed materials, but several open questions still remain.
  11. One important question for handheld or wearable devices is how to help with aiming the camera to capture the target content. For example, the KNFB Reader app and OrCam both provide spoken field of view reports to assist with aiming, but the process can still be challenging for blind readers.
  12. Another question is how to interpret and communicate documents with complex layouts such as newspapers or menus. Even with digital content it can be difficult to decide the order to read different blocks of text, and what layout details to convey.
  13. As an alternative to these solutions, we envision a system called HandSight that’s made up of tiny cameras [next slide] and haptics [next slide] integrated directly with the user’s fingers. It would be connected to a smartwatch [next slide] for power, processing, and speech or audio output. [next slide] We first proposed this idea as a system that augments the sense of touch for several different applications, but our focus so far has primarily been on reading printed text.
  14. As an alternative to these solutions, we envision a system called HandSight that’s made up of tiny cameras [next slide] and haptics [next slide] integrated directly with the user’s fingers. It would be connected to a smartwatch [next slide] for power, processing, and speech or audio output. [next slide] We first proposed this idea as a system that augments the sense of touch for several different applications, but our focus so far has primarily been on reading printed text.
  15. As an alternative to these solutions, we envision a system called HandSight that’s made up of tiny cameras [next slide] and haptics [next slide] integrated directly with the user’s fingers. It would be connected to a smartwatch [next slide] for power, processing, and speech or audio output. [next slide] We first proposed this idea as a system that augments the sense of touch for several different applications, but our focus so far has primarily been on reading printed text.
  16. As an alternative to these solutions, we envision a system called HandSight that’s made up of tiny cameras [next slide] and haptics [next slide] integrated directly with the user’s fingers. It would be connected to a smartwatch [next slide] for power, processing, and speech or audio output. [next slide] We first proposed this idea as a system that augments the sense of touch for several different applications, but our focus so far has primarily been on reading printed text.
  17. As an alternative to these solutions, we envision a system called HandSight that’s made up of tiny cameras [next slide] and haptics [next slide] integrated directly with the user’s fingers. It would be connected to a smartwatch [next slide] for power, processing, and speech or audio output. [next slide] We first proposed this idea as a system that augments the sense of touch for several different applications, but our focus so far has primarily been on reading printed text.
  18. The idea of augmenting the user’s finger has become more and more common in recent years. There’s a survey paper called Digital Digits that gives a good overview, but today I’m only going to mention two projects that are particularly relevant to our research.
  19. The first is Magic Finger, which combined a small camera with an optical mouse sensor at the tip of the user’s finger to recognize gestures on any surface.
  20. And more specifically related to our goal of reading printed materials is FingerReader, which explored using a ring-based camera with haptic and audio feedback. It allowed blind readers to trace their finger over printed text and hear speech output in real-time.
  21. Our own previous research explored a similar solution. We used a ring design with the same tiny camera as Magic Finger, along with a pair of vibration motors to provide haptic feedback.
  22. Reading by touch has several potential advantages. It enables direct access—you place your finger on the page, and you can immediately start getting feedback and exploring the document. You don’t need to worry about lining up the document with the camera or waiting for it to be scanned and processed.
  23. It may also provide a more intuitive way for users to explore a document’s layout, which we hope could improve comprehension and document understanding, especially for more complex documents.
  24. And since the system reads words only when the finger moves across them, users have better control over pace and rereading.
  25. But a finger-based approach also introduces some new challenges that haven’t been fully investigated.
  26. But a finger-based approach also introduces some new challenges that haven’t been fully investigated. Because the field of view from a finger-mounted camera is limited, the reader has to precisely trace their finger along the current line of text so that the image doesn’t get cut off or distorted. We’ll also need to assist users in physical navigation through the document, such as finding the start of a passage and moving from one line to the next. To overcome these two challenges we’ll need to be able to effectively guide the user’s finger.
  27. We compared two types of guidance to see which would better enable blind readers to trace lines of text and read a document: haptic or audio.
  28. First was the haptic directional guidance, which used a pair of vibration motors mounted on the top and bottom of the user’s finger. These conveyed direction and distance by the position and intensity of the vibration.
  29. The upper motor directed participants to move upwards,
  30. and the lower motor directed them downwards.
  31. Second was audio directional guidance, which used a continuous tone with a variable pitch to convey direction and distance.
  32. In this case a higher pitch directed participants upward
  33. and a lower pitch directed them downward.
  34. Our research was the first to take a detailed look at the directional guidance and general usability of reading with a finger-mounted camera quantitatively as well as subjectively. The main questions driving our research were to learn to what extent finger-based cameras are a viable accessibility solution for reading printed text? And also what design choices could improve that viability?
  35. We conducted two studies to further investigate this idea. The first was a controlled lab experiment with 19 participants, which looked at the user experience of reading through touch and specifically compared the two types of directional finger guidance. The second was a smaller follow-up study with 4 participants, which collected preliminary feedback about our wearable prototype.
  36. The primary goal of the first study was to compare audio and haptic directional finger guidance in terms of user performance and preference. But we also explored the extent that our approach would allow a blind reader to interpret the spatial layout of a document and to read and understand that document.
  37. The primary goal of the first study was to compare audio and haptic directional finger guidance in terms of user performance and preference. But we also explored the extent that our approach would allow a blind reader to interpret the spatial layout of a document and to read and understand that document.
  38. We used an iPad covered with a thin sheet of paper to simulate the experience of reading a physical document. That let us bypass some of the technical challenges and instead focus just on the user experience. It also allowed us to collect precise finger trace data to perform detailed analysis of finger movements.
  39. We recruited 19 participants, all of whom were totally blind or with only minimal light perception.
  40. We used a within-subjects experiment design with 2 directional guidance conditions: audio and haptic. The conditions were identical aside from the type of directional guidance that was provided while the participant was tracing a line of text or searching for the start of the next line.
  41. We asked each participant to read 2 documents for each condition, a plain text document and a magazine-style document that had two columns, a title, and an image.
  42. We collected statistics on the participants’ reading speed and accuracy, their comprehension of each document, and subjective feedback about the ease of use. We used all of that information to compare the two guidance conditions and to evaluate the overall usability of our interface.
  43. Our system had 2 modes: exploration mode and reading mode. For each document, participants first explored to learn its layout and find the first line of text, and then read sequentially through the selected passage.
  44. In the exploration mode, the participant could move around the page however they liked, and the system provided continuous audio feedback to identify the content that was underneath their finger. We chose distinctive high and low pitched sounds to represent text or pictures, and silence to represent empty space. The exploration mode was the same across both conditions. I’m now going to play a video demo of that interface.
  45. [play video]
  46. In the reading mode, participants moved sequentially along each line and the system read each word out loud as their finger passed over it. They could use their right index finger to read and their left as an anchor to help them find the start of the next line. We provided vertical directional guidance to help stay on a line or to locate the start of a new line. For the audio condition, continuous audio cues played when the participant’s finger drifted off of the line, and the pitch changed depending the direction and distance to the line. The haptic condition was very similar, but it used the location and intensity of the vibrations in place of the audio pitch. We also played distinctive audio cues to identify the start and end of each line or paragraph, as well as the speech feedback. Here’s a demo of the reading interface.
  47. [play video]
  48. Our findings highlighted tradeoffs between the two guidance conditions.
  49. The speed and accuracy of the two types of guidance were very similar for both line finding and line tracing, but audio may offer some advantages to accuracy since it was significantly better than haptics while reading the magazine documents.
  50. These images show some example finger traces from a single participant. The solid green lines show when the participant was accurately following the line, and dashed red lines show where they were off the line and receiving directional guidance. Audio guidance tended to prompt a more immediate correction than haptic.
  51. Comprehension was high across all of the documents, with no significant differences between the two conditions.
  52. The participants’ preferences were split, although a majority preferred haptic guidance.
  53. The participants who preferred haptic felt that it was more intuitive, easier to use, or faster than audio. Some also found the audio guidance more distracting, and said that it made it harder to focus on the speech feedback.
  54. On the other hand, almost all of those who preferred audio guidance found it to be less confusing than haptic. Several participants had trouble remembering which direction was indicated by which motor. A few felt that the mapping we used was the reverse of how it should be, and it’s possible that given the option to customize it more would have preferred the haptic guidance. Two participants were also concerned with the comfort level of the haptic version and the potential for desensitization over time.
  55. This split in user preferences reflects the contradictory findings in reported previous studies.
  56. In general, several participants liked the lower learning curve and flexibility of our system compared to other reading approaches, and especially the ability to directly control the reading speed.
  57. However, others raised concerns about ease of use and cognitive load, which could affect comprehension and retention.
  58. Finally, a few participants mentioned that they liked the knowledge about the document’s layout that the exploration mode provided, and nearly all were able to use it to locate images and count the number of columns in a document.
  59. We conducted a follow-up study with 4 participants from Study I who returned to test our proof-of-concept wearable prototype
  60. The first study was primarily focused on the user experience, and the iPad let us performed detailed analysis of the participants finger movements. For this second study, we implemented the interfaces from our iPad version as a fully functioning physical prototype, which we called HandSight. We asked participants to try the system and provide subjective feedback, and also compared it against the KNFB Reader iPhone app, which was the current state of the art for reading printed documents.
  61. The first study was primarily focused on the user experience, and the iPad let us performed detailed analysis of the participants finger movements. For this second study, we implemented the interfaces from our iPad version as a fully functioning physical prototype, which we called HandSight. We asked participants to try the system and provide subjective feedback, and also compared it against the KNFB Reader iPhone app, which was the current state of the art for reading printed documents.
  62. Unfortunately I don’t have time to talk much about our prototype system, but all of the details are available in our paper. We used the same haptic and audio cues as in the first study and the process for exploring and reading a document was very similar from the participants’ perspective.
  63. Participants each used their preferred directional guidance type from the first study to explore and read two documents using our prototype system. The documents had a single column and were approximately the same length and reading level.
  64. Afterward, participants used the KNFB Reader iPhone app to photograph and read 3 more documents. Two of these were similar to those read with our prototype, but the third had two columns since KNFB Reader supports multi-column documents.
  65. All four participants completed the reading tasks with our prototype, but their performance and subjective feedback was mixed. The average reading speed was 45 words per minute, but the speed varied quite a bit. The participants all expressed concern over the level of concentration that was required to interpret the directional guidance and other audio cues while listening to the speech feedback.
  66. Two quotes in particular highlight the difference in opinions. One participant was enthusiastic about the concept, saying “I’m very pleased and excited about the system. I think it could make a great difference in my life.” Another was more critical, finding the approach to be slower than expected: “It seems like a lot of effort for reading text.”
  67. I want to stress that our experiment with the KNFB Reader app was not a controlled comparison. But even without that and without using all of the app’s features, participants unanimously preferred KNFB Reader for the types of documents we asked them read.
  68. The main reason for that was how fluid the reading experience was after the document had been captured, since the app read the document quickly and participants were able to concentrate fully on the content.
  69. This ties into our original questions about the feasibility of a touch-based approach for reading printed text. We expected that it would have many advantages. [next slide] These included information about the spatial layout, [next slide] direct access to the text on the page and control over the reading speed, [next slide] easier camera framing, [next slide] and the potential for more efficient text detection and recognition.
  70. This ties into our original questions about the feasibility of a touch-based approach for reading printed text. We expected that it would have many advantages. [next slide] These included information about the spatial layout, [next slide] direct access to the text on the page and control over the reading speed, [next slide] easier camera framing, [next slide] and the potential for more efficient text detection and recognition.
  71. This ties into our original questions about the feasibility of a touch-based approach for reading printed text. We expected that it would have many advantages. [next slide] These included information about the spatial layout, [next slide] direct access to the text on the page and control over the reading speed, [next slide] easier camera framing, [next slide] and the potential for more efficient text detection and recognition.
  72. This ties into our original questions about the feasibility of a touch-based approach for reading printed text. We expected that it would have many advantages. [next slide] These included information about the spatial layout, [next slide] direct access to the text on the page and control over the reading speed, [next slide] easier camera framing, [next slide] and the potential for more efficient text detection and recognition.
  73. This ties into our original questions about the feasibility of a touch-based approach for reading printed text. We expected that it would have many advantages. [next slide] These included information about the spatial layout, [next slide] direct access to the text on the page and control over the reading speed, [next slide] easier camera framing, [next slide] and the potential for more efficient text detection and recognition.
  74. We did observe these advantages in our experiments, but some important concerns also came up.
  75. Our studies showed that reading text line-by-line can be slow and require high mental and physical effort. The reader has to pay attention to multiple haptic or audio cues along with the speech feedback.
  76. This finding is consistent with previous work in this area, and is one of the biggest limitations of our approach. Practice and improvements to the guidance interface might help, but we’d need to do a longer study over multiple sessions to see how much.
  77. We also don’t know how important spatial layout information will be in general. For relatively simple documents like the ones we used in our studies, it might not have much benefit. But for documents with important spatial information, such as maps and graphs, an approach like ours could become much more useful.
  78. To our knowledge, no one has investigated this question as of yet, at least not in the context of touch-based access to physical documents.
  79. Toward that end, we plan to run a more in-depth study in the near future with these types of documents to see how useful spatial layout information might be in everyday use.
  80. We’ve also considered the idea that a hybrid system might be a good compromise. It could use a body-mounted camera to determine context and read back longer passages, but then use the finger-mounted camera for immediate access and exploration.
  81. Finally, reading is only one of the applications we hope to support, and our goal is to produce a wearable device that augments the sense of touch and helps with a variety of activities of daily living.
  82. That’s it for my presentation. Thank you to everyone on my team, and to all of you attending today.
  83. Previous research has used directional guidance methods similar to these, but in each case the focus was on feasibility. The user studies had small sample sizes of 3 to 4 participants, and didn’t report on quantitative performance metrics. That prevents us from fully understanding the effectiveness of finger guidance, as well as the reading performance and user reactions. Participants in the previous studies also had contradictory preferences when comparing the different types of directional guidance.