Researcher Dilemmas
using Behavioral Big Data in Healthcare
INFORMS Workshop on Data Mining & Decision Analysis
Houston, TX, Oct 21, 2017
Galit Shmueli 徐茉莉
Institute of Service Science
What is Behavioral Big Data (BBD)
Special type of Big Data
• Behavioral: people’s measurable “everyday” behavior, interactions, self-reported opinions, thoughts, feelings
• Human and social aspects: intentions, deception, emotion, reciprocation, herding,…
When people are aware of data collection -> modified behavior (to avoid legal risks, embarrassment, unwanted solicitation)
BBD vs. Inanimate Big Data
Behavioral Big Data: Researcher, Human Subjects, Research Question
Inanimate Big Data: Researcher, Research Question
Human subjects:
1. Aware, ongoing interaction with the BBD - “contaminate” BBD with intention, deception, emotion, herding…
2. Can be harmed by BBD
Figure 1: The types of physiological
data points and the wearable
sensors under development or on
the market to monitor them.
Elenko, Underwood & Zohar (2015),
“Defining Digital Medicine”,
Nature Biotechnology 33, 456-461
BBD vs. Physiological Big Data
Physiological Big Data (from human subjects):
• Physical/bio measurements
• Data collection timing often set by medical system
• Clinical trials: awareness & vested interest
BBD:
• People’s daily actions, interactions, self-reported feelings, opinions, thoughts (UGC)
• Data generation timing often chosen by user
• Experiments: users often unaware; goal not always in user’s interest
Different research methods in life sciences and behavioral sciences
• Measurement instruments
• Models (latent variable models, social network analysis)
• Human subjects risks
“Behavioral Health” Data vs. BBD
• Behaviors: substance abuse & mental health
• Population: patients with mental illness /
substance abuse
• Specific (defined) behavior of patient
• “Big”?
www.carolinashealthcare.org/medical-services/prevention-wellness/behavioral-health
BBD in Healthcare Research
Hospital Data on Patients, Staff, Assets
Patients
• Personal info
• Medical history (visits, tests, medication, hospitalization...)
• Scheduled events, billing
Physicians
• Scheduled + actual appointments, procedures, prescriptions,…
• Entries of patient info/data
Nurses
• Location, work hours,…
Pharmacy staff
• Speed of service
• Quality of service
Lab staff
• Speed of service
• Quality of service
Other staff
• Finance/accounting
• Cleaning
• Receptionists
• Volunteers
• Food court…
Data Collection Technologies:
• Medical devices
• HIT systems (EHR, HR for Health Info System)
---
Smart Hospital
• Cameras
• Sensors
• GPS
• IoT
Typical Research Fields using Hospital Data
Operations Researchers and Industrial Engineers
For: Hospital Management and Operations
(staffing, scheduling,…)
Medical/Healthcare Researchers & Clinicians
For: Improved Medical Treatment
(safety, effectiveness,…)
Information Systems Researchers
For: Improved Design & Use of Medical IS
(value of IS, effectiveness, standardization,…)
Hospital
Telemedicine
/ Telehealth
Remote
Patient
Monitoring
mHealth/ eHealth
Health-“unrelated” behavior
Health-related behavior
New Medical BBD Directions
Behavioral big data also on…
Interactions between
Patients – doctors/nurses
Doctors – other doctors
Patients – other patients
Patient family – hospital staff
Patients – social network “friends”
...
Health-related BBD: Online
• Medical/health websites
• Online forums
• Social networks
• Search engines
Info voluntarily entered by users: personal details, photos, comments,
messages, search terms, likes, payment information, connections with “friends”
Passive footprints: duration on the website, pages browsed, sequence,
referring website, Internet browser, operating system, location, IP address
Quantified “Self”
“Some hospitals are collecting new information
from patients directly, while others have sought
data from companies that sell consumer and
financial information, or federal agencies that
provide statistics on poverty, housing density
and unemployment.”
The big obstacle: access to the data. Doctors and nurses have limited time to collect new data,
and patients bombarded with questions about their lives may suffer “interview fatigue”.
Health-unrelated
BBD
Research Using New Medical BBD: Challenges
Diagram components: Behavioral Big Data, Researcher, Human Subjects, Research Question
Different (conflicting) Goals:
• Scientific vs. Clinical vs. Commercial
• Explain vs. Predict
Generalization Challenges:
• Unit of analysis vs. Unit of measurement
• Under/over-coverage
• Average effect (ATE) vs. individual effect
New ethical challenges:
• New risks (privacy, liability, security, HIPAA compliance)
Data contaminated by:
• Users (self-selection, spill-over, knowledge of allocation, network)
• Company algorithms
Acquire + analyze data
• New modes of connection & information (social networks, forums, IoT)
• Technical expertise
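The "average effect vs. individual effect" item can be illustrated with a small simulation. This is only a sketch on made-up numbers: the 70/30 split of helped and harmed users, the effect sizes, and all variable names are assumptions for illustration, not results from any study in this deck.

```python
import random
import statistics

random.seed(2)
n = 1000

# Hypothetical individual effects: the intervention helps 70% of users (+1)
# and harms the remaining 30% (-1).
individual_effect = [1.0 if random.random() < 0.7 else -1.0 for _ in range(n)]
baseline = [random.gauss(5, 1) for _ in range(n)]

# Randomized assignment to treatment or control.
treated = [random.random() < 0.5 for _ in range(n)]
outcome = [b + (e if t else 0.0)
           for b, e, t in zip(baseline, individual_effect, treated)]

# Average treatment effect (ATE), estimated as a difference in group means.
treated_outcomes = [y for y, t in zip(outcome, treated) if t]
control_outcomes = [y for y, t in zip(outcome, treated) if not t]
ate_estimate = statistics.mean(treated_outcomes) - statistics.mean(control_outcomes)

# In a simulation the individual effects are known; in real data they are not.
true_ate = statistics.mean(individual_effect)
share_harmed = sum(e < 0 for e in individual_effect) / n

print(f"estimated ATE: {ate_estimate:.2f}, true ATE: {true_ate:.2f}")
print(f"share of users harmed: {share_harmed:.0%}")
```

A positive ATE here coexists with a sizable share of users who are harmed, which is the gap between a population-level finding and a claim about any one person.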
Sample Behavioral Healthcare-Related BBD Studies
• Vocal Minority and Silent Majority: How Do Online Ratings Reflect Population Perception of Quality? Gao et al. (MISQ 2015)
• Outcomes matter: estimating pre-transplant survival rates of kidney-transplant patients using simulator-based propensity scores. Yahav & Shmueli (Annals of Oper. Research, 2014)
• Emotional Contagion in Social Networks. Kramer et al. (PNAS, 2014)
• Detecting influenza epidemics using search engine query data. Ginsberg et al. (Nature, 2009)
Emotional Contagion in Social Networks
Kramer et al. (2014), Proceedings of the National Academy of Sciences
• Can emotional states be transferred to others via emotional contagion?
• BBD from large-scale experiment run by FB, manipulating users’
exposure level to emotional expressions in their Facebook News Feed
• No IRB
“[The work] was consistent with Facebook’s Data
Use Policy, to which all users agree prior to
creating an account on Facebook, constituting
informed consent for this research.”
• PNAS editorial Expression of Concern
• Varied response from public, academia, press,
ethicists, corporates
Behavioral Healthcare-Related
BBD Study: Example #2
Detecting influenza epidemics using search engine query data
Ginsberg et al. (2009), Nature
• “Up-to-date influenza estimates may enable public health officials and health
professionals to better respond to seasonal epidemics”
• Researchers from Google and CDC
• BBD: automated search results for 50M keywords on Google.com (2003-2007).
For each query, collected {query text, IP address}
• Analysis: Fit 450M different models, correlating each query text with CDC
data; Combined 45 queries with highest correlation
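The analysis step above (score each query against the CDC series, keep the best-correlated ones, combine them) can be sketched as follows. This is a simplified illustration on synthetic data, not the authors' exact procedure: the query names, the toy series, and the helper functions `pearson`, `top_k_queries`, and `combined_signal` are all hypothetical.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def top_k_queries(query_series, target, k):
    """Rank queries by correlation with the target series; keep the best k."""
    ranked = sorted(query_series,
                    key=lambda q: pearson(query_series[q], target),
                    reverse=True)
    return ranked[:k]


def combined_signal(query_series, selected):
    """Sum the selected queries' weekly values into a single signal."""
    n_weeks = len(next(iter(query_series.values())))
    return [sum(query_series[q][t] for q in selected) for t in range(n_weeks)]


# Synthetic toy data (hypothetical): 52 weeks, 3 flu-like queries, 7 noise queries.
random.seed(0)
weeks = 52
ili = [2 + math.sin(2 * math.pi * t / 52) for t in range(weeks)]  # stand-in for CDC data
queries = {}
for i in range(3):
    queries[f"flu_like_{i}"] = [v + random.gauss(0, 0.1) for v in ili]
for i in range(7):
    queries[f"noise_{i}"] = [random.gauss(0, 1) for _ in range(weeks)]

selected = top_k_queries(queries, ili, k=3)
signal = combined_signal(queries, selected)
print(selected, round(pearson(signal, ili), 3))
```

Selecting on correlation alone picks up anything that moves with the season, which is one reason a query set chosen this way may track "winter" as much as "flu".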
Researchers: epidemiologists + data science academics
Dalton et al. (2016), “Flutracking weekly online community
survey of influenza-like illness annual report, 2015”
Communicable diseases intelligence quarterly report
Challenge: Acquire data
• The algorithm detects “flu” or “winter”
• Persistent over-estimation
• Performs worse than a forecast from lagged (3-week-old) CDC data
• The 45 search terms used were never released
• Changes made by Google’s search algorithm to display potential diagnoses
+ recommend search for treatment (more advertising) -> increased searches
• Lazer et al. recommend combining/calibrating GFT (Google Flu Trends)
with CDC data
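One way to picture the combining/calibrating recommendation is a regression of the current CDC value on the search-based estimate plus the 3-week-old CDC value. The sketch below uses synthetic series and ordinary least squares; the numbers, the 3-week lag, and the helper names `solve_linear`, `fit_ols`, and `rmse` are assumptions for illustration, not the model used by Lazer et al.

```python
import math
import random


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def fit_ols(rows, y):
    """Ordinary least squares via the normal equations (X'X) beta = X'y."""
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(p)]
    return solve_linear(xtx, xty)


def rmse(pred, actual):
    """Root mean squared error between predictions and actual values."""
    return math.sqrt(sum((p - a) ** 2 for p, a in zip(pred, actual)) / len(actual))


# Synthetic toy series (hypothetical numbers, not real GFT or CDC data).
random.seed(1)
weeks = 104
cdc = [2 + math.sin(2 * math.pi * t / 52) + random.gauss(0, 0.05) for t in range(weeks)]
gft = [1.5 * v + 0.5 + random.gauss(0, 0.2) for v in cdc]  # persistent over-estimation

lag = 3  # CDC data assumed to arrive three weeks late
rows = [[1.0, gft[t], cdc[t - lag]] for t in range(lag, weeks)]
target = cdc[lag:]
beta = fit_ols(rows, target)
calibrated = [sum(b * x for b, x in zip(beta, r)) for r in rows]

raw_error = rmse(gft[lag:], target)
calibrated_error = rmse(calibrated, target)
print(f"raw RMSE: {raw_error:.3f}, calibrated RMSE: {calibrated_error:.3f}")
```

The fit here is evaluated in-sample, so the improvement is optimistic; a fair comparison would refit on past weeks only and score the following weeks.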
Telemedicine
/ Telehealth
Remote
Patient
Monitoring
mHealth/ eHealth
Health-“unrelated” behavior
Health-related behavior
New healthcare BBD offers
new research opportunities
… and new challenges
Analytics
Humanity
Responsibility
Galit Shmueli 徐茉莉
Institute of Service Science

More Related Content

What's hot

Trusted! Quest for data-driven and fair health solutions
Trusted! Quest for data-driven and fair health solutions Trusted! Quest for data-driven and fair health solutions
Trusted! Quest for data-driven and fair health solutions Sitra / Hyvinvointi
 
Presenting health data to patients
Presenting health data to patientsPresenting health data to patients
Presenting health data to patientsKathleen Gray
 
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...Amit Sheth
 
Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...
Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...
Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...Artificial Intelligence Institute at UofSC
 
The Vision for Data @ the NIH
The Vision for Data @ the NIHThe Vision for Data @ the NIH
The Vision for Data @ the NIHPhilip Bourne
 
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...Philip Bourne
 
Health IT, Ethics & Law for Pathologists: Perils or Promises? (March 1, 2019)
Health IT, Ethics & Law for Pathologists: Perils or Promises? (March 1, 2019)Health IT, Ethics & Law for Pathologists: Perils or Promises? (March 1, 2019)
Health IT, Ethics & Law for Pathologists: Perils or Promises? (March 1, 2019)Nawanan Theera-Ampornpunt
 
Evaluating a Potential Commercial Tool for Healthcare Application for People ...
Evaluating a Potential Commercial Tool for Healthcare Application for People ...Evaluating a Potential Commercial Tool for Healthcare Application for People ...
Evaluating a Potential Commercial Tool for Healthcare Application for People ...Artificial Intelligence Institute at UofSC
 
A Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterpriseA Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterprisePhilip Bourne
 
Secure Data Sharing and Related Matters – An NIH View
Secure Data Sharing and Related Matters – An NIH ViewSecure Data Sharing and Related Matters – An NIH View
Secure Data Sharing and Related Matters – An NIH ViewPhilip Bourne
 
Augmented Personalized Health: an explicit knowledge enhanced neurosymbolic d...
Augmented Personalized Health: an explicit knowledge enhanced neurosymbolic d...Augmented Personalized Health: an explicit knowledge enhanced neurosymbolic d...
Augmented Personalized Health: an explicit knowledge enhanced neurosymbolic d...Amit Sheth
 
kHealth: Semantic Multi-sensory Mobile Approach to Personalized Asthma Care
kHealth: Semantic Multi-sensory Mobile Approach to Personalized Asthma CarekHealth: Semantic Multi-sensory Mobile Approach to Personalized Asthma Care
kHealth: Semantic Multi-sensory Mobile Approach to Personalized Asthma CareAmit Sheth
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health careAravindharamanan S
 
Augmented Personalized Health: using AI techniques on semantically integrated...
Augmented Personalized Health: using AI techniques on semantically integrated...Augmented Personalized Health: using AI techniques on semantically integrated...
Augmented Personalized Health: using AI techniques on semantically integrated...Amit Sheth
 

What's hot (20)

Trusted! Quest for data-driven and fair health solutions
Trusted! Quest for data-driven and fair health solutions Trusted! Quest for data-driven and fair health solutions
Trusted! Quest for data-driven and fair health solutions
 
Presenting health data to patients
Presenting health data to patientsPresenting health data to patients
Presenting health data to patients
 
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
Ontology-enabled Healthcare Applications exploiting Physical-Cyber-Social Big...
 
Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...
Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...
Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...
 
Big data for health
Big data for healthBig data for health
Big data for health
 
The Vision for Data @ the NIH
The Vision for Data @ the NIHThe Vision for Data @ the NIH
The Vision for Data @ the NIH
 
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...
 
Health IT, Ethics & Law for Pathologists: Perils or Promises? (March 1, 2019)
Health IT, Ethics & Law for Pathologists: Perils or Promises? (March 1, 2019)Health IT, Ethics & Law for Pathologists: Perils or Promises? (March 1, 2019)
Health IT, Ethics & Law for Pathologists: Perils or Promises? (March 1, 2019)
 
Evaluating a Potential Commercial Tool for Healthcare Application for People ...
Evaluating a Potential Commercial Tool for Healthcare Application for People ...Evaluating a Potential Commercial Tool for Healthcare Application for People ...
Evaluating a Potential Commercial Tool for Healthcare Application for People ...
 
A Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital EnterpriseA Successful Academic Medical Center Must be a Truly Digital Enterprise
A Successful Academic Medical Center Must be a Truly Digital Enterprise
 
Dov Greenbaum, "Avoiding Regulation in the Medical Internet of Things"
Dov Greenbaum, "Avoiding Regulation in the Medical Internet of Things"Dov Greenbaum, "Avoiding Regulation in the Medical Internet of Things"
Dov Greenbaum, "Avoiding Regulation in the Medical Internet of Things"
 
What's in WhatsApp for Radiologists?
What's in WhatsApp for Radiologists?What's in WhatsApp for Radiologists?
What's in WhatsApp for Radiologists?
 
Secure Data Sharing and Related Matters – An NIH View
Secure Data Sharing and Related Matters – An NIH ViewSecure Data Sharing and Related Matters – An NIH View
Secure Data Sharing and Related Matters – An NIH View
 
AI & Healthcare @ AIISC: May 2021 Snapshot
AI & Healthcare @ AIISC: May 2021 SnapshotAI & Healthcare @ AIISC: May 2021 Snapshot
AI & Healthcare @ AIISC: May 2021 Snapshot
 
Augmented Personalized Health: an explicit knowledge enhanced neurosymbolic d...
Augmented Personalized Health: an explicit knowledge enhanced neurosymbolic d...Augmented Personalized Health: an explicit knowledge enhanced neurosymbolic d...
Augmented Personalized Health: an explicit knowledge enhanced neurosymbolic d...
 
kHealth: Semantic Multi-sensory Mobile Approach to Personalized Asthma Care
kHealth: Semantic Multi-sensory Mobile Approach to Personalized Asthma CarekHealth: Semantic Multi-sensory Mobile Approach to Personalized Asthma Care
kHealth: Semantic Multi-sensory Mobile Approach to Personalized Asthma Care
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health care
 
kHealth Bariatrics
kHealth BariatricskHealth Bariatrics
kHealth Bariatrics
 
Augmented Personalized Health: using AI techniques on semantically integrated...
Augmented Personalized Health: using AI techniques on semantically integrated...Augmented Personalized Health: using AI techniques on semantically integrated...
Augmented Personalized Health: using AI techniques on semantically integrated...
 
k-BOT: Knowledge-driven Chatbot for Health @ CASY2020
k-BOT: Knowledge-driven Chatbot for Health @ CASY2020k-BOT: Knowledge-driven Chatbot for Health @ CASY2020
k-BOT: Knowledge-driven Chatbot for Health @ CASY2020
 

Similar to Researcher Dilemmas using Behavioral Big Data in Healthcare (INFORMS DMDA Workshop)

Behavioral Big Data & Healthcare Research
Behavioral Big Data & Healthcare ResearchBehavioral Big Data & Healthcare Research
Behavioral Big Data & Healthcare ResearchGalit Shmueli
 
Behavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS TaipeiBehavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS TaipeiGalit Shmueli
 
Sdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalSdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalkimlyman
 
Wake up Pharma and look into your Big data
Wake up Pharma and look into your Big data Wake up Pharma and look into your Big data
Wake up Pharma and look into your Big data Yigal Aviv
 
Improving health care outcomes with responsible data science
Improving health care outcomes with responsible data scienceImproving health care outcomes with responsible data science
Improving health care outcomes with responsible data scienceWessel Kraaij
 
Digital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDigital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDavid Lee Scher, MD
 
From personal health data to a personalized advice
From personal health data to a personalized adviceFrom personal health data to a personalized advice
From personal health data to a personalized adviceWessel Kraaij
 
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...Galit Shmueli
 
Person-generated health data: How can it help us to feel better?
Person-generated health data: How can it help us to feel better?Person-generated health data: How can it help us to feel better?
Person-generated health data: How can it help us to feel better?Kathleen Gray
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Philip Bourne
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineIda Sim
 
1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptxRahulTr22
 
The shared value of personal and population data
The shared value of personal and population dataThe shared value of personal and population data
The shared value of personal and population dataWessel Kraaij
 
Health Data Innovation (Wolfram Data Summit)
Health Data Innovation (Wolfram Data Summit)Health Data Innovation (Wolfram Data Summit)
Health Data Innovation (Wolfram Data Summit)Peter Speyer
 
Medinfo2015 workshop-adherence mangement-patient_driven-publicized
Medinfo2015 workshop-adherence mangement-patient_driven-publicizedMedinfo2015 workshop-adherence mangement-patient_driven-publicized
Medinfo2015 workshop-adherence mangement-patient_driven-publicizedPei-Yun Sabrina Hsueh
 
WSDM2015Tutorial.pdf
WSDM2015Tutorial.pdfWSDM2015Tutorial.pdf
WSDM2015Tutorial.pdfssuser2c7393
 

Similar to Researcher Dilemmas using Behavioral Big Data in Healthcare (INFORMS DMDA Workshop) (20)

Behavioral Big Data & Healthcare Research
Behavioral Big Data & Healthcare ResearchBehavioral Big Data & Healthcare Research
Behavioral Big Data & Healthcare Research
 
Behavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS TaipeiBehavioral Big Data & Healthcare Research: Talk at WiDS Taipei
Behavioral Big Data & Healthcare Research: Talk at WiDS Taipei
 
Sdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalSdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) final
 
Wake up Pharma and look into your Big data
Wake up Pharma and look into your Big data Wake up Pharma and look into your Big data
Wake up Pharma and look into your Big data
 
Improving health care outcomes with responsible data science
Improving health care outcomes with responsible data scienceImproving health care outcomes with responsible data science
Improving health care outcomes with responsible data science
 
Precision and Participatory Medicine - MEDINFO 2015 Panel on big data
Precision and Participatory Medicine - MEDINFO 2015 Panel on big dataPrecision and Participatory Medicine - MEDINFO 2015 Panel on big data
Precision and Participatory Medicine - MEDINFO 2015 Panel on big data
 
Digital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDigital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient Advocate
 
From personal health data to a personalized advice
From personal health data to a personalized adviceFrom personal health data to a personalized advice
From personal health data to a personalized advice
 
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
Research Using Behavioral Big Data: A Tour and Why Mechanical Engineers Shoul...
 
Person-generated health data: How can it help us to feel better?
Person-generated health data: How can it help us to feel better?Person-generated health data: How can it help us to feel better?
Person-generated health data: How can it help us to feel better?
 
Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?Data Science in Biomedicine - Where Are We Headed?
Data Science in Biomedicine - Where Are We Headed?
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based Medicine
 
Panel at AMIA 2013 Conference on big data - The Exposome and the quantified s...
Panel at AMIA 2013 Conference on big data - The Exposome and the quantified s...Panel at AMIA 2013 Conference on big data - The Exposome and the quantified s...
Panel at AMIA 2013 Conference on big data - The Exposome and the quantified s...
 
Day 1: Real-World Data Panel
Day 1: Real-World Data Panel Day 1: Real-World Data Panel
Day 1: Real-World Data Panel
 
1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx
 
The shared value of personal and population data
The shared value of personal and population dataThe shared value of personal and population data
The shared value of personal and population data
 
Health Data Innovation (Wolfram Data Summit)
Health Data Innovation (Wolfram Data Summit)Health Data Innovation (Wolfram Data Summit)
Health Data Innovation (Wolfram Data Summit)
 
Medinfo2015 workshop-adherence mangement-patient_driven-publicized
Medinfo2015 workshop-adherence mangement-patient_driven-publicizedMedinfo2015 workshop-adherence mangement-patient_driven-publicized
Medinfo2015 workshop-adherence mangement-patient_driven-publicized
 
Integrated health monitoring
Integrated health monitoringIntegrated health monitoring
Integrated health monitoring
 
WSDM2015Tutorial.pdf
WSDM2015Tutorial.pdfWSDM2015Tutorial.pdf
WSDM2015Tutorial.pdf
 

More from Galit Shmueli

“Improving” prediction of human behavior using behavior modification
“Improving” prediction of human behavior using behavior modification“Improving” prediction of human behavior using behavior modification
“Improving” prediction of human behavior using behavior modificationGalit Shmueli
 
Repurposing Classification & Regression Trees for Causal Research with High-D...
Repurposing Classification & Regression Trees for Causal Research with High-D...Repurposing Classification & Regression Trees for Causal Research with High-D...
Repurposing Classification & Regression Trees for Causal Research with High-D...Galit Shmueli
 
To Explain, To Predict, or To Describe?
To Explain, To Predict, or To Describe?To Explain, To Predict, or To Describe?
To Explain, To Predict, or To Describe?Galit Shmueli
 
Reinventing the Data Analytics Classroom
Reinventing the Data Analytics ClassroomReinventing the Data Analytics Classroom
Reinventing the Data Analytics ClassroomGalit Shmueli
 
Repurposing predictive tools for causal research
Repurposing predictive tools for causal researchRepurposing predictive tools for causal research
Repurposing predictive tools for causal researchGalit Shmueli
 
Statistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and PredictingStatistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and PredictingGalit Shmueli
 
Workshop on Information Quality
Workshop on Information QualityWorkshop on Information Quality
Workshop on Information QualityGalit Shmueli
 
Behavioral Big Data: Why Quality Engineers Should Care
Behavioral Big Data: Why Quality Engineers Should CareBehavioral Big Data: Why Quality Engineers Should Care
Behavioral Big Data: Why Quality Engineers Should CareGalit Shmueli
 
Statistical Modeling in 3D: Explaining, Predicting, Describing
Statistical Modeling in 3D: Explaining, Predicting, DescribingStatistical Modeling in 3D: Explaining, Predicting, Describing
Statistical Modeling in 3D: Explaining, Predicting, DescribingGalit Shmueli
 
Prediction-based Model Selection in PLS-PM
Prediction-based Model Selection in PLS-PMPrediction-based Model Selection in PLS-PM
Prediction-based Model Selection in PLS-PMGalit Shmueli
 
When Prediction Met PLS: What We learned in 3 Years of Marriage
When Prediction Met PLS: What We learned in 3 Years of MarriageWhen Prediction Met PLS: What We learned in 3 Years of Marriage
When Prediction Met PLS: What We learned in 3 Years of MarriageGalit Shmueli
 
A Tree-Based Approach for Addressing Self-selection in Impact Studies with B...
A Tree-Based Approach  for Addressing Self-selection in Impact Studies with B...A Tree-Based Approach  for Addressing Self-selection in Impact Studies with B...
A Tree-Based Approach for Addressing Self-selection in Impact Studies with B...Galit Shmueli
 
A Tree-Based Approach for Addressing Self-Selection in Impact Studies with Bi...
A Tree-Based Approach for Addressing Self-Selection in Impact Studies with Bi...A Tree-Based Approach for Addressing Self-Selection in Impact Studies with Bi...
A Tree-Based Approach for Addressing Self-Selection in Impact Studies with Bi...Galit Shmueli
 
Research Using Behavioral Big Data (BBD)
Research Using Behavioral Big Data (BBD)Research Using Behavioral Big Data (BBD)
Research Using Behavioral Big Data (BBD)Galit Shmueli
 
Analyzing Behavioral Big Data: Methodological, Practical, Ethical & Moral Issues
Analyzing Behavioral Big Data: Methodological, Practical, Ethical & Moral IssuesAnalyzing Behavioral Big Data: Methodological, Practical, Ethical & Moral Issues
Analyzing Behavioral Big Data: Methodological, Practical, Ethical & Moral IssuesGalit Shmueli
 
Big Data - To Explain or To Predict? Talk at U Toronto's Rotman School of Ma...
Big Data - To Explain or To Predict?  Talk at U Toronto's Rotman School of Ma...Big Data - To Explain or To Predict?  Talk at U Toronto's Rotman School of Ma...
Big Data - To Explain or To Predict? Talk at U Toronto's Rotman School of Ma...Galit Shmueli
 
Information Quality: A Framework for Evaluating Empirical Studies
Information Quality: A Framework for Evaluating Empirical Studies Information Quality: A Framework for Evaluating Empirical Studies
Information Quality: A Framework for Evaluating Empirical Studies Galit Shmueli
 
E.SUN Academic Award presentation (Jan 2016)
E.SUN Academic Award presentation (Jan 2016)E.SUN Academic Award presentation (Jan 2016)
E.SUN Academic Award presentation (Jan 2016)Galit Shmueli
 
Big Data & Analytics in the Digital Creative Industries
Big Data & Analytics in the Digital Creative IndustriesBig Data & Analytics in the Digital Creative Industries
Big Data & Analytics in the Digital Creative IndustriesGalit Shmueli
 
On Information Quality: Can Your Data Do The Job? (SCECR 2015 Keynote)
On Information Quality: Can Your Data Do The Job? (SCECR 2015 Keynote)On Information Quality: Can Your Data Do The Job? (SCECR 2015 Keynote)
On Information Quality: Can Your Data Do The Job? (SCECR 2015 Keynote)Galit Shmueli
 

Researcher Dilemmas using Behavioral Big Data in Healthcare (INFORMS DMDA Workshop)

  • 1. Researcher Dilemmas using Behavioral Big Data in Healthcare INFORMS Workshop on Data Mining & Decision Analysis Houston, TX, Oct 21, 2017 Galit Shmueli 徐茉莉 Institute of Service Science
  • 2. What is Behavioral Big Data (BBD)? A special type of Big Data • Behavioral: people’s measurable “everyday” behavior, interactions, self-reported opinions, thoughts, feelings • Human and social aspects: intentions, deception, emotion, reciprocation, herding,… When aware of data collection -> modified behavior (legal risks, embarrassment, unwanted solicitation)
  • 3. BBD vs. Inanimate Big Data. Behavioral Big Data: Researcher, Human Subjects, Research Question. Inanimate Big Data: Researcher, Research Question. Human subjects: 1. Aware, ongoing interaction with the BBD - “contaminate” BBD with intention, deception, emotion, herding… 2. Can be harmed by BBD
  • 4. Figure 1: The types of physiological data points and the wearable sensors under development or on the market to monitor them. Elenko, Underwood & Zohar (2015), “Defining Digital Medicine”, Nature Biotechnology 33, 456-461 Physiological Big Data Human Subjects
  • 5. BBD vs. Physiological Big Data. Physiological Big Data: • Physical/bio measurements • Data collection timing often set by medical system • Clinical trials: awareness & vested interest. BBD: • People’s daily actions, interactions, self-reported feelings, opinions, thoughts (UGC) • Data generation timing often chosen by user • Experiments: users often unaware; goal not always in user’s interest. Different research methods in life sciences and behavioral sciences: • Measurement instruments • Models (latent variable models, social network analysis) • Human subjects risks
  • 6. “Behavioral Health” Data vs. BBD • Behaviors: substance abuse & mental health • Population: patients with mental illness / substance abuse • Specific (defined) behavior of patient • “Big”? www.carolinashealthcare.org/medical-services/prevention-wellness/behavioral-health
  • 8. Hospital Data on Patients, Staff, Assets. Patients: personal info; medical history (visits, tests, medication, hospitalization...); scheduled events, billing. Physicians: scheduled + actual appointments, procedures, prescriptions,…; entries of patient info/data. Nurses: location, work hours,… Pharmacy staff: speed of service, quality of service. Lab staff: speed of service, quality of service. Other staff: finance/accounting, cleaning, receptionists, volunteers, food court… Data Collection Technologies: • Medical devices • HIT systems (EHR, HR for Health Info System) --- Smart Hospital • Cameras • Sensors • GPS • IoT
  • 9. Typical Research Fields using Hospital Data. Operations Researchers and Industrial Engineers, for: Hospital Management and Operations (staffing, scheduling,…). Medical/Healthcare Researchers & Clinicians, for: Improved Medical Treatment (safety, effectiveness,…). Information Systems Researchers, for: Improved Design & Use of Medical IS (value of IS, effectiveness, standardization,…)
  • 11. Behavioral big data also on… Interactions between: Patients – doctors/nurses; Doctors – other doctors; Patients – other patients; Patient family – hospital staff; Patients – social network “friends”; ...
  • 12. Health-related BBD: Online • Medical/health websites • Online forums • Social networks • Search engines Info voluntarily entered by users: personal details, photos, comments, messages, search terms, likes, payment information, connections with “friends” Passive footprints: duration on the website, pages browsed, sequence, referring website, Internet browser, operating system, location, IP address
  • 14. Health-unrelated BBD. “Some hospitals are collecting new information from patients directly, while others have sought data from companies that sell consumer and financial information, or federal agencies that provide statistics on poverty, housing density and unemployment.” The big obstacle: access to the data. Doctors and nurses have limited time to collect new data, and patients bombarded with questions about their lives may suffer “interview fatigue.”
  • 15. Research Using New Medical BBD: Challenges. Diagram elements: Behavioral Big Data, Researcher, Human Subjects, Research Question. Different (conflicting) goals: scientific vs. clinical vs. commercial; explain vs. predict. Generalization challenges: unit of analysis vs. unit of measurement; under/over-coverage; average effect (ATE) vs. individual effect. New ethical challenges: new risks (privacy, liability, security, HIPAA compliance). Challenges: acquire + analyze data; technical expertise. Data contaminated by: users (self-selection, spill-over, knowledge of allocation, network); company algorithms. New modes of connection & information (social networks, forums, IoT)
  • 16. Sample Behavioral Healthcare-Related BBD Studies: “Vocal Minority and Silent Majority: How Do Online Ratings Reflect Population Perception of Quality?”, Gao et al. (MISQ, 2015); “Outcomes matter: estimating pre-transplant survival rates of kidney-transplant patients using simulator-based propensity scores”, Yahav & Shmueli (Annals of Operations Research, 2014); “Emotional Contagion in Social Networks”, Kramer et al. (PNAS, 2014); “Detecting influenza epidemics using search engine query data”, Ginsberg et al. (Nature, 2009)
  • 17. Emotional Contagion in Social Networks, Kramer et al. (2014), Proceedings of the National Academy of Sciences • Can emotional states be transferred to others via emotional contagion? • BBD from large-scale experiment run by FB, manipulating users’ exposure level to emotional expressions in their Facebook News Feed • No IRB: “[The work] was consistent with Facebook’s Data Use Policy, to which all users agree prior to creating an account on Facebook, constituting informed consent for this research.” • PNAS editorial Expression of Concern • Varied response from public, academia, press, ethicists, corporates
  • 20. Detecting influenza epidemics using search engine query data, Ginsberg et al. (2009), Nature • “Up-to-date influenza estimates may enable public health officials and health professionals to better respond to seasonal epidemics” • Researchers from Google and CDC • BBD: automated search results for 50M keywords on Google.com (2003-2007). For each query, collected {query text, IP address} • Analysis: fit 450M different models, correlating each query text with CDC data; combined the 45 queries with highest correlation
  • 21. Challenge: Acquire data. Researchers: epidemiologists + data science academics. Dalton et al. (2016), “Flutracking weekly online community survey of influenza-like illness annual report, 2015”, Communicable Diseases Intelligence Quarterly Report
  • 22. • The algorithm detects “flu” or “winter” • Persistent over-estimation • Performs worse than 3-week-old lagged CDC data • The 45 terms used were never released • Changes made by Google to its search algorithm, to display potential diagnoses and recommend searches for treatment (more advertising) -> increased searches • Lazer et al. recommend combining/calibrating Google Flu Trends (GFT) with CDC data
  • 25. New healthcare BBD offers new research opportunities: Telemedicine / Telehealth; Remote Patient Monitoring; mHealth / eHealth; Health-“unrelated” behavior; Health-related behavior
  • 26. … and new challenges. Diagram elements: Behavioral Big Data, Researcher, Human Subjects, Research Question. Different (conflicting) goals: scientific vs. clinical vs. commercial; explain vs. predict. Generalization challenges: unit of analysis vs. unit of measurement; under/over-coverage; average effect (ATE) vs. individual effect. New ethical challenges: new risks (privacy, liability, security, HIPAA compliance). Challenges: acquire + analyze data; technical expertise. Data contaminated by: users (self-selection, spill-over, knowledge of allocation, network); company algorithms. New modes of connection & information (social networks, forums, IoT)
  • 27. Analytics Humanity Responsibility Galit Shmueli 徐茉莉 Institute of Service Science
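
The analysis step on slide 20 (correlate each query with CDC data, then combine the highest-correlation queries) can be sketched roughly as follows. This is a minimal illustration, not the study's code: the function names, the use of plain Pearson correlation as the ranking score, the simple summation of selected queries, and the toy weekly numbers are all assumptions made for the example.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0  # a constant series carries no signal
    return cov / (sx * sy)


def top_k_queries(query_series, cdc_series, k):
    """Rank queries by correlation with the CDC series; keep the k best."""
    scored = {q: pearson(s, cdc_series) for q, s in query_series.items()}
    return sorted(scored, key=scored.get, reverse=True)[:k]


def combined_signal(query_series, selected):
    """Sum the selected queries' weekly values into one predictor series."""
    return [sum(week) for week in zip(*(query_series[q] for q in selected))]


# Toy, made-up weekly data (hypothetical; not taken from the study).
cdc_ili = [1.0, 2.0, 4.0, 3.0, 1.5]
queries = {
    "flu symptoms": [0.10, 0.21, 0.39, 0.31, 0.14],
    "fever remedy": [0.05, 0.09, 0.22, 0.15, 0.08],
    "basketball scores": [0.30, 0.10, 0.20, 0.40, 0.25],
}

selected = top_k_queries(queries, cdc_ili, k=2)
signal = combined_signal(queries, selected)
print(selected, signal)
```

Selecting on correlation alone is what makes this kind of pipeline fragile: a query that merely tracks the season can rank as highly as one that tracks illness.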

Editor's Notes

  1. Inanimate: Medical devices and drug manufacturing (quality control, safety); laboratory testing
  2. www.nature.com/nbt/journal/v33/n5/fig_tab/nbt.3222_F1.html
  3. UGC = User Generated Content
  4. https://www.wsj.com/articles/doctors-dig-for-more-data-about-patients-1474855681
  5. “patients as well as medical staff will be communicating in a non-private environment.  It is very important to understand, monitor and control your own content for its privacy implications.  More dangerous and needing control will be the reach of patient-to-patient identification and communication.” - http://www.medicalwebtimes.com/thetimes/medical-headlines/top-10-pros-cons-for-medical-practices-using-social-networking-web-sites/  Kayhan Parsi, JD, PhD, and Nanette Elster, JD, MPH. Why Can't We Be Friends? A Case-Based Analysis of Ethical Issues with Social Media in Health Care. AMA Journal of Ethics, November 2015 DOI: 10.1001/journalofethics.2015.17.11.peer1-1511
  6. HHS proposes new IRB exemption criteria for publicly available data (or even purchased data). Council for Big Data, Ethics & Society’s letter: “these criteria for exclusion focus on the status of the dataset… not the content of the dataset nor what will be done with the dataset, which are more accurate criteria for determining the risk profile of the proposed research”
  7. “patients as well as medical staff will be communicating in a non-private environment.  It is very important to understand, monitor and control your own content for its privacy implications.  More dangerous and needing control will be the reach of patient-to-patient identification and communication.” - http://www.medicalwebtimes.com/thetimes/medical-headlines/top-10-pros-cons-for-medical-practices-using-social-networking-web-sites/  Kayhan Parsi, JD, PhD, and Nanette Elster, JD, MPH. Why Can't We Be Friends? A Case-Based Analysis of Ethical Issues with Social Media in Health Care. AMA Journal of Ethics, November 2015 DOI: 10.1001/journalofethics.2015.17.11.peer1-1511
  8. https://academic.oup.com/cid/article-lookup/doi/10.1093/cid/ciu647
  9. How Does Flutracking work? It takes only 10 - 15 seconds each week. We ask if you have had fever or cough in the last week. This will help us find ways to detect both seasonal influenza and hopefully pandemic influenza and other diseases so we can better protect the community from epidemics. FluNearYou.org
  10. https://academic.oup.com/cid/article-lookup/doi/10.1093/cid/ciu647
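
Slide 22 notes that the search-based estimates performed worse than simply reusing 3-week-old CDC data. A comparison of that kind can be sketched as below, assuming mean absolute error as the yardstick. The helper names and every number are made up for illustration (the hypothetical nowcast is deliberately built to over-estimate) and say nothing about the actual magnitudes reported.

```python
def mean_abs_error(pred, actual):
    """Average absolute difference between paired predictions and actuals."""
    pairs = list(zip(pred, actual))
    return sum(abs(p - a) for p, a in pairs) / len(pairs)


def lagged_baseline(series, lag):
    """Predict week t with the value observed `lag` weeks earlier."""
    return series[:-lag]


# Hypothetical weekly values; not real CDC or search-based figures.
cdc = [1.0, 1.2, 1.5, 2.0, 2.6, 3.0, 2.8, 2.2]
nowcast = [1.6, 1.9, 2.3, 3.1, 3.9, 4.4, 4.1, 3.3]  # over-estimates on purpose

lag = 3
actual = cdc[lag:]  # weeks for which a lagged value exists

baseline_mae = mean_abs_error(lagged_baseline(cdc, lag), actual)
nowcast_mae = mean_abs_error(nowcast[lag:], actual)
print(baseline_mae, nowcast_mae)
```

A model that cannot beat this kind of naive baseline adds little on its own, which is one way to read the recommendation to calibrate the search signal against CDC data rather than replace it.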