Social media and its use
    in disease surveillance


March 2010
✤   How do we improve disease surveillance?

✤   Can social media (e.g. Twitter) be effectively
    used to monitor disease outbreaks?
Tweets: disease reports

✤   Omg.. The never-ending flu+sore throat.. ☹ bleh.. ☹

✤   Stomach flu. Urgh.

✤   i love puking... f@#k you flu

✤   Having a sore throat,sucks.Having flu,sucks even
    MORE.DAMMIT!

✤   Feeling dizzy/ feverish ever since that class at the gym!
    overexertion or the flu??
Tweets: non-disease reports

✤   Study finds H1N1 flu in pregnancy is critical
    risk - Reuters - http://bit.ly/bLiLnz
✤   This March Madness turns out to be the flu!
✤   Smiling is infectious, You can catch it like the
    flu. Someone smiled at me today, And I
    started smiling too.
We need Natural Language
Processing (NLP)


✤   We need an NLP engine in order to process
    tweets:
✤   Tweet → NLP Engine → It's the flu!
Maybe we need NLP + Ontologies


✤   Do we just search for simple keywords?
✤   An ontology can provide us with organized
    concepts relevant to a domain (e.g. health,
    biomedicine)
✤   How about processing natural language to match
    concepts organized in an ontology?
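
To see why plain keyword search falls short, here is a minimal Python sketch (not from the slides) that flags any tweet containing the word "flu". The function name `mentions_flu` is hypothetical; the sample tweets are taken from the two earlier slides.

```python
import re

# Sample tweets from the slides: the first two are listed as disease reports,
# the last two as non-disease reports.
TWEETS = [
    "Stomach flu. Urgh.",
    "Having a sore throat,sucks.Having flu,sucks even MORE.DAMMIT!",
    "This March Madness turns out to be the flu!",
    "Study finds H1N1 flu in pregnancy is critical risk - Reuters - http://bit.ly/bLiLnz",
]

def mentions_flu(tweet: str) -> bool:
    """Naive keyword match: True if the word 'flu' appears anywhere in the tweet."""
    return re.search(r"\bflu\b", tweet, flags=re.IGNORECASE) is not None

flagged = [t for t in TWEETS if mentions_flu(t)]
print(len(flagged), "of", len(TWEETS), "tweets flagged")
```

All four tweets are flagged, including the two that are not disease reports, so a keyword hit alone cannot separate the two groups.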
Ontologies help answer these
questions

✤   How do we know if a user is referring to a
    symptom or a disease?
✤   We seem to need a set of keywords. Where do we get
    this set of symptoms and disease names?
✤   How do we link references to one or more
    symptoms to a specific disease?
The UMLS Ontology

✤   A comprehensive thesaurus and ontology of
    biomedical concepts
✤   Facilitates development of computer systems that
    behave as if they "understand" the meaning of the
    language of biomedicine and health.
✤   Integrates 2+ million names for ~900k concepts
    from 60+ families of biomedical vocabularies, and
    12 million relations among these concepts.
UMLS & MetaMap


✤   MetaMap is a tool that, given an arbitrary
    piece of text, finds and returns the relevant
    concepts available in the UMLS Ontology
✤   MetaMap is a software interface for querying
    the “MetaThesaurus” and the “Semantic
    Network”, both components of UMLS
Concept mapping with MetaMap
✤   Using MetaMap to query the
    MetaThesaurus, we can map the following
    text strings to the concept "Atrial
    Fibrillation"
     ✤ Atrial fibrillation!
     ✤ AF!
     ✤ AFib!
     ✤ Atrial fibrillation (disorder)
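
The idea of mapping several surface strings to one concept can be illustrated with a toy lookup table. This is only a sketch of the idea, not how MetaMap works internally; the `SYNONYMS` table and the `map_to_concept` function are hypothetical, and the strings come from the slide above.

```python
from typing import Optional

# Hypothetical, hand-written synonym table; the real thesaurus is far larger.
SYNONYMS = {
    "atrial fibrillation": "Atrial Fibrillation",
    "af": "Atrial Fibrillation",
    "afib": "Atrial Fibrillation",
    "atrial fibrillation (disorder)": "Atrial Fibrillation",
}

def map_to_concept(text: str) -> Optional[str]:
    """Normalize case and surrounding whitespace, then look the string up."""
    return SYNONYMS.get(text.strip().lower())

for s in ["Atrial fibrillation", "AF", "AFib", "Atrial fibrillation (disorder)", "flu"]:
    print(repr(s), "->", map_to_concept(s))
```

Strings that are not in the table, such as "flu" here, simply return `None`.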
✤   But who actually tweets “atrial fibrillation”?
“Having a sore throat, sucks.
Having flu, sucks even MORE”

✤   Matches:
    ✤   SORETHROAT (Sore Throat) [Sign or
        Symptom]
    ✤   Flu (Influenza) [Disease or Syndrome]
    ✤   Sucking [Physiologic Function]
“i love puking... damn you flu”


✤   Matches:
    ✤   I (Iodides) [Inorganic Chemical]
    ✤   Love [Mental Process]
    ✤   Flu (Influenza) [Disease or Syndrome]
“Feeling dizzy/ feverish ever since that class at
the gym! overexertion or the flu??”

✤   Matches:
    ✤   Feeling dizzy [Sign or Symptom]
    ✤   Feverish (Fever) [Finding]
    ✤   Overexertion (Exhaustion due to excessive
        exertion) [Injury or Poisoning]
    ✤   Flu (Influenza) [Disease or Syndrome]
“Smiling is infectious, u can catch it like the
flu; someone smiled at me today, and I started
smiling too”
✤   Matches:

    ✤   Smiling [Social Behavior]

    ✤   Infection [Disease or Syndrome]

    ✤   Catch (Catch - Finding of sensory dimension of pain)
        [Sign or Symptom]

    ✤   Flu (Influenza) [Disease or Syndrome]

    ✤   Today [Temporal Concept]
✤   Not the best results, but it’s a start...
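
One way to make use of these matches is to keep only the semantic types that are plausibly relevant to a case report. The sketch below assumes the matches have already been extracted into (concept, semantic type) pairs, as listed on the slides; the `RELEVANT_TYPES` set and the `relevant_concepts` function are illustrative choices, not part of MetaMap.

```python
# Matches copied from the "Smiling is infectious..." slide.
SMILING_MATCHES = [
    ("Smiling", "Social Behavior"),
    ("Infection", "Disease or Syndrome"),
    ("Catch", "Sign or Symptom"),
    ("Influenza", "Disease or Syndrome"),
    ("Today", "Temporal Concept"),
]

# Assumption: these are the semantic types worth keeping for disease surveillance.
RELEVANT_TYPES = {"Sign or Symptom", "Disease or Syndrome", "Finding"}

def relevant_concepts(matches):
    """Return the concepts whose semantic type is in RELEVANT_TYPES, in order."""
    return [concept for concept, sem_type in matches if sem_type in RELEVANT_TYPES]

print(relevant_concepts(SMILING_MATCHES))
```

Filtering removes noise such as "Today", but the joke tweet still yields three medical concepts, so type filtering alone cannot tell a case report from a figure of speech.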
Using MetaMap

✤   Free of Charge!

✤   MetaMap Transfer (MMTx) is a Java-based distributable
    version of the MetaMap program

✤   Requires 7GB disk space (uncompressed) and at least 1GB
    of RAM (2GB recommended)

✤   “MetaMap is not an end user product. Users will need a
    moderate amount of programming knowledge to use
    MMTx effectively.” - from UMLS website
We identified tweets that mention
a concept...SO WHAT?


✤   We can't assume it's a case report!
✤   How do we get around this?
✤   Are we done here?
Supervised learning to improve
the results?


✤   What if we use machine learning?
✤   Supervised learning is a machine learning
    technique for inferring a function from
    labeled training data
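
As a rough illustration of supervised learning on concept matches, the sketch below trains a tiny perceptron in plain Python. The perceptron is a stand-in chosen for brevity, not the method used in this deck; the function names are hypothetical, and the four training examples reuse the concept matches and the report/non-report grouping shown on the earlier slides.

```python
from collections import defaultdict

# Training data: concept matches from the slides, labelled with whether the
# slides list the tweet as a disease report. Four examples is only a toy set.
TRAINING = [
    ({"Sore Throat", "Influenza", "Sucking"}, True),
    ({"Iodides", "Love", "Influenza"}, True),
    ({"Feeling dizzy", "Fever", "Overexertion", "Influenza"}, True),
    ({"Smiling", "Infection", "Catch", "Influenza", "Today"}, False),
]

def predict(weights, bias, concepts):
    """Classify as a disease report when the summed concept weights are positive."""
    return bias + sum(weights[c] for c in concepts) > 0

def train(examples, epochs=10):
    """Perceptron rule: nudge weights toward the label after each mistake."""
    weights, bias = defaultdict(int), 0
    for _ in range(epochs):
        mistakes = 0
        for concepts, label in examples:
            if predict(weights, bias, concepts) != label:
                step = 1 if label else -1
                for c in concepts:
                    weights[c] += step
                bias += step
                mistakes += 1
        if mistakes == 0:
            break  # every training example is now classified correctly
    return weights, bias

weights, bias = train(TRAINING)
print(all(predict(weights, bias, c) == y for c, y in TRAINING))
```

With so few examples the learned function only memorizes the training set; a usable classifier would need many more labeled tweets.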
Is it feasible?

✤   Weka is a collection of machine learning algorithms for data
    mining tasks.

✤   Algorithms can be applied directly to a dataset or called from
    your own Java code.

✤   Input: dataset of concept matches; Output: Classifier Java
    Class

✤   This automatically generated Java class can easily be used
    to answer whether a tweet matching X and Y medical concepts is
    or is not a disease report
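
The "dataset of concept matches" could, for example, be written in ARFF, Weka's native dataset format. The sketch below is one possible encoding, assuming one 0/1 attribute per concept plus a yes/no class attribute; the `to_arff` helper, the relation name `tweets`, and the attribute name `disease_report` are hypothetical.

```python
def to_arff(relation, examples):
    """Render (concept set, label) pairs as ARFF text with one 0/1 attribute per concept."""
    concepts = sorted({c for matched, _ in examples for c in matched})
    lines = [f"@relation {relation}", ""]
    # Attribute names containing spaces must be quoted in ARFF.
    lines += [f"@attribute '{c}' {{0,1}}" for c in concepts]
    lines += ["@attribute disease_report {yes,no}", "", "@data"]
    for matched, label in examples:
        row = ["1" if c in matched else "0" for c in concepts]
        lines.append(",".join(row + ["yes" if label else "no"]))
    return "\n".join(lines) + "\n"

# Two labelled examples built from the concept matches shown on the slides.
EXAMPLES = [
    ({"Sore Throat", "Influenza", "Sucking"}, True),
    ({"Smiling", "Infection", "Catch", "Influenza", "Today"}, False),
]

arff_text = to_arff("tweets", EXAMPLES)
print(arff_text)
```

Each data row lists the concept indicators in alphabetical order of concept name, followed by the class label.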
Processing a tweet: overview

✤   Get Tweet
✤   Process tweet using MetaMap
✤   Get matching concepts from MetaMap
✤   Feed the matches to the Classifier Java Class
✤   Get a true or false answer to “is it a disease
    report?”
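
The steps above can be sketched as a small pipeline. Everything here is a stand-in: `fake_extract` and `fake_classify` are hypothetical placeholders for the MetaMap call and the generated classifier, included only so the example runs on its own.

```python
from typing import Callable, Set

def process_tweet(
    tweet: str,
    extract_concepts: Callable[[str], Set[str]],
    classify: Callable[[Set[str]], bool],
) -> bool:
    """Run one tweet through concept extraction, then classification."""
    concepts = extract_concepts(tweet)
    return classify(concepts)

# Placeholder for the concept-mapping step: a tiny phrase-to-concept table.
def fake_extract(tweet: str) -> Set[str]:
    vocabulary = {"flu": "Influenza", "sore throat": "Sore Throat", "smiling": "Smiling"}
    text = tweet.lower()
    return {concept for phrase, concept in vocabulary.items() if phrase in text}

# Placeholder for the trained classifier: a hand-written rule.
def fake_classify(concepts: Set[str]) -> bool:
    return "Influenza" in concepts and "Smiling" not in concepts

report = "Having a sore throat,sucks.Having flu,sucks even MORE.DAMMIT!"
joke = "Smiling is infectious, You can catch it like the flu."
print(process_tweet(report, fake_extract, fake_classify))
print(process_tweet(joke, fake_extract, fake_classify))
```

Passing the two stages in as functions keeps the pipeline independent of how concepts are extracted or which classifier is plugged in.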


